Search in sources :

Example 1 with CompactRandomCutTreeState

use of com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState in project random-cut-forest-by-aws by aws.

the class RandomCutForestMapper method singlePrecisionForest.

public RandomCutForest singlePrecisionForest(RandomCutForest.Builder<?> builder, RandomCutForestState state, IPointStore<float[]> extPointStore, List<ITree<Integer, float[]>> extTrees, List<IStreamSampler<Integer>> extSamplers) {
    checkArgument(builder != null, "builder cannot be null");
    checkArgument(extTrees == null || extTrees.size() == state.getNumberOfTrees(), "incorrect number of trees");
    checkArgument(extSamplers == null || extSamplers.size() == state.getNumberOfTrees(), "incorrect number of samplers");
    checkArgument(extSamplers != null | state.isSaveSamplerStateEnabled(), " need samplers ");
    checkArgument(extPointStore != null || state.isSaveCoordinatorStateEnabled(), " need coordinator state ");
    Random random = builder.getRandom();
    ComponentList<Integer, float[]> components = new ComponentList<>();
    CompactRandomCutTreeContext context = new CompactRandomCutTreeContext();
    IPointStore<float[]> pointStore = (extPointStore == null) ? new PointStoreMapper().toModel(state.getPointStoreState()) : extPointStore;
    PointStoreCoordinator<float[]> coordinator = new PointStoreCoordinator<>(pointStore);
    coordinator.setTotalUpdates(state.getTotalUpdates());
    context.setPointStore(pointStore);
    context.setMaxSize(state.getSampleSize());
    RandomCutTreeMapper treeMapper = new RandomCutTreeMapper();
    List<CompactRandomCutTreeState> treeStates = state.isSaveTreeStateEnabled() ? state.getCompactRandomCutTreeStates() : null;
    CompactSamplerMapper samplerMapper = new CompactSamplerMapper();
    List<CompactSamplerState> samplerStates = state.isSaveSamplerStateEnabled() ? state.getCompactSamplerStates() : null;
    for (int i = 0; i < state.getNumberOfTrees(); i++) {
        IStreamSampler<Integer> sampler = (extSamplers != null) ? extSamplers.get(i) : samplerMapper.toModel(samplerStates.get(i), random.nextLong());
        ITree<Integer, float[]> tree;
        if (extTrees != null) {
            tree = extTrees.get(i);
        } else if (treeStates != null) {
            tree = treeMapper.toModel(treeStates.get(i), context, random.nextLong());
            sampler.getSample().forEach(s -> tree.addPoint(s.getValue(), s.getSequenceIndex()));
            tree.setConfig(Config.BOUNDING_BOX_CACHE_FRACTION, treeStates.get(i).getBoundingBoxCacheFraction());
        } else {
            // using boundingBoxCahce for the new tree
            tree = new RandomCutTree.Builder().capacity(state.getSampleSize()).randomSeed(random.nextLong()).pointStoreView(pointStore).boundingBoxCacheFraction(state.getBoundingBoxCacheFraction()).centerOfMassEnabled(state.isCenterOfMassEnabled()).storeSequenceIndexesEnabled(state.isStoreSequenceIndexesEnabled()).build();
            sampler.getSample().forEach(s -> tree.addPoint(s.getValue(), s.getSequenceIndex()));
        }
        components.add(new SamplerPlusTree<>(sampler, tree));
    }
    builder.precision(Precision.FLOAT_32);
    return new RandomCutForest(builder, coordinator, components, random);
}
Also used : CommonUtils.checkNotNull(com.amazon.randomcutforest.CommonUtils.checkNotNull) Setter(lombok.Setter) Getter(lombok.Getter) Precision(com.amazon.randomcutforest.config.Precision) CompactSampler(com.amazon.randomcutforest.sampler.CompactSampler) Random(java.util.Random) SamplerPlusTree(com.amazon.randomcutforest.executor.SamplerPlusTree) RandomCutTree(com.amazon.randomcutforest.tree.RandomCutTree) ArrayList(java.util.ArrayList) PointStore(com.amazon.randomcutforest.store.PointStore) Weighted(com.amazon.randomcutforest.sampler.Weighted) Config(com.amazon.randomcutforest.config.Config) PointStoreMapper(com.amazon.randomcutforest.state.store.PointStoreMapper) IPointStore(com.amazon.randomcutforest.store.IPointStore) ComponentList(com.amazon.randomcutforest.ComponentList) PointStoreCoordinator(com.amazon.randomcutforest.executor.PointStoreCoordinator) IComponentModel(com.amazon.randomcutforest.IComponentModel) CompactRandomCutTreeContext(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeContext) CompactSamplerState(com.amazon.randomcutforest.state.sampler.CompactSamplerState) CommonUtils.checkArgument(com.amazon.randomcutforest.CommonUtils.checkArgument) PointStoreState(com.amazon.randomcutforest.state.store.PointStoreState) Collectors(java.util.stream.Collectors) RandomCutForest(com.amazon.randomcutforest.RandomCutForest) ITree(com.amazon.randomcutforest.tree.ITree) List(java.util.List) RandomCutTreeMapper(com.amazon.randomcutforest.state.tree.RandomCutTreeMapper) CompactRandomCutTreeState(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState) CompactSamplerMapper(com.amazon.randomcutforest.state.sampler.CompactSamplerMapper) IStreamSampler(com.amazon.randomcutforest.sampler.IStreamSampler) RandomCutTree(com.amazon.randomcutforest.tree.RandomCutTree) CompactRandomCutTreeContext(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeContext) CompactSamplerMapper(com.amazon.randomcutforest.state.sampler.CompactSamplerMapper) RandomCutForest(com.amazon.randomcutforest.RandomCutForest) ComponentList(com.amazon.randomcutforest.ComponentList) CompactRandomCutTreeState(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState) PointStoreCoordinator(com.amazon.randomcutforest.executor.PointStoreCoordinator) RandomCutTreeMapper(com.amazon.randomcutforest.state.tree.RandomCutTreeMapper) PointStoreMapper(com.amazon.randomcutforest.state.store.PointStoreMapper) CompactSamplerState(com.amazon.randomcutforest.state.sampler.CompactSamplerState) Random(java.util.Random)

Example 2 with CompactRandomCutTreeState

use of com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState in project random-cut-forest-by-aws by aws.

the class RandomCutForestMapper method toState.

/**
 * Create a {@link RandomCutForestState} object representing the state of the
 * given forest. If the forest is compact and the {@code saveTreeState} flag is
 * set to true, then structure of the trees in the forest will be included in
 * the state object. If the flag is set to false, then the state object will
 * only contain the sampler data for each tree. If the
 * {@code saveExecutorContext} is true, then the executor context will be
 * included in the state object.
 *
 * @param forest A Random Cut Forest whose state we want to capture.
 * @return a {@link RandomCutForestState} object representing the state of the
 *         given forest.
 * @throws IllegalArgumentException if the {@code saveTreeState} flag is true
 *                                  and the forest is not compact.
 */
@Override
public RandomCutForestState toState(RandomCutForest forest) {
    if (saveTreeStateEnabled) {
        checkArgument(forest.isCompact(), "tree state cannot be saved for noncompact forests");
    }
    RandomCutForestState state = new RandomCutForestState();
    state.setNumberOfTrees(forest.getNumberOfTrees());
    state.setDimensions(forest.getDimensions());
    state.setTimeDecay(forest.getTimeDecay());
    state.setSampleSize(forest.getSampleSize());
    state.setShingleSize(forest.getShingleSize());
    state.setCenterOfMassEnabled(forest.isCenterOfMassEnabled());
    state.setOutputAfter(forest.getOutputAfter());
    state.setStoreSequenceIndexesEnabled(forest.isStoreSequenceIndexesEnabled());
    state.setTotalUpdates(forest.getTotalUpdates());
    state.setCompact(forest.isCompact());
    state.setInternalShinglingEnabled(forest.isInternalShinglingEnabled());
    state.setBoundingBoxCacheFraction(forest.getBoundingBoxCacheFraction());
    state.setSaveSamplerStateEnabled(saveSamplerStateEnabled);
    state.setSaveTreeStateEnabled(saveTreeStateEnabled);
    state.setSaveCoordinatorStateEnabled(saveCoordinatorStateEnabled);
    state.setPrecision(forest.getPrecision().name());
    state.setCompressed(compressionEnabled);
    state.setPartialTreeState(partialTreeStateEnabled);
    if (saveExecutorContextEnabled) {
        ExecutionContext executionContext = new ExecutionContext();
        executionContext.setParallelExecutionEnabled(forest.isParallelExecutionEnabled());
        executionContext.setThreadPoolSize(forest.getThreadPoolSize());
        state.setExecutionContext(executionContext);
    }
    if (saveCoordinatorStateEnabled) {
        PointStoreCoordinator<?> pointStoreCoordinator = (PointStoreCoordinator<?>) forest.getUpdateCoordinator();
        PointStoreMapper mapper = new PointStoreMapper();
        mapper.setCompressionEnabled(compressionEnabled);
        mapper.setNumberOfTrees(forest.getNumberOfTrees());
        PointStoreState pointStoreState = mapper.toState((PointStore) pointStoreCoordinator.getStore());
        state.setPointStoreState(pointStoreState);
    }
    List<CompactSamplerState> samplerStates = null;
    if (saveSamplerStateEnabled) {
        samplerStates = new ArrayList<>();
    }
    List<ITree<Integer, ?>> trees = null;
    if (saveTreeStateEnabled) {
        trees = new ArrayList<>();
    }
    CompactSamplerMapper samplerMapper = new CompactSamplerMapper();
    samplerMapper.setCompressionEnabled(compressionEnabled);
    for (IComponentModel<?, ?> component : forest.getComponents()) {
        SamplerPlusTree<Integer, ?> samplerPlusTree = (SamplerPlusTree<Integer, ?>) component;
        CompactSampler sampler = (CompactSampler) samplerPlusTree.getSampler();
        if (samplerStates != null) {
            samplerStates.add(samplerMapper.toState(sampler));
        }
        if (trees != null) {
            trees.add(samplerPlusTree.getTree());
        }
    }
    state.setCompactSamplerStates(samplerStates);
    if (trees != null) {
        RandomCutTreeMapper treeMapper = new RandomCutTreeMapper();
        List<CompactRandomCutTreeState> treeStates = trees.stream().map(t -> treeMapper.toState((RandomCutTree) t)).collect(Collectors.toList());
        state.setCompactRandomCutTreeStates(treeStates);
    }
    return state;
}
Also used : CommonUtils.checkNotNull(com.amazon.randomcutforest.CommonUtils.checkNotNull) Setter(lombok.Setter) Getter(lombok.Getter) Precision(com.amazon.randomcutforest.config.Precision) CompactSampler(com.amazon.randomcutforest.sampler.CompactSampler) Random(java.util.Random) SamplerPlusTree(com.amazon.randomcutforest.executor.SamplerPlusTree) RandomCutTree(com.amazon.randomcutforest.tree.RandomCutTree) ArrayList(java.util.ArrayList) PointStore(com.amazon.randomcutforest.store.PointStore) Weighted(com.amazon.randomcutforest.sampler.Weighted) Config(com.amazon.randomcutforest.config.Config) PointStoreMapper(com.amazon.randomcutforest.state.store.PointStoreMapper) IPointStore(com.amazon.randomcutforest.store.IPointStore) ComponentList(com.amazon.randomcutforest.ComponentList) PointStoreCoordinator(com.amazon.randomcutforest.executor.PointStoreCoordinator) IComponentModel(com.amazon.randomcutforest.IComponentModel) CompactRandomCutTreeContext(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeContext) CompactSamplerState(com.amazon.randomcutforest.state.sampler.CompactSamplerState) CommonUtils.checkArgument(com.amazon.randomcutforest.CommonUtils.checkArgument) PointStoreState(com.amazon.randomcutforest.state.store.PointStoreState) Collectors(java.util.stream.Collectors) RandomCutForest(com.amazon.randomcutforest.RandomCutForest) ITree(com.amazon.randomcutforest.tree.ITree) List(java.util.List) RandomCutTreeMapper(com.amazon.randomcutforest.state.tree.RandomCutTreeMapper) CompactRandomCutTreeState(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState) CompactSamplerMapper(com.amazon.randomcutforest.state.sampler.CompactSamplerMapper) IStreamSampler(com.amazon.randomcutforest.sampler.IStreamSampler) CompactSampler(com.amazon.randomcutforest.sampler.CompactSampler) CompactSamplerMapper(com.amazon.randomcutforest.state.sampler.CompactSamplerMapper) PointStoreState(com.amazon.randomcutforest.state.store.PointStoreState) ITree(com.amazon.randomcutforest.tree.ITree) CompactRandomCutTreeState(com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState) PointStoreCoordinator(com.amazon.randomcutforest.executor.PointStoreCoordinator) RandomCutTreeMapper(com.amazon.randomcutforest.state.tree.RandomCutTreeMapper) PointStoreMapper(com.amazon.randomcutforest.state.store.PointStoreMapper) CompactSamplerState(com.amazon.randomcutforest.state.sampler.CompactSamplerState) SamplerPlusTree(com.amazon.randomcutforest.executor.SamplerPlusTree)

Aggregations

CommonUtils.checkArgument (com.amazon.randomcutforest.CommonUtils.checkArgument)2 CommonUtils.checkNotNull (com.amazon.randomcutforest.CommonUtils.checkNotNull)2 ComponentList (com.amazon.randomcutforest.ComponentList)2 IComponentModel (com.amazon.randomcutforest.IComponentModel)2 RandomCutForest (com.amazon.randomcutforest.RandomCutForest)2 Config (com.amazon.randomcutforest.config.Config)2 Precision (com.amazon.randomcutforest.config.Precision)2 PointStoreCoordinator (com.amazon.randomcutforest.executor.PointStoreCoordinator)2 SamplerPlusTree (com.amazon.randomcutforest.executor.SamplerPlusTree)2 CompactSampler (com.amazon.randomcutforest.sampler.CompactSampler)2 IStreamSampler (com.amazon.randomcutforest.sampler.IStreamSampler)2 Weighted (com.amazon.randomcutforest.sampler.Weighted)2 CompactSamplerMapper (com.amazon.randomcutforest.state.sampler.CompactSamplerMapper)2 CompactSamplerState (com.amazon.randomcutforest.state.sampler.CompactSamplerState)2 PointStoreMapper (com.amazon.randomcutforest.state.store.PointStoreMapper)2 PointStoreState (com.amazon.randomcutforest.state.store.PointStoreState)2 CompactRandomCutTreeContext (com.amazon.randomcutforest.state.tree.CompactRandomCutTreeContext)2 CompactRandomCutTreeState (com.amazon.randomcutforest.state.tree.CompactRandomCutTreeState)2 RandomCutTreeMapper (com.amazon.randomcutforest.state.tree.RandomCutTreeMapper)2 IPointStore (com.amazon.randomcutforest.store.IPointStore)2