Search in sources :

Example 1 with BatchTSet

use of edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet in project beam by apache.

the class BeamBatchWorker method executePipeline.

public void executePipeline(BatchTSetEnvironment env) {
    Map<String, CachedTSet> sideInputTSets = new HashMap<>();
    for (Map.Entry<String, BatchTSet<?>> sides : sideInputDataSets.entrySet()) {
        BatchTSet<?> sideTSet = sides.getValue();
        addInputs((BaseTSet) sideTSet, sideInputTSets);
        CachedTSet tempCache = (CachedTSet) sideTSet.cache();
        sideInputTSets.put(sides.getKey(), tempCache);
    }
    for (TSet leaf : leaves) {
        SinkTSet sinkTSet = (SinkTSet) leaf.direct().sink(new Twister2SinkFunction());
        addInputs(sinkTSet, sideInputTSets);
        eval(env, sinkTSet);
    }
}
Also used : SinkTSet(edu.iu.dsc.tws.tset.sets.batch.SinkTSet) BatchTSet(edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet) CachedTSet(edu.iu.dsc.tws.tset.sets.batch.CachedTSet) HashMap(java.util.HashMap) LinkedHashMap(java.util.LinkedHashMap) Twister2SinkFunction(org.apache.beam.runners.twister2.translators.functions.Twister2SinkFunction) ComputeTSet(edu.iu.dsc.tws.tset.sets.batch.ComputeTSet) BuildableTSet(edu.iu.dsc.tws.tset.sets.BuildableTSet) CachedTSet(edu.iu.dsc.tws.tset.sets.batch.CachedTSet) SinkTSet(edu.iu.dsc.tws.tset.sets.batch.SinkTSet) BaseTSet(edu.iu.dsc.tws.tset.sets.BaseTSet) TSet(edu.iu.dsc.tws.api.tset.sets.TSet) BatchTSet(edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet) HashMap(java.util.HashMap) LinkedHashMap(java.util.LinkedHashMap) Map(java.util.Map)

Example 2 with BatchTSet

use of edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet in project beam by apache.

the class BeamBatchWorker method setupTSets.

/**
 * Extract the sideInput TSets and the Leaves from the graph.
 *
 * @param env BatchTSetEnvironment which contains the tSet Graph
 * @param sideInputIds the id's of the side Inputs
 * @param leaveIds the id's of the leaf
 */
private void setupTSets(BatchTSetEnvironment env, Map<String, String> sideInputIds, Set<String> leaveIds) {
    sideInputDataSets = new LinkedHashMap<>();
    leaves = new HashSet<>();
    // reset sources, so that the graph does not have two source objects
    // created during deserialization
    Set<BuildableTSet> newSources = new HashSet<>();
    for (BuildableTSet source : env.getGraph().getSources()) {
        newSources.add((BuildableTSet) env.getGraph().getNodeById(source.getId()));
    }
    env.getGraph().setSources(newSources);
    for (Map.Entry<String, String> entry : sideInputIds.entrySet()) {
        BatchTSet curr = (BatchTSet) env.getGraph().getNodeById(entry.getValue());
        sideInputDataSets.put(entry.getKey(), curr);
    }
    for (String leaveId : leaveIds) {
        leaves.add((TSet) env.getGraph().getNodeById(leaveId));
    }
}
Also used : BatchTSet(edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet) BuildableTSet(edu.iu.dsc.tws.tset.sets.BuildableTSet) HashMap(java.util.HashMap) LinkedHashMap(java.util.LinkedHashMap) Map(java.util.Map) HashSet(java.util.HashSet)

Aggregations

BatchTSet (edu.iu.dsc.tws.api.tset.sets.batch.BatchTSet)2 BuildableTSet (edu.iu.dsc.tws.tset.sets.BuildableTSet)2 HashMap (java.util.HashMap)2 LinkedHashMap (java.util.LinkedHashMap)2 Map (java.util.Map)2 TSet (edu.iu.dsc.tws.api.tset.sets.TSet)1 BaseTSet (edu.iu.dsc.tws.tset.sets.BaseTSet)1 CachedTSet (edu.iu.dsc.tws.tset.sets.batch.CachedTSet)1 ComputeTSet (edu.iu.dsc.tws.tset.sets.batch.ComputeTSet)1 SinkTSet (edu.iu.dsc.tws.tset.sets.batch.SinkTSet)1 HashSet (java.util.HashSet)1 Twister2SinkFunction (org.apache.beam.runners.twister2.translators.functions.Twister2SinkFunction)1