Search in sources :

Example 1 with HoodieCompactor

use of org.apache.hudi.utilities.HoodieCompactor in project hudi by apache.

the class SparkMain method compact.

private static int compact(JavaSparkContext jsc, String basePath, String tableName, String compactionInstant, int parallelism, String schemaFile, int retry, String mode, String propsFilePath, List<String> configs) {
    HoodieCompactor.Config cfg = new HoodieCompactor.Config();
    cfg.basePath = basePath;
    cfg.tableName = tableName;
    cfg.compactionInstantTime = compactionInstant;
    // TODO: Make this configurable along with strategy specific config - For now, this is a generic enough strategy
    cfg.strategyClassName = UnBoundedCompactionStrategy.class.getCanonicalName();
    cfg.parallelism = parallelism;
    cfg.schemaFile = schemaFile;
    cfg.runningMode = mode;
    cfg.propsFilePath = propsFilePath;
    cfg.configs = configs;
    return new HoodieCompactor(jsc, cfg).compact(retry);
}
Also used : HoodieWriteConfig(org.apache.hudi.config.HoodieWriteConfig) HoodieIndexConfig(org.apache.hudi.config.HoodieIndexConfig) Config(org.apache.hudi.utilities.HDFSParquetImporter.Config) HoodieBootstrapConfig(org.apache.hudi.config.HoodieBootstrapConfig) HoodieCompactor(org.apache.hudi.utilities.HoodieCompactor) UnBoundedCompactionStrategy(org.apache.hudi.table.action.compact.strategy.UnBoundedCompactionStrategy)

Aggregations

HoodieBootstrapConfig (org.apache.hudi.config.HoodieBootstrapConfig)1 HoodieIndexConfig (org.apache.hudi.config.HoodieIndexConfig)1 HoodieWriteConfig (org.apache.hudi.config.HoodieWriteConfig)1 UnBoundedCompactionStrategy (org.apache.hudi.table.action.compact.strategy.UnBoundedCompactionStrategy)1 Config (org.apache.hudi.utilities.HDFSParquetImporter.Config)1 HoodieCompactor (org.apache.hudi.utilities.HoodieCompactor)1