Example 21 with IndexArgs

Use of io.anserini.index.IndexArgs in the Anserini project by castorini.

Class TrecEndToEndExternalStopwordsTest, method getIndexArgs.

@Override
protected IndexArgs getIndexArgs() {
    IndexArgs indexArgs = createDefaultIndexArgs();
    indexArgs.input = "src/test/resources/sample_docs/trec/collection2";
    indexArgs.collectionClass = TrecCollection.class.getSimpleName();
    indexArgs.stopwords = "src/test/resources/sample_docs/trec/collection2/stopwords.txt";
    return indexArgs;
}
Also used: TrecCollection (io.anserini.collection.TrecCollection), IndexArgs (io.anserini.index.IndexArgs)

Example 22 with IndexArgs

Use of io.anserini.index.IndexArgs in the Anserini project by castorini.

Class PretokenizedIndexEndToEndTest, method getIndexArgs.

@Override
IndexArgs getIndexArgs() {
    IndexArgs indexArgs = createDefaultIndexArgs();
    indexArgs.input = "src/test/resources/sample_docs/json/collection_tokenized";
    indexArgs.collectionClass = JsonCollection.class.getSimpleName();
    indexArgs.generatorClass = DefaultLuceneDocumentGenerator.class.getSimpleName();
    indexArgs.pretokenized = true;
    indexArgs.storeRaw = true;
    return indexArgs;
}
Also used: DefaultLuceneDocumentGenerator (io.anserini.index.generator.DefaultLuceneDocumentGenerator), IndexArgs (io.anserini.index.IndexArgs), JsonCollection (io.anserini.collection.JsonCollection)
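
Both examples above follow the same shape: start from createDefaultIndexArgs(), then set only the fields that differ for the test at hand. The sketch below reproduces that shape without depending on Anserini. Everything in it is a hypothetical stand-in: the Args class models only the fields that appear in the two examples, the base class and its default value are assumptions made for illustration, and none of it is Anserini's actual implementation.

```java
public class IndexArgsSketch {

    // Hypothetical stand-in for io.anserini.index.IndexArgs.
    // Only the fields set in the two examples above are modeled.
    static class Args {
        String input;
        String collectionClass;
        String generatorClass;
        String stopwords;
        boolean pretokenized;
        boolean storeRaw;
    }

    // Hypothetical base class: supplies shared defaults and leaves
    // the per-test settings to each subclass.
    abstract static class EndToEndSketch {
        Args createDefaultIndexArgs() {
            Args args = new Args();
            // Assumed default, chosen for illustration only.
            args.generatorClass = "DefaultLuceneDocumentGenerator";
            return args;
        }

        abstract Args getIndexArgs();
    }

    // Mirrors Example 21: a TREC collection with an external stopwords file.
    static class StopwordsSketch extends EndToEndSketch {
        @Override
        Args getIndexArgs() {
            Args args = createDefaultIndexArgs();
            args.input = "src/test/resources/sample_docs/trec/collection2";
            args.collectionClass = "TrecCollection";
            args.stopwords = "src/test/resources/sample_docs/trec/collection2/stopwords.txt";
            return args;
        }
    }

    // Mirrors Example 22: a pretokenized JSON collection with raw documents stored.
    static class PretokenizedSketch extends EndToEndSketch {
        @Override
        Args getIndexArgs() {
            Args args = createDefaultIndexArgs();
            args.input = "src/test/resources/sample_docs/json/collection_tokenized";
            args.collectionClass = "JsonCollection";
            args.pretokenized = true;
            args.storeRaw = true;
            return args;
        }
    }

    public static void main(String[] unused) {
        // Print the settings each subclass ends up with.
        for (EndToEndSketch test : new EndToEndSketch[] { new StopwordsSketch(), new PretokenizedSketch() }) {
            Args a = test.getIndexArgs();
            System.out.println(test.getClass().getSimpleName()
                + ": collectionClass=" + a.collectionClass
                + ", pretokenized=" + a.pretokenized
                + ", stopwords=" + a.stopwords);
        }
    }
}
```

Keeping the defaults in one method means each subclass states only what is specific to its collection, which is why the two overrides above differ in just a few assignments.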

Aggregations

IndexArgs (io.anserini.index.IndexArgs): 22
TrecCollection (io.anserini.collection.TrecCollection): 6
CoreCollection (io.anserini.collection.CoreCollection): 3
JsonCollection (io.anserini.collection.JsonCollection): 3
DefaultLuceneDocumentGenerator (io.anserini.index.generator.DefaultLuceneDocumentGenerator): 3
Before (org.junit.Before): 3
ObjectMapper (com.fasterxml.jackson.databind.ObjectMapper): 2
ObjectNode (com.fasterxml.jackson.databind.node.ObjectNode): 2
AclAnthology (io.anserini.collection.AclAnthology): 2
AclAnthologyGenerator (io.anserini.index.generator.AclAnthologyGenerator): 2
CoreGenerator (io.anserini.index.generator.CoreGenerator): 2
BibtexCollection (io.anserini.collection.BibtexCollection): 1
C4Collection (io.anserini.collection.C4Collection): 1
JsonVectorCollection (io.anserini.collection.JsonVectorCollection): 1
TweetCollection (io.anserini.collection.TweetCollection): 1
IndexCollection (io.anserini.index.IndexCollection): 1
BibtexGenerator (io.anserini.index.generator.BibtexGenerator): 1
C4Generator (io.anserini.index.generator.C4Generator): 1
TweetGenerator (io.anserini.index.generator.TweetGenerator): 1
SearchSolr (io.anserini.search.SearchSolr): 1