Search in sources :

Example 26 with StandardCrawlerContainer

use of org.codelibs.fess.crawler.container.StandardCrawlerContainer in project fess-crawler by codelibs.

the class MsExcelExtractorTest method setUp.

@Override
protected void setUp() throws Exception {
    super.setUp();
    StandardCrawlerContainer container = new StandardCrawlerContainer().singleton("msExcelExtractor", MsExcelExtractor.class);
    msExcelExtractor = container.getComponent("msExcelExtractor");
}
Also used : StandardCrawlerContainer(org.codelibs.fess.crawler.container.StandardCrawlerContainer)

Example 27 with StandardCrawlerContainer

use of org.codelibs.fess.crawler.container.StandardCrawlerContainer in project fess-crawler by codelibs.

the class MsPowerPointExtractorTest method setUp.

@Override
protected void setUp() throws Exception {
    super.setUp();
    StandardCrawlerContainer container = new StandardCrawlerContainer().singleton("msPowerPointExtractor", MsPowerPointExtractor.class);
    msPowerPointExtractor = container.getComponent("msPowerPointExtractor");
}
Also used : StandardCrawlerContainer(org.codelibs.fess.crawler.container.StandardCrawlerContainer)

Example 28 with StandardCrawlerContainer

use of org.codelibs.fess.crawler.container.StandardCrawlerContainer in project fess-crawler by codelibs.

the class PdfExtractorTest method setUp.

@Override
protected void setUp() throws Exception {
    super.setUp();
    StandardCrawlerContainer container = new StandardCrawlerContainer().singleton("pdfExtractor", // 
    PdfExtractor.class).singleton("pdfExtractorForPdfPassword", PdfExtractor.class);
    pdfExtractor = container.getComponent("pdfExtractor");
    pdfExtractorForPdfPassword = container.getComponent("pdfExtractorForPdfPassword");
    pdfExtractorForPdfPassword.addPassword(".*test_.*.pdf", "word");
}
Also used : StandardCrawlerContainer(org.codelibs.fess.crawler.container.StandardCrawlerContainer)

Example 29 with StandardCrawlerContainer

use of org.codelibs.fess.crawler.container.StandardCrawlerContainer in project fess-crawler by codelibs.

the class TextExtractorTest method setUp.

@Override
protected void setUp() throws Exception {
    super.setUp();
    StandardCrawlerContainer container = new StandardCrawlerContainer().singleton("textExtractor", TextExtractor.class);
    textExtractor = container.getComponent("textExtractor");
}
Also used : StandardCrawlerContainer(org.codelibs.fess.crawler.container.StandardCrawlerContainer)

Example 30 with StandardCrawlerContainer

use of org.codelibs.fess.crawler.container.StandardCrawlerContainer in project fess-crawler by codelibs.

the class TikaExtractorTest method setUp.

@Override
protected void setUp() throws Exception {
    super.setUp();
    StandardCrawlerContainer container = new StandardCrawlerContainer();
    container.singleton("mimeTypeHelper", MimeTypeHelperImpl.class).singleton("tikaExtractor", TikaExtractor.class).<ExtractorFactory>singleton("extractorFactory", ExtractorFactory.class, factory -> {
        TikaExtractor tikaExtractor = container.getComponent("tikaExtractor");
        factory.addExtractor("text/plain", tikaExtractor);
        factory.addExtractor("text/html", tikaExtractor);
    });
    tikaExtractor = container.getComponent("tikaExtractor");
}
Also used : ExtractorFactory(org.codelibs.fess.crawler.extractor.ExtractorFactory) StandardCrawlerContainer(org.codelibs.fess.crawler.container.StandardCrawlerContainer)

Aggregations

StandardCrawlerContainer (org.codelibs.fess.crawler.container.StandardCrawlerContainer)32 MimeTypeHelperImpl (org.codelibs.fess.crawler.helper.impl.MimeTypeHelperImpl)9 ExtractorFactory (org.codelibs.fess.crawler.extractor.ExtractorFactory)7 TikaExtractor (org.codelibs.fess.crawler.extractor.impl.TikaExtractor)3 MemoryDataHelper (org.codelibs.fess.crawler.helper.MemoryDataHelper)3 SitemapsHelper (org.codelibs.fess.crawler.helper.SitemapsHelper)3 UrlFilterServiceImpl (org.codelibs.fess.crawler.service.impl.UrlFilterServiceImpl)3 File (java.io.File)2 ArchiveStreamFactory (org.apache.commons.compress.archivers.ArchiveStreamFactory)2 ResourceUtil (org.codelibs.core.io.ResourceUtil)2 HcHttpClient (org.codelibs.fess.crawler.client.http.HcHttpClient)2 RobotsTxtHelper (org.codelibs.fess.crawler.helper.RobotsTxtHelper)2 CrawlerWebServer (org.codelibs.fess.crawler.util.CrawlerWebServer)2 PlainTestCase (org.dbflute.utflute.core.PlainTestCase)2 Iterator (java.util.Iterator)1 Map (java.util.Map)1 Set (java.util.Set)1 TimeUnit (java.util.concurrent.TimeUnit)1 Resource (javax.annotation.Resource)1 GenericObjectPool (org.apache.commons.pool2.impl.GenericObjectPool)1