Search in sources :

Example 31 with Tika

use of org.apache.tika.Tika in project tika by apache.

the class CustomMimeInfo method customMimeInfo.

public static String customMimeInfo() throws Exception {
    String path = "file:///path/to/prescription-type.xml";
    MimeTypes typeDatabase = MimeTypesFactory.create(new URL(path));
    Tika tika = new Tika(typeDatabase);
    String type = tika.detect("/path/to/prescription.xpd");
    return type;
}
Also used : MimeTypes(org.apache.tika.mime.MimeTypes) Tika(org.apache.tika.Tika) URL(java.net.URL)

Example 32 with Tika

use of org.apache.tika.Tika in project tika by apache.

the class CustomMimeInfo method customCompositeDetector.

public static String customCompositeDetector() throws Exception {
    String path = "file:///path/to/prescription-type.xml";
    MimeTypes typeDatabase = MimeTypesFactory.create(new URL(path));
    Tika tika = new Tika(new CompositeDetector(typeDatabase, new EncryptedPrescriptionDetector()));
    String type = tika.detect("/path/to/tmp/prescription.xpd");
    return type;
}
Also used : CompositeDetector(org.apache.tika.detect.CompositeDetector) MimeTypes(org.apache.tika.mime.MimeTypes) Tika(org.apache.tika.Tika) URL(java.net.URL)

Example 33 with Tika

use of org.apache.tika.Tika in project tika by apache.

the class AdvancedTypeDetector method detectWithCustomDetector.

public static String detectWithCustomDetector(String name) throws Exception {
    String config = "/org/apache/tika/mime/tika-mimetypes.xml";
    Detector detector = MimeTypesFactory.create(config);
    Detector custom = new Detector() {

        private static final long serialVersionUID = -5420638839201540749L;

        public MediaType detect(InputStream input, Metadata metadata) {
            String type = metadata.get("my-custom-type-override");
            if (type != null) {
                return MediaType.parse(type);
            } else {
                return MediaType.OCTET_STREAM;
            }
        }
    };
    Tika tika = new Tika(new CompositeDetector(custom, detector));
    return tika.detect(name);
}
Also used : CompositeDetector(org.apache.tika.detect.CompositeDetector) CompositeDetector(org.apache.tika.detect.CompositeDetector) Detector(org.apache.tika.detect.Detector) InputStream(java.io.InputStream) Metadata(org.apache.tika.metadata.Metadata) Tika(org.apache.tika.Tika)

Example 34 with Tika

use of org.apache.tika.Tika in project tika by apache.

the class AdvancedTypeDetector method detectWithCustomConfig.

public static String detectWithCustomConfig(String name) throws Exception {
    String config = "/org/apache/tika/mime/tika-mimetypes.xml";
    Tika tika = new Tika(MimeTypesFactory.create(config));
    return tika.detect(name);
}
Also used : Tika(org.apache.tika.Tika)

Example 35 with Tika

use of org.apache.tika.Tika in project tika by apache.

the class TestMimeTypes method setUp.

@Before
public void setUp() throws Exception {
    TikaConfig config = TikaConfig.getDefaultConfig();
    repo = config.getMimeRepository();
    tika = new Tika(config);
    u = new URL("http://mydomain.com/x.pdf?x=y");
}
Also used : TikaConfig(org.apache.tika.config.TikaConfig) Tika(org.apache.tika.Tika) URL(java.net.URL) Before(org.junit.Before)

Aggregations

Tika (org.apache.tika.Tika)54 Test (org.junit.Test)32 Metadata (org.apache.tika.metadata.Metadata)29 ByteArrayInputStream (java.io.ByteArrayInputStream)14 TikaTest (org.apache.tika.TikaTest)12 TikaConfig (org.apache.tika.config.TikaConfig)12 File (java.io.File)8 InputStream (java.io.InputStream)7 URL (java.net.URL)6 TikaInputStream (org.apache.tika.io.TikaInputStream)5 IOException (java.io.IOException)4 HashSet (java.util.HashSet)4 Ignore (org.junit.Ignore)4 FileInputStream (java.io.FileInputStream)3 ArrayList (java.util.ArrayList)3 HashMap (java.util.HashMap)3 Content (org.apache.nutch.protocol.Content)3 Before (org.junit.Before)3 FileOutputStream (java.io.FileOutputStream)2 UnsupportedEncodingException (java.io.UnsupportedEncodingException)2