Search in sources :

Example 1 with XpacketParsingException

use of org.apache.pdfbox.preflight.metadata.XpacketParsingException in project pdfbox by apache.

the class MetadataValidationProcess method validate.

@Override
public void validate(PreflightContext ctx) throws ValidationException {
    try {
        PDDocument document = ctx.getDocument();
        InputStream is = getXpacket(document.getDocument());
        DomXmpParser builder = new DomXmpParser();
        XMPMetadata metadata = builder.parse(is);
        is.close();
        ctx.setMetadata(metadata);
        // 6.7.5 no deprecated attribute in xpacket processing instruction
        if (metadata.getXpacketBytes() != null) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_XPACKET_DEPRECATED, "bytes attribute is forbidden"));
        }
        if (metadata.getXpacketEncoding() != null) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_XPACKET_DEPRECATED, "encoding attribute is forbidden"));
        }
        checkThumbnails(ctx, metadata);
        // Call metadata synchronization checking
        addValidationErrors(ctx, new SynchronizedMetaDataValidation().validateMetadataSynchronization(document, metadata));
        // Call PDF/A Identifier checking
        addValidationErrors(ctx, new PDFAIdentificationValidation().validatePDFAIdentifer(metadata));
        // Call rdf:about checking
        try {
            new RDFAboutAttributeConcordanceValidation().validateRDFAboutAttributes(metadata);
        } catch (DifferentRDFAboutException e) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_RDF_ABOUT_ATTRIBUTE_INEQUAL_VALUE, e.getMessage(), e));
        }
    } catch (XpacketParsingException e) {
        if (e.getError() != null) {
            addValidationError(ctx, e.getError());
        } else {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_MAIN, "Unexpected error", e));
        }
    } catch (XmpParsingException e) {
        if (e.getErrorType() == ErrorType.NoValueType) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_UNKNOWN_VALUETYPE, e.getMessage(), e));
        } else if (e.getErrorType() == ErrorType.RequiredProperty) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_PROPERTY_MISSING, e.getMessage(), e));
        } else if (e.getErrorType() == ErrorType.InvalidPrefix) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_ABSENT_DESCRIPTION_SCHEMA, e.getMessage(), e));
        } else if (e.getErrorType() == ErrorType.InvalidType) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_PROPERTY_UNKNOWN, e.getMessage(), e));
        } else if (e.getErrorType() == ErrorType.XpacketBadEnd) {
            throw new ValidationException("Unable to parse font metadata due to : " + e.getMessage(), e);
        } else if (e.getErrorType() == ErrorType.NoSchema) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_ABSENT_DESCRIPTION_SCHEMA, e.getMessage(), e));
        } else if (e.getErrorType() == ErrorType.InvalidPdfaSchema) {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_WRONG_NS_URI, e.getMessage(), e));
        } else {
            addValidationError(ctx, new ValidationError(PreflightConstants.ERROR_METADATA_FORMAT, e.getMessage(), e));
        }
    } catch (IOException e) {
        throw new ValidationException("Failed while validating", e);
    }
}
Also used : XpacketParsingException(org.apache.pdfbox.preflight.metadata.XpacketParsingException) ValidationException(org.apache.pdfbox.preflight.exception.ValidationException) ByteArrayInputStream(java.io.ByteArrayInputStream) InputStream(java.io.InputStream) IOException(java.io.IOException) RDFAboutAttributeConcordanceValidation(org.apache.pdfbox.preflight.metadata.RDFAboutAttributeConcordanceValidation) XmpParsingException(org.apache.xmpbox.xml.XmpParsingException) SynchronizedMetaDataValidation(org.apache.pdfbox.preflight.metadata.SynchronizedMetaDataValidation) XMPMetadata(org.apache.xmpbox.XMPMetadata) DomXmpParser(org.apache.xmpbox.xml.DomXmpParser) DifferentRDFAboutException(org.apache.pdfbox.preflight.metadata.RDFAboutAttributeConcordanceValidation.DifferentRDFAboutException) PDDocument(org.apache.pdfbox.pdmodel.PDDocument) ValidationError(org.apache.pdfbox.preflight.ValidationResult.ValidationError) PDFAIdentificationValidation(org.apache.pdfbox.preflight.metadata.PDFAIdentificationValidation)

Example 2 with XpacketParsingException

use of org.apache.pdfbox.preflight.metadata.XpacketParsingException in project pdfbox by apache.

the class MetadataValidationProcess method getXpacket.

/**
 * Return the xpacket from the dictionary's stream
 */
private static InputStream getXpacket(COSDocument cdocument) throws IOException, XpacketParsingException {
    COSObject catalog = cdocument.getCatalog();
    COSBase cb = catalog.getDictionaryObject(COSName.METADATA);
    if (cb == null) {
        // missing Metadata Key in catalog
        ValidationError error = new ValidationError(PreflightConstants.ERROR_METADATA_FORMAT, "Missing Metadata Key in catalog");
        throw new XpacketParsingException("Failed while retrieving xpacket", error);
    }
    // no filter key
    COSDictionary metadataDictionnary = COSUtils.getAsDictionary(cb, cdocument);
    if (metadataDictionnary.getItem(COSName.FILTER) != null) {
        // should not be defined
        ValidationError error = new ValidationError(PreflightConstants.ERROR_SYNTAX_STREAM_INVALID_FILTER, "Filter specified in metadata dictionnary");
        throw new XpacketParsingException("Failed while retrieving xpacket", error);
    }
    if (!(metadataDictionnary instanceof COSStream)) {
        // missing Metadata Key in catalog
        ValidationError error = new ValidationError(PreflightConstants.ERROR_METADATA_FORMAT, "Metadata is not a stream");
        throw new XpacketParsingException("Failed while retrieving xpacket", error);
    }
    COSStream stream = (COSStream) metadataDictionnary;
    return stream.createInputStream();
}
Also used : XpacketParsingException(org.apache.pdfbox.preflight.metadata.XpacketParsingException) COSStream(org.apache.pdfbox.cos.COSStream) COSDictionary(org.apache.pdfbox.cos.COSDictionary) COSObject(org.apache.pdfbox.cos.COSObject) COSBase(org.apache.pdfbox.cos.COSBase) ValidationError(org.apache.pdfbox.preflight.ValidationResult.ValidationError)

Aggregations

ValidationError (org.apache.pdfbox.preflight.ValidationResult.ValidationError)2 XpacketParsingException (org.apache.pdfbox.preflight.metadata.XpacketParsingException)2 ByteArrayInputStream (java.io.ByteArrayInputStream)1 IOException (java.io.IOException)1 InputStream (java.io.InputStream)1 COSBase (org.apache.pdfbox.cos.COSBase)1 COSDictionary (org.apache.pdfbox.cos.COSDictionary)1 COSObject (org.apache.pdfbox.cos.COSObject)1 COSStream (org.apache.pdfbox.cos.COSStream)1 PDDocument (org.apache.pdfbox.pdmodel.PDDocument)1 ValidationException (org.apache.pdfbox.preflight.exception.ValidationException)1 PDFAIdentificationValidation (org.apache.pdfbox.preflight.metadata.PDFAIdentificationValidation)1 RDFAboutAttributeConcordanceValidation (org.apache.pdfbox.preflight.metadata.RDFAboutAttributeConcordanceValidation)1 DifferentRDFAboutException (org.apache.pdfbox.preflight.metadata.RDFAboutAttributeConcordanceValidation.DifferentRDFAboutException)1 SynchronizedMetaDataValidation (org.apache.pdfbox.preflight.metadata.SynchronizedMetaDataValidation)1 XMPMetadata (org.apache.xmpbox.XMPMetadata)1 DomXmpParser (org.apache.xmpbox.xml.DomXmpParser)1 XmpParsingException (org.apache.xmpbox.xml.XmpParsingException)1