Hi,
I would like to validate a XML file which is embedded into a PDF-A file via VeraPDF policy validation but the feature extraction only shows metadata, like fileName, description, subtype and afRelationship. Is it possible to have the file contents in the extraction report?
thanks and kind regards Jochen
Hi Jochen,
This shouldn't be considered a definitive answer but I believe that in theory, this is possible by using a Feature Extraction plugin. In practice, however, I don't believe that such a plugin currently exists. It's an interesting, and entirely valid, twist on using the policy checker. Is the job a one-off?
Best, Carl
On Thu, 4 May 2017 at 18:51 Jochen Staerk jstaerk@usegroup.de wrote:
Hi,
I would like to validate a XML file which is embedded into a PDF-A file via VeraPDF policy validation but the feature extraction only shows metadata, like fileName, description, subtype and afRelationship. Is it possible to have the file contents in the extraction report?
thanks and kind regards Jochen _______________________________________________ Users mailing list Users@lists.verapdf.org http://lists.verapdf.org/listinfo/users
Hi,
checker. Is the job a one-off?
No. I would use veraPDF as a validator against a PDF-A3-embedded-XML-file-based invoice metadata standard called ZUGFeRD ( http://www.ferd-net.de/front_content.php?idcat=231&changelang=4 ).
The ZUGFeRD XML structure is more or less schematron based and that's why that particular "twist" as you call it is so appealing.
-> Looks as I would like to contribute something file extractish to veraPDF.
If you gave me a hint how to start writing an according feature extraction plugin I would like to try. And probably fail. And because I love failing, in a second step I would like to fail on a proper veraPDF ZUGFeRD validation plugin ;-)
Let me guess: First step for me would be to try to build https://github.com/veraPDF/veraPDF-plugins/blob/integration/embeddedfileSamp... and then try some amendments?
kind regards Jochen
users@lists.openpreservation.org