Getting the XML Layer from Hybrid PDF files

Hello.

Starting next year Electronic Invoicing will be introduced and will gradually become mandatory for B2B where I live. This means PDF invoices will have an XML layer conforming to a certain standard.

This will greatly simplify workflows for invoices from unknown sources.

However I am having trouble getting to the XML layer in hybrid PDF documents.

Is this even possible in Mayan? Or do I have to extract the XML content outside Mayan and then pass it via the API and overwrite the OCR content?

4 Likes

Indeed, this is related to ALL EU companies and would give a huge boost.
(eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32014L0055)