Getting the XML Layer from Hybrid PDF files

jecasc · November 4, 2024, 10:41am

Hello.

Starting next year Electronic Invoicing will be introduced and will gradually become mandatory for B2B where I live. This means PDF invoices will have an XML layer conforming to a certain standard.

This will greatly simplify workflows for invoices from unknown sources.

However I am having trouble getting to the XML layer in hybrid PDF documents.

Is this even possible in Mayan? Or do I have to extract the XML content outside Mayan and then pass it via the API and overwrite the OCR content?

c0d1n6 · November 17, 2024, 12:47pm

Indeed, this is related to ALL EU companies and would give a huge boost.
(eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:32014L0055)