Hi, my ultimate goal is to extract text data from PDFs forms using xfa. Is
it possible to use pdfbox to flatten PDFs with xfa forms ( to simplify text
extraction).

If not can the fields themselves be easily parsed?

I see
https://stackoverflow.com/questions/14454387/pdfbox-how-to-flatten-a-pdf-form
which seems to say that xfa is not flatten able?

I see this class,
https://pdfbox.apache.org/docs/1.8.12/javadocs/org/apache/pdfbox/pdmodel/interactive/form/PDXFA.html,
once I call getDocument, how can I get fields (by name/type/) and contents?

Thanks!

Reply via email to