Hi,

I'm a little newbie with tika and would need some help.

I have many pdf files which i would like to extract metadata, in order to have an xml file (which respect dublin core).

I've followed these links http://www.hascode.com/2012/12/content-detection-metadata-and-content-extraction-with-apache-tika/#Extracting_Metadata_from_a_PDF_using_a_concrete_Parser and http://tika.apache.org/0.8/api/org/apache/tika/metadata/Metadata.html

Do i have to write a program with tika to do it?

How could i do that?

Best regards

Samuel

<<attachment: samuel_desseaux.vcf>>

Reply via email to