Hi

I am a new user of Tika.

I am handling HTML documents... I succeeded to parse the HTML documents to
a "clean" text string.

However, I am interested to get the structure of the documents : what are
the different sections, what are the titles of these sections etc...

Is there a way to do that with Tika?

Thanks!

Benjamin

Reply via email to