Hi Lewis, Thanks. I did take a look at this. I was looking for ways to jsonify data extracted by tika, but that seems like nothing out of box is present with tika (other than getting metadata parsed). The way I have done is by extending one of the SAXHandler and overriding the existing functions to suit my needs.
Regards, Vineet On Sun, Sep 28, 2014 at 10:26 AM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Vineet, > > On Sun, Sep 28, 2014 at 1:21 AM, <dev-digest-h...@tika.apache.org> wrote: > > > > > I was wondering if there any in built parser to get help in conversion > from > > XHTML to JSON. > > > > My research showed that there is one named org.apache.io.json which just > > one method implemented. Also, I tried GJSON library to do this, but it > does > > not seem to work with Tika. Any suggestions will be appreciated? > > > > I don't know if you are talking about generally converting from XHTML to > JSON, but within the context of Tika you can try out the serialization > module [0]. > This will serialize your Tika metadata as JSON. > hth > Lewis > > [0] https://svn.apache.org/repos/asf/tika/trunk/tika-serialization/ > -- *Vineet Ghatge*