Hi Lewis,

Thanks, I did take a look at this. I was looking for a way to convert the data
extracted by Tika to JSON, but it seems nothing is available out of the box
(other than serializing the parsed metadata). What I have done is extend one
of the SAX handlers and override its existing methods to suit my needs.
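
A minimal sketch of that approach follows. It is illustrative only, not the
exact handler used here: it assumes the JDK's org.xml.sax.helpers.DefaultHandler
as the base class, and the names JsonContentHandler, Node, toJson, and convert
are hypothetical. Each XHTML element is collected into a small tree and then
written out as JSON by hand, so no JSON library is needed.

```java
import java.io.StringReader;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: collects SAX events into a tree, then emits JSON.
class JsonContentHandler extends DefaultHandler {

    // One element: tag name, its direct text, and its child elements.
    private static final class Node {
        final String tag;
        final StringBuilder text = new StringBuilder();
        final List<Node> children = new ArrayList<>();
        Node(String tag) { this.tag = tag; }
    }

    private final Deque<Node> open = new ArrayDeque<>(); // currently open elements
    private Node root;

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // qName is used because the JDK parser below is not namespace-aware;
        // a namespace-aware source may call for localName instead (assumption).
        Node node = new Node(qName);
        if (open.isEmpty()) root = node; else open.peek().children.add(node);
        open.push(node);
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        if (!open.isEmpty()) open.peek().text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        open.pop();
    }

    String toJson() {
        return root == null ? "null" : write(root, new StringBuilder()).toString();
    }

    // Recursively writes {"tag":..., "text":..., "children":[...]}.
    private static StringBuilder write(Node n, StringBuilder out) {
        out.append("{\"tag\":").append(quote(n.tag));
        String text = n.text.toString().trim();
        if (!text.isEmpty()) out.append(",\"text\":").append(quote(text));
        if (!n.children.isEmpty()) {
            out.append(",\"children\":[");
            for (int i = 0; i < n.children.size(); i++) {
                if (i > 0) out.append(',');
                write(n.children.get(i), out);
            }
            out.append(']');
        }
        return out.append('}');
    }

    // Escapes a Java string as a JSON string literal.
    private static String quote(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"': sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.append('"').toString();
    }

    // Parses an XHTML string with the JDK SAX parser and returns JSON.
    static String convert(String xhtml) throws Exception {
        JsonContentHandler handler = new JsonContentHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.toJson();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(convert("<html><body><p>Hello</p></body></html>"));
    }
}
```

Since the handler is an ordinary SAX ContentHandler, it could presumably be
passed to Tika's Parser.parse(InputStream, ContentHandler, Metadata,
ParseContext) in place of the JDK parser used in convert above.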

Regards,
Vineet

On Sun, Sep 28, 2014 at 10:26 AM, Lewis John Mcgibbney <
lewis.mcgibb...@gmail.com> wrote:

> Hi Vineet,
>
> On Sun, Sep 28, 2014 at 1:21 AM, <dev-digest-h...@tika.apache.org> wrote:
>
> >
> > I was wondering if there is any built-in parser to help with conversion
> > from XHTML to JSON.
> >
> > My research showed that there is one named org.apache.io.json, which has
> > just one method implemented. Also, I tried the GJSON library to do this,
> > but it does not seem to work with Tika. Any suggestions would be
> > appreciated.
> >
> I don't know if you are talking about generally converting from XHTML to
> JSON, but within the context of Tika you can try out the serialization
> module [0].
> This will serialize your Tika metadata as JSON.
> hth
> Lewis
>
> [0] https://svn.apache.org/repos/asf/tika/trunk/tika-serialization/
>



-- 
*Vineet Ghatge*
