Re: GData index html documents

2006-07-30 Thread Otis Gospodnetic
I can confirm that. Nutch includes it, for instance. Otis - Original Message From: Simon Willnauer <[EMAIL PROTECTED]> To: java-dev@lucene.apache.org Sent: Sunday, July 30, 2006 6:41:08 PM Subject: Re: GData index html documents I got in touch with andy, he told me that it wo

Re: GData index html documents

2006-07-30 Thread Simon Willnauer
I got in touch with andy, he told me that it would be totally alright to include it. Other projects already using it. regards Simon On 7/30/06, Simon Willnauer <[EMAIL PROTECTED]> wrote: Hello all, I'm at a point where I have to retrieve data from entry elements which could contain text, html,

GData index html documents

2006-07-30 Thread Simon Willnauer
Hello all, I'm at a point where I have to retrieve data from entry elements which could contain text, html, xhtml or even xml. So there is not problem so far. Detecting which format the element contains is also pretty easy as each element has a "type" attribute. if there is not such type attribut