Re: Index using URL

Erik Hatcher Fri, 14 Nov 2003 13:26:51 -0800

You should write your own code that creates the Document objects with the fields you wish, with a Field.Keyword for the URL probably. Take what is useful from IndexHTML.java, but don't use it as-is. If you're speaking of pulling the document from a URL now you're talking of doing some HTTP communication, and HttpClient is one clean option for this.

Erik

On Friday, November 14, 2003, at 03:09 PM, Zhou, Oliver wrote:

I'm using lucene demo IndexHTML.java and pdfbox to index pdf files under a file directory. I have a url point to the file directory. I'd like to index the files using the URL instead of the file directory. Any idea how to make IndexHTML take URL for indexing?
Thanks,
Oliver
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Index using URL

Reply via email to