You should write your own code that creates the Document objects with the fields you wish, with a Field.Keyword for the URL probably. Take what is useful from IndexHTML.java, but don't use it as-is. If you're speaking of pulling the document from a URL now you're talking of doing some HTTP communication, and HttpClient is one clean option for this.

Erik


On Friday, November 14, 2003, at 03:09 PM, Zhou, Oliver wrote:


I'm using lucene demo IndexHTML.java and pdfbox to index pdf files under a
file directory. I have a url point to the file directory. I'd like to
index the files using the URL instead of the file directory. Any idea how
to make IndexHTML take URL for indexing?


Thanks,
Oliver



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to