Re: suitability of lucene for project

2004-04-14 Thread Sebastian Ho
I will be searching web pages (URLs given by the user) for keywords (in clinical records). Would that count as structured or unstructured? The records might be in a table, or in a list of URLs pointing to individual record pages. Thanks, Sebastian On Tue, 2004-04-13 at 11:15, Stephane James Vaucher wrote: It

suitability of lucene for project

2004-04-12 Thread Sebastian Ho
Hi all, I am investigating technologies for a project that retrieves HTML pages on a regular basis (or whenever they change), parses the HTML to extract specific information, and presents the results as links on a web page. Note that this is not a general search engine kind

Re: suitability of lucene for project

2004-04-12 Thread Erik Hatcher
No, Lucene is not the right solution for this particular use. It does not include anything to retrieve HTML pages, or to parse them. However, if you ever need full-text search, then Lucene is where it's at. Erik On Apr 12, 2004, at 9:28 PM, Sebastian Ho wrote: hi all i am investigating

Re: suitability of lucene for project

2004-04-12 Thread Stephane James Vaucher
It could be part of your solution, but I don't think so. Let me explain: I've done something similar to what you describe a few times. I often use HttpUnit to get information. How you process it is up to you. If you want it to be indexed (searchable), you can use Lucene. If you want