Re: Runtime full text search like in Microsoft Windows Search

Erik Hatcher Fri, 15 Jul 2005 17:02:02 -0700


On Jul 15, 2005, at 3:12 PM, [EMAIL PROTECTED] wrote:

If Microsoft Search does as you describe. Isn't it just:

1) Open file
2) Determine file type
3) Convert file content to UTF8, if text based, and you have theAPI to read it. .html, .txt., .doc, .excel, etc.
4) Perform string search, regex.
5) Continue to next file

As far as I know, Lucene is not designed for unindexed search.

The new MemoryIndex might be perfect for this sort of thing. Isuspect Microsoft's search doesn't allow anything but a term/exact-phrase kinda query, so even MemoryIndex might be doing more work (andthus slower) for a fair comparison. However, you'd be able to dorich queries using MemoryIndex and it has been heavily tuned forperformance. The slow part will be simply reading the files andconverting them.


    Erik


Tardif, Sebastien wrote:

How can you use Lucene like the very limited but fast search that
Microsoft Windows Search provide?
The use case is that the users have a CD with lot of files. Iprovide
them a nice user interface. They have the option to generate the full
text search index but they should also be able to search without an
index generated. I know that will be slow, but Microsoft WindowsSearch
is still able to search 500 MB in less than 30 seconds for simpler
matching.
 How can I use Lucene for this simpler, not existing index, search?
Or should I have to hook to operating system specific API likeWin32 on
Windows?


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Runtime full text search like in Microsoft Windows Search

Reply via email to