On Jul 15, 2005, at 3:12 PM, [EMAIL PROTECTED] wrote:

If Microsoft Search does as you describe. Isn't it just:

1) Open file
2) Determine file type
3) Convert file content to UTF8, if text based, and you have the API to read it. .html, .txt., .doc, .excel, etc.
4) Perform string search, regex.
5) Continue to next file

As far as I know, Lucene is not designed for unindexed search.

The new MemoryIndex might be perfect for this sort of thing. I suspect Microsoft's search doesn't allow anything but a term/exact- phrase kinda query, so even MemoryIndex might be doing more work (and thus slower) for a fair comparison. However, you'd be able to do rich queries using MemoryIndex and it has been heavily tuned for performance. The slow part will be simply reading the files and converting them.

    Erik



Tardif, Sebastien wrote:

How can you use Lucene like the very limited but fast search that
Microsoft Windows Search provide?
The use case is that the users have a CD with lot of files. I provide
them a nice user interface. They have the option to generate the full
text search index but they should also be able to search without an
index generated. I know that will be slow, but Microsoft Windows Search
is still able to search 500 MB in less than 30 seconds for simpler
matching.
 How can I use Lucene for this simpler, not existing index, search?
Or should I have to hook to operating system specific API like Win32 on
Windows?


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to