Re: Query parser error

2002-05-30 Thread Peter Carlson
Try using the newest release and read the Queryparser syntax. http://jakarta.apache.org/lucene/docs/queryparsersyntax.html Escape characters were just added. --Peter On 5/29/02 10:53 PM, Harpreet S Walia [EMAIL PROTECTED] wrote: Hi I am trying to search words which have characters { ,

Adding segments after exiting program

2002-05-30 Thread Hyong Ko
Hello. Is there a way to add segments to the existing index files after exiting the program, ex. opening and adding segments? How do I preserve my previously indexed files? Thank you very much. _ Get your FREE download of MSN

RE: MS Word Search ??

2002-05-30 Thread Armbrust, Daniel C.
This might be worth looking into for those who need to parse word, excel, powerpoint, or other MS file types of microsofts. openoffice - www.openoffice.org knows how to parse all of the microsoft formats (at least all that I've tried so far) - and then, you can a do a save as, and write out the

segment count

2002-05-30 Thread Hyong Ko
I added a segment using IndexWriter.addDocument. Then I called IndexWriter.optimize (IndexWriter.close works too) to generate index files to do a search. Then I added another segment using IndexWriter.addDocument. The total segment count should be 2, but instead it's 3. Any ideas? Is this the

new product

2002-05-30 Thread ip m
Hi all, A new product that uses Lucene is available at http://www.intellectronix.com/ipm/ Among other things IPM provides powerful gui tools for creating and managing sophisticated Lucene indexes. Could you please include it in Powered by Lucene? Thanks!

RE: MS Word Search ??

2002-05-30 Thread Bruce Altner
This is a good lead but it prompts me to ask this: if tools like openoffice and others (like Acrobat Distiller) know how to reformat Excel, PowerPoint, and Word it means that the data formats of these files, as streams, must be public knowledge. If so, where do you get this information? I

Re: MS Word Search ??

2002-05-30 Thread Andrew C. Oliver
Bruce Altner wrote: This is a good lead but it prompts me to ask this: if tools like openoffice and others (like Acrobat Distiller) know how to reformat Excel, PowerPoint, and Word it means that the data formats of these files, as streams, must be public knowledge. If so, where do you get

Re: segment count

2002-05-30 Thread Hyong Ko
Everytime segments are merged, the segment counter (based on an index number which is zero based) is incremented by one to name the index files. From: Hyong Ko Reply-To: Lucene Users List To: [EMAIL PROTECTED] Subject: segment count Date: Thu, 30 May 2002 17:52:57 + I added a segment

Re: MS Word Search ??

2002-05-30 Thread Ian Lea
... openoffice - www.openoffice.org knows how to parse all of the microsoft ... #2 - if open office is programmatically drivable (which I don't know if it is), fire up a copy of open office and use it to convert the files as necessary. See an earlier post to this list:

Re: segment count

2002-05-30 Thread Ian Lea
Lucene doesn't store one document per segment. See http://marc.theaimsgroup.com/?l=lucene-userm=102079295608850w=2 for detail on the files created. On is this the right way ...?, here is an extract from the javadoc 1. Create Document's by adding Field's. 2. Create an IndexWriter and add

Re: segment count

2002-05-30 Thread Hyong Ko
Thank you very much for your prompt response. In order to make a search, the mergeSegments() function must be called right? Otherwise IndexSearcher won't have the most updated index files to work with to do a search. I guess my point is that do I have to intermittenly call Optimize or Close

Re: Index Persistency

2002-05-30 Thread Brandon Jockman
You shouldn't have to. It depends on the type of index you are using. Typically indexes are stored on the file system. They can however be RAM based that you would need to serialize to persist, or I believe you could use other storage mediums such as CD rom. However typically, it is persisted