RE: Query creation

2003-12-04 Thread Aviran
You'll need to apply some kind of filter, or add another field to the index that contains only the first word (yes, you'll need to rebuild the index in this case). -Original Message- From: Armbrust, Daniel C. [mailto:[EMAIL PROTECTED] Sent: Thursday, December 04, 2003 5:49 PM To: 'Lucene

RE: How would you delete an entry that was indexed like this

2003-12-05 Thread Aviran
This is kind of a problem: in order to delete documents using terms you need to have a keyword field which contains a unique value, otherwise you might end up deleting more than you want. -Original Message- From: Mike Hogan [mailto:[EMAIL PROTECTED] Sent: Friday, December 05, 2003 1:06 PM

Lucene Search has poor cpu utilization on a 4-CPU machine

2004-07-12 Thread Aviran
). My question is: do the fields byNumber and byName have to be synchronized, and what can happen if I change them to ArrayList and HashMap, which are not synchronized? Can this corrupt the index or the integrity of the results? Thanks, Aviran

RE: Sorting on tokenized fields

2004-07-21 Thread Aviran
You can create a new field which contains the full untokenized string and use it as a sort field. -Original Message- From: Florian Sauvin [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 20, 2004 20:13 PM To: Lucene Users List Subject: Sorting on tokenized fields I see in the Javadoc that

RE: Sort: 1.4-rc3 vs. 1.4-final

2004-07-21 Thread Aviran
implementation. My guess is that you are right and there is a problem with the cache although I couldn't find what that is yet. Aviran -Original Message- From: Greg Gershman [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 21, 2004 9:22 AM To: [EMAIL PROTECTED] Subject: Sort: 1.4-rc3 vs. 1.4-final

RE: Sort: 1.4-rc3 vs. 1.4-final

2004-07-21 Thread Aviran
a new IndexReader. Aviran -Original Message- From: Greg Gershman [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 21, 2004 13:13 PM To: Lucene Users List Subject: RE: Sort: 1.4-rc3 vs. 1.4-final I've done a bit more snooping around; it seems that in FieldSortedHitQueue.getCachedComparator

RE: Sort: 1.4-rc3 vs. 1.4-final

2004-07-21 Thread Aviran
I just saw this post, I guess we both came to the same conclusion. The only problem is that the cached object never gets released, and a new one will get created every time you open a new IndexReader Aviran -Original Message- From: Greg Gershman [mailto:[EMAIL PROTECTED] Sent

RE: Sort: 1.4-rc3 vs. 1.4-final

2004-07-21 Thread Aviran
I will post a patch soon Aviran -Original Message- From: Doug Cutting [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 21, 2004 13:56 PM To: Lucene Users List Subject: Re: Sort: 1.4-rc3 vs. 1.4-final The key in the WeakHashMap should be the IndexReader, not the Entry. I think

RE: rebuild index

2004-07-22 Thread Aviran
Why don't you just build a new index in a different location, add the missing documents from the old index to the new one at the end, and then delete the old index? Aviran -Original Message- From: Sergiu Gordea [mailto:[EMAIL PROTECTED] Sent: Thursday, July 22, 2004 10:49 AM

RE: When does IndexReader pick up changes?

2004-07-29 Thread Aviran
An IndexReader sees the index as it was when the reader was opened. If new documents are added to the index, you need to open a new IndexReader in order for it to pick up the changes. Aviran -Original Message- From: Stephane James Vaucher [mailto:[EMAIL PROTECTED] Sent: Thursday, July 29, 2004 0:00 AM

RE: When does IndexReader pick up changes?

2004-07-29 Thread Aviran
in order for it to pick up the changes Aviran -Original Message- From: Stephane James Vaucher [mailto:[EMAIL PROTECTED] Sent: Thursday, July 29, 2004 0:00 AM To: Lucene Users List Subject: Re: When does IndexReader pick up changes? IIRC, if you use a searcher, changes

RE: Question on number of fields in a document.

2004-08-04 Thread Aviran
You should be fine, no problem with the number of fields -Original Message- From: John Z [mailto:[EMAIL PROTECTED] Sent: Wednesday, August 04, 2004 12:23 PM To: [EMAIL PROTECTED] Subject: Question on number of fields in a document. Hi I had a question related to number of fields in a

RE: index and search question

2004-08-09 Thread Aviran
yes -Original Message- From: Dmitrii PapaGeorgio [mailto:[EMAIL PROTECTED] Sent: Monday, August 16, 2004 9:23 AM To: [EMAIL PROTECTED] Subject: index and search question Ok so when I index a file such as below Document doc = new Document(); doc.Add(Field.Text(contents, new

RE: Indexing and Searching Database in Lucene

2004-08-20 Thread Aviran
You need to create a Lucene index from the database. Just index the columns and the records from the database. It is also useful to have a field in Lucene that contains the database's primary key, so you can retrieve the actual record from the database. Aviran -Original Message

RE: Sort Search Result

2004-08-24 Thread Aviran
Look at SortField http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/SortField.html -Original Message- From: Natarajan.T [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 11:35 AM To: 'Lucene Users List' Subject: Sort Search Result FYI, How can I get the

RE: Searching MySql index using lucene

2004-08-24 Thread Aviran
Just read your data from the database and create a Lucene Index for the columns you want to search -Original Message- From: sivalingam T [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 24, 2004 9:52 AM To: [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] Subject: Searching MySql index using

RE: problem with SortField[] in search method (newbie)

2004-09-15 Thread Aviran
You can only sort on an indexed field (moreover, sorting works properly only on untokenized fields, i.e. Keyword fields). Aviran -Original Message- From: Wermus Fernando [mailto:[EMAIL PROTECTED] Sent: Wednesday, September 15, 2004 13:13 PM To: [EMAIL PROTECTED] Subject: problem

RE: Questions related to closing the searcher

2004-09-23 Thread Aviran
The best way is to use IndexReader's getCurrentVersion() method to check whether the index has changed. If it has, just get a new Searcher. http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/index/IndexReader.html#getCurrentVersion(java.lang.String) Aviran -Original Message
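The check-then-reopen idea above can be sketched roughly as follows. This is only an illustration, not Lucene code: RefreshingHolder is a hypothetical name, the LongSupplier stands in for a call such as IndexReader.getCurrentVersion(...), and the Supplier stands in for whatever opens a new Searcher. Closing the old searcher is left out and would need care in a real application.

```java
import java.util.function.LongSupplier;
import java.util.function.Supplier;

// Hypothetical helper, not part of Lucene: hands out a cached object and
// reopens it only when the reported index version has changed.
final class RefreshingHolder<T> {
    private final LongSupplier currentVersion; // assumption: wraps the index version check
    private final Supplier<T> opener;          // assumption: opens a fresh searcher
    private long seenVersion;
    private T current;

    RefreshingHolder(LongSupplier currentVersion, Supplier<T> opener) {
        this.currentVersion = currentVersion;
        this.opener = opener;
    }

    // Synchronized so that concurrent callers do not open two searchers at once.
    synchronized T get() {
        long version = currentVersion.getAsLong();
        if (current == null || version != seenVersion) {
            current = opener.get();   // index changed (or first use): reopen
            seenVersion = version;
        }
        return current;
    }

    public static void main(String[] args) {
        long[] fakeVersion = {1L};    // simulated index version
        RefreshingHolder<Object> holder =
                new RefreshingHolder<>(() -> fakeVersion[0], Object::new);
        Object first = holder.get();
        System.out.println(first == holder.get()); // same version, same instance
        fakeVersion[0] = 2L;                       // simulate an index update
        System.out.println(first == holder.get()); // version changed, new instance
    }
}
```

The version is compared for inequality rather than "greater than", so any change reported by the version check triggers a reopen.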

RE: Keyword query confusion

2004-09-24 Thread Aviran
The StandardAnalyzer removes the 1 as it is a stop word. There are two ways you can work around this problem: (1) as you mentioned, create a Query object programmatically; (2) use WhitespaceAnalyzer instead of StandardAnalyzer. Aviran -Original Message- From: Fred Toth [mailto

RE: how to find field that has any value

2004-10-11 Thread Aviran
empty and then query for -test:empty Aviran -Original Message- From: MATL (Mats Lindberg) [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 06, 2004 16:27 PM To: Lucene Users List Subject: how to find field that has any value Hello i have a probably simple question for some of you

RE: A simple newbee question . How do i exclude a field ?

2004-10-11 Thread Aviran
For the records that don't contain a field you can put a bogus value such as empty and then you can query on -UD:empty Aviran http://aviran.mordos.com -Original Message- From: Robinson Raju [mailto:[EMAIL PROTECTED] Sent: Saturday, October 09, 2004 10:25 AM To: Lucene Users List
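A minimal sketch of the placeholder approach, assuming the sentinel text "empty" from the message above. The class and method names are hypothetical and nothing here calls Lucene; it only shows which string to index for a missing value and which clause to add to the query text. Note that a genuine value equal to "empty" would collide with the sentinel, so a less likely token may be a safer choice.

```java
// Hypothetical helper, not part of Lucene.
final class MissingFieldSentinel {
    // Placeholder indexed for documents that have no value for the field.
    static final String SENTINEL = "empty";

    // Returns the string to index: the real value, or the sentinel if absent.
    static String valueToIndex(String raw) {
        return (raw == null || raw.trim().isEmpty()) ? SENTINEL : raw;
    }

    // Builds the prohibited clause that filters out sentinel documents.
    static String excludeMissing(String field) {
        return "-" + field + ":" + SENTINEL;
    }

    public static void main(String[] args) {
        System.out.println(valueToIndex(null));   // sentinel is indexed
        System.out.println(valueToIndex("abc"));  // real value is indexed
        System.out.println(excludeMissing("UD")); // clause to add to the query
    }
}
```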

RE: Hebrew support

2004-09-28 Thread Aviran
As far as I know there is no Analyzer for Hebrew. Aviran -Original Message- From: Alex Kiselevski [mailto:[EMAIL PROTECTED] Sent: Tuesday, September 28, 2004 3:12 AM To: [EMAIL PROTECTED] Subject: Hebrew support Hello, Do you know something about hebrew support in Lucene Thanks

RE: Sorting on a long string

2004-09-29 Thread Aviran
Currently Lucene can only sort on a Keyword field properly. I guess your field is tokenized, in which case the sort does not work properly. A patch has been suggested to fix this problem (but has not been applied yet): http://issues.apache.org/bugzilla/show_bug.cgi?id=30382 Aviran

RE: multiple threads

2004-10-04 Thread Aviran
You should not have more than one IndexWriter. (You can have multiple IndexReaders, but only one IndexWriter). Aviran -Original Message- From: Justin Swanhart [mailto:[EMAIL PROTECTED] Sent: Friday, October 01, 2004 19:14 PM To: [EMAIL PROTECTED] Subject: multiple threads As I

RE: Null or no analyzer

2004-10-19 Thread Aviran
You can use WhitespaceAnalyzer Aviran http://aviran.mordos.com -Original Message- From: Rupinder Singh Mazara [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 11:23 AM To: Lucene Users List Subject: Null or no analyzer Hi All I have a question regarding selection

RE: Null or no analyzer

2004-10-20 Thread Aviran
AFAIK, if the term Election 2004 is enclosed in quotation marks, this should work fine. Aviran http://aviran.mordos.com -Original Message- From: Morus Walter [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 20, 2004 2:25 AM To: Lucene Users List Subject: RE: Null or no analyzer Aviran

RE: Spell checker

2004-10-20 Thread Aviran
Here http://issues.apache.org/bugzilla/showattachment.cgi?attach_id=13009 Aviran http://aviran.mordos.com -Original Message- From: Lynn Li [mailto:[EMAIL PROTECTED] Sent: Wednesday, October 20, 2004 10:52 AM To: 'Lucene Users List' Subject: RE: Spell checker Where can I download

RE: index files version and lucene 1.4

2004-10-21 Thread Aviran
Lucene 1.4 changed the file format for indexes. You can access an old index using Lucene 1.4, but you can't access an index that was created with Lucene 1.4 using older versions. I suggest you rebuild your index using Lucene 1.4. Aviran http://aviran.mordos.com -Original Message- From: arnaud

RE: Faster highlighting with TermPositionVectors

2004-11-03 Thread Aviran
, Aviran -Original Message- From: Bruce Ritchie [mailto:[EMAIL PROTECTED] Sent: Friday, October 29, 2004 1:15 AM To: Lucene Users List Subject: RE: Faster highlighting with TermPositionVectors Mark, Thanks to the recent changes (see CVS) in TermFreqVector support we can now make use of term

RE: Steamming

2004-11-15 Thread Aviran
I don't understand what kind of examples you need. All there is to it is to use a different analyzer. Take a look at the Snowball analyzer in Lucene's sandbox. Aviran http://www.aviransplace.com -Original Message- From: Miguel Angel [mailto:[EMAIL PROTECTED] Sent: Monday, November 15

RE: best ways of using IndexSearcher

2004-11-17 Thread Aviran
Yes, IndexSearcher is thread safe. Aviran http://www.aviransplace.com -Original Message- From: Abhay Saswade [mailto:[EMAIL PROTECTED] Sent: Tuesday, November 16, 2004 15:16 PM To: Lucene Users List Subject: Re: best ways of using IndexSearcher Hello, Can I use single instance

RE: lucene transaction and roll back implementation

2004-11-18 Thread Aviran
AFAIK there is no transaction or rollback support in Lucene. Aviran http://www.aviransplace.com -Original Message- From: John Wang [mailto:[EMAIL PROTECTED] Sent: Wednesday, November 17, 2004 20:25 PM To: [EMAIL PROTECTED] Subject: lucene transaction and roll back implementation Hi

RE: finalize delete without optimize

2004-12-09 Thread Aviran
Lucene's standard API does not support this kind of operation. Aviran http://www.aviransplace.com -Original Message- From: John Wang [mailto:[EMAIL PROTECTED] Sent: Wednesday, December 08, 2004 17:32 PM To: [EMAIL PROTECTED] Subject: Re: finalize delete without optimize Hi folks

RE: InderWriter.optimize()

2004-12-09 Thread Aviran
Besides merging the segments, optimize also physically removes all the deleted documents from the index. (When you call delete, Lucene only marks the documents as deleted; they are physically removed when you call optimize.) Aviran http://www.aviransplace.com -Original Message- From: Yura

RE: Retrieving all docs in the index

2004-12-09 Thread Aviran
In this case you'll have to add another field with a fixed value to all the documents and query on that field Aviran http://www.aviransplace.com -Original Message- From: Ravi [mailto:[EMAIL PROTECTED] Sent: Thursday, December 09, 2004 14:04 PM To: Lucene Users List Subject: RE

RE: sorting tokenized field

2004-12-10 Thread Aviran
I have suggested a solution for this problem (http://issues.apache.org/bugzilla/show_bug.cgi?id=30382); you can use the patch suggested there and recompile Lucene. Aviran http://www.aviransplace.com -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Friday, December

RE: Querying Question

2003-04-03 Thread Aviran Mordo
You should not tokenize the file name; instead you should use doc.add(new Field(name, value, true, true, false)); or doc.add(Field.Keyword(name, value)); Aviran -Original Message- From: Rob Outar [mailto:[EMAIL PROTECTED] Sent: Thursday, April 03, 2003 5:27 PM To: Lucene

RE: Sort results by date alone?

2003-05-29 Thread Aviran Mordo
I think I saw a solution for this in the past. Try searching the mailing list. Anyway, you can always use the SearchBean, which is in the Lucene sandbox, to sort by any field. -Original Message- From: news [mailto:[EMAIL PROTECTED] On Behalf Of David Weitzman Sent: Tuesday, May 27, 2003 8:26 PM

RE: Wildcard workaround

2003-05-29 Thread Aviran Mordo
You can also index the file names with a leading character. For instance, file1.exe is indexed as _file1.exe, and you always add the leading character to the search term. So if the user input is *.exe your query should be _*.exe, and if the user input is fi* you change it to _fi* Aviran
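The leading-character trick above amounts to two small string rewrites, one at index time and one at query time. A rough sketch, with hypothetical class and method names and the underscore marker taken from the example in the message (nothing here calls Lucene itself):

```java
// Hypothetical helper, not part of Lucene.
final class LeadingCharWorkaround {
    // Marker prepended to every indexed file name.
    static final String MARKER = "_";

    // Value to put in the index for a given file name.
    static String toIndexedName(String fileName) {
        return MARKER + fileName;
    }

    // Rewrites the user's pattern so that it never starts with a wildcard.
    static String toQueryTerm(String userInput) {
        return MARKER + userInput;
    }

    public static void main(String[] args) {
        System.out.println(toIndexedName("file1.exe")); // _file1.exe
        System.out.println(toQueryTerm("*.exe"));       // _*.exe
        System.out.println(toQueryTerm("fi*"));         // _fi*
    }
}
```

The marker has to be stripped again before file names are shown to the user, and the field should not be run through an analyzer that would drop or split on the marker character.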

RE: query question in trouble

2003-06-11 Thread Aviran Mordo
"In" is probably a stop word in your analyzer -Original Message- From: Ryan Clifton [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 11, 2003 3:13 PM To: Lucene Users List Subject: query question in trouble Hello, Upon reviewing the results of some queries recently I noticed that the

RE: date ranges.....

2003-06-27 Thread Aviran Mordo
Use RangeQuery to search on the date field -Original Message- From: host unknown [mailto:[EMAIL PROTECTED] Sent: Friday, June 27, 2003 10:39 AM To: [EMAIL PROTECTED] Subject: date ranges. Hi all Here's my scenario I'm building a calendaring application and using Lucene (one

RE: Using Lucene in an multiple index/large io scenario

2003-06-30 Thread Aviran Mordo
You'll probably need to optimize the index more often. This will reduce the number of files Lucene opens. If you can merge several fields into one, that will also reduce the number of files. Aviran -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Monday, June

RE: Results sorted by date instead of score?

2003-07-03 Thread Aviran Mordo
You'll need to sort the results after you have collected them. There is a project called SortedField in Lucene's contribution area or sandbox (I don't remember exactly which) that will help you sort by any field. -Original Message- From: Wilton, Reece [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 02,

RE: Maybe a stupid question?

2003-07-10 Thread Aviran Mordo
You can add as many fields with the same name as your heart desires to the same document. This will give you multiple values. Aviran -Original Message- From: Olivier Cochet [mailto:[EMAIL PROTECTED] Sent: Thursday, July 10, 2003 10:43 AM To: Lucene Users List Subject: Maybe a stupid

RE: keyword indexing

2003-07-16 Thread Aviran Mordo
If you are searching on a Keyword field you might need to use TermQuery in order to get an exact match -Original Message- From: Jan Agermose [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 16, 2003 1:04 PM To: [EMAIL PROTECTED] Subject: keyword indexing I'm having some problems with chars in

Bug: TermQuery toString - incorrect

2003-07-30 Thread Aviran Mordo
use Lucene 1.3 RC1. Aviran

RE: Newbie Questions

2003-08-26 Thread Aviran Mordo
1. You need to use MultiFieldQueryParser 2. I think you should use PorterStemFilter instead of fuzzy query http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/analysis/PorterStemFilter.html -Original Message- From: Mark Woon [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 26,

RE: RC2 requires reindexing?

2003-08-29 Thread Aviran Mordo
You can find RC2 in CVS -Original Message- From: Jan Agermose [mailto:[EMAIL PROTECTED] Sent: Friday, August 29, 2003 6:32 AM To: Lucene Users List Subject: Re: RC2 requires reindexing? Ok, on the first posting about RC2 I looked for it, but as I did not find any RC2 I guessed he was