You'll need to apply some kind of filter or add another field to the index
which contains only the first word (Yes you'll need to rebuild the index in
this case)
-Original Message-
From: Armbrust, Daniel C. [mailto:[EMAIL PROTECTED]
Sent: Thursday, December 04, 2003 5:49 PM
To: 'Lucene
This is kind of a problem, in order to delete documents using terms you need
to have a keyword field which contain a unique value, otherwise you might
ending deleting more then you want.
-Original Message-
From: Mike Hogan [mailto:[EMAIL PROTECTED]
Sent: Friday, December 05, 2003 1:06 PM
).
My question is: do the fields byNumber and byName have to be synchronized
and what can happen if I'll change them to be ArrayList and HashMap which
are not synchronized ? Can this corrupt the index or the integrity of the
results?
Thanks,
Aviran
You can create a new field which contains the full untokened string and use
it as a sort field.
-Original Message-
From: Florian Sauvin [mailto:[EMAIL PROTECTED]
Sent: Tuesday, July 20, 2004 20:13 PM
To: Lucene Users List
Subject: Sorting on tokenized fields
I see in the Javadoc that
implementation. My guess is that you are right and there is a problem with
the cache although I couldn't find what that is yet.
Aviran
-Original Message-
From: Greg Gershman [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 21, 2004 9:22 AM
To: [EMAIL PROTECTED]
Subject: Sort: 1.4-rc3 vs. 1.4-final
a new IndexReader.
Aviran
-Original Message-
From: Greg Gershman [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 21, 2004 13:13 PM
To: Lucene Users List
Subject: RE: Sort: 1.4-rc3 vs. 1.4-final
I've done a bit more snooping around; it seems that in
FieldSortedHitQueue.getCachedComparator
I just saw this post, I guess we both came to the same conclusion.
The only problem is that the cached object never gets released, and a new
one will get created every time you open a new IndexReader
Aviran
-Original Message-
From: Greg Gershman [mailto:[EMAIL PROTECTED]
Sent
I will post a patch soon
Aviran
-Original Message-
From: Doug Cutting [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 21, 2004 13:56 PM
To: Lucene Users List
Subject: Re: Sort: 1.4-rc3 vs. 1.4-final
The key in the WeakHashMap should be the IndexReader, not the Entry. I
think
Why don't you just build a new index in a different location and at the end
add the missing documents from the old index to the new one, and then delete
the old index.
Aviran
-Original Message-
From: Sergiu Gordea [mailto:[EMAIL PROTECTED]
Sent: Thursday, July 22, 2004 10:49 AM
IndexReader will pick the changes as it is opened.
If new documents are added to the index you need to open a new IndexReader
in order for it to pick up the changes
Aviran
-Original Message-
From: Stephane James Vaucher [mailto:[EMAIL PROTECTED]
Sent: Thursday, July 29, 2004 0:00 AM
in order for
it to pick up the changes
Aviran
-Original Message-
From: Stephane James Vaucher [mailto:[EMAIL PROTECTED]
Sent: Thursday,
July 29, 2004 0:00 AM
To: Lucene Users List
Subject: Re: When does IndexReader
pick up changes?
IIRC, if you use a searcher, changes
You should be fine, no problem with the number of fields
-Original Message-
From: John Z [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 04, 2004 12:23 PM
To: [EMAIL PROTECTED]
Subject: Question on number of fields in a document.
Hi
I had a question related to number of fields in a
yes
-Original Message-
From: Dmitrii PapaGeorgio [mailto:[EMAIL PROTECTED]
Sent: Monday, August 16, 2004 9:23 AM
To: [EMAIL PROTECTED]
Subject: index and search question
Ok so when I index a file such as below
Document doc = new Document();
doc.Add(Field.Text(contents, new
You need to create a lucene index from the database.
Just index the columns and the records from the database.
It will be useful to have also a field in lucene that contains the
database's primary key, so you can retrieve the actual record from the
database
Aviran
-Original Message
Look at SortField
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/SortField
.html
-Original Message-
From: Natarajan.T [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 11:35 AM
To: 'Lucene Users List'
Subject: Sort Search Result
FYI,
How can I get the
Just read your data from the database and create a Lucene Index for the
columns you want to search
-Original Message-
From: sivalingam T [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 24, 2004 9:52 AM
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Subject: Searching MySql index using
You can only sort on indexed field. (even more than that, it'll work
properly only on Untokenized fields, ie keyword).
Aviran
-Original Message-
From: Wermus Fernando [mailto:[EMAIL PROTECTED]
Sent: Wednesday, September 15, 2004 13:13 PM
To: [EMAIL PROTECTED]
Subject: problem
The best way is to use IndexReader's getCurrentVersion() method to check
whether the index has changed. If it has, just get a new Searcher
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/index/IndexReade
r.html#getCurrentVersion(java.lang.String)
Aviran
-Original Message
The StandardAnalyzer removes the 1 as it is a stop word.
There are two ways you can work around this problem.
1 as you mentioned is to create a Query object programmatically.
2 You can use WhiteSpace Analyzer instead of StandardAnalyzer.
Aviran
-Original Message-
From: Fred Toth [mailto
empty and then query for
-test:empty
Aviran
-Original Message-
From: MATL (Mats Lindberg) [mailto:[EMAIL PROTECTED]
Sent: Wednesday, October 06, 2004 16:27 PM
To: Lucene Users List
Subject: how to find field that has any value
Hello
i have a probably simple question for some of you
For the records that don't contain a field you can put a bogus value such as
empty and then you can query on -UD:empty
Aviran
http://aviran.mordos.com
-Original Message-
From: Robinson Raju [mailto:[EMAIL PROTECTED]
Sent: Saturday, October 09, 2004 10:25 AM
To: Lucene Users List
As far as I know there is no Analyzer for Hebrew.
Aviran
-Original Message-
From: Alex Kiselevski [mailto:[EMAIL PROTECTED]
Sent: Tuesday, September 28, 2004 3:12 AM
To: [EMAIL PROTECTED]
Subject: Hebrew support
Hello,
Do you know something about hebrew support in Lucene
Thanks
Currently Lucene can only sort on a Keyword field properly.
I guess your field is tokenized, which in this case the sort does not work
properly.
A patch has been suggested to fix this problem ( but has not been applied
yet )
http://issues.apache.org/bugzilla/show_bug.cgi?id=30382
Aviran
You should not have more then one IndexWriter. (You can have multiple
IndexReaders, but only one IndexWriter).
Aviran
-Original Message-
From: Justin Swanhart [mailto:[EMAIL PROTECTED]
Sent: Friday, October 01, 2004 19:14 PM
To: [EMAIL PROTECTED]
Subject: multiple threads
As I
You can use WhiteSpaceAnalyzer
Aviran
http://aviran.mordos.com
-Original Message-
From: Rupinder Singh Mazara [mailto:[EMAIL PROTECTED]
Sent: Tuesday, October 19, 2004 11:23 AM
To: Lucene Users List
Subject: Null or no analyzer
Hi All
I have a question regarding selection
AFIK if the term Election 2004 will be between quotation marks this should
work fine.
Aviran
http://aviran.mordos.com
-Original Message-
From: Morus Walter [mailto:[EMAIL PROTECTED]
Sent: Wednesday, October 20, 2004 2:25 AM
To: Lucene Users List
Subject: RE: Null or no analyzer
Aviran
Here http://issues.apache.org/bugzilla/showattachment.cgi?attach_id=13009
Aviran
http://aviran.mordos.com
-Original Message-
From: Lynn Li [mailto:[EMAIL PROTECTED]
Sent: Wednesday, October 20, 2004 10:52 AM
To: 'Lucene Users List'
Subject: RE: Spell checker
Where can I download
Lucene 1.4 changed the file format for indexes. You can access a old index
using lucene 1.4 but you can't access index which was created using lucene
1.4 with older versions.
I suggest you rebuild your index using lucene 1.4
Aviran
http://aviran.mordos.com
-Original Message-
From: arnaud
,
Aviran
-Original Message-
From: Bruce Ritchie [mailto:[EMAIL PROTECTED]
Sent: Friday, October 29, 2004 1:15 AM
To: Lucene Users List
Subject: RE: Faster highlighting with TermPositionVectors
Mark,
Thanks to the recent changes (see CVS) in TermFreqVector
support we can now make use of term
I don't understand what kind of examples you need. All there is to it is
just use a different analyzer.
Take a look at Snowball analyzer in lucene's sand box.
Aviran
http://www.aviransplace.com
-Original Message-
From: Miguel Angel [mailto:[EMAIL PROTECTED]
Sent: Monday, November 15
Yes, IndexSearcher is thread safe.
Aviran
http://www.aviransplace.com
-Original Message-
From: Abhay Saswade [mailto:[EMAIL PROTECTED]
Sent: Tuesday, November 16, 2004 15:16 PM
To: Lucene Users List
Subject: Re: best ways of using IndexSearcher
Hello,
Can I use single instance
AFIK there is no transaction not rollback support in lucene
Aviran
http://www.aviransplace.com
-Original Message-
From: John Wang [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 17, 2004 20:25 PM
To: [EMAIL PROTECTED]
Subject: lucene transaction and roll back implementation
Hi
Lucene standard API does not support this kind of operation.
Aviran
http://www.aviransplace.com
-Original Message-
From: John Wang [mailto:[EMAIL PROTECTED]
Sent: Wednesday, December 08, 2004 17:32 PM
To: [EMAIL PROTECTED]
Subject: Re: finalize delete without optimize
Hi folks
Beside merging the segments, optimize also physically deletes all the
deleted documents from the index (When you call delete, lucene only marks
the documents as deleted, they physically deleted when you call optimize).
Aviran
http://www.aviransplace.com
-Original Message-
From: Yura
In this case you'll have to add another field with a fixed value to all the
documents and query on that field
Aviran
http://www.aviransplace.com
-Original Message-
From: Ravi [mailto:[EMAIL PROTECTED]
Sent: Thursday, December 09, 2004 14:04 PM
To: Lucene Users List
Subject: RE
I have suggested a solution for this problem (
http://issues.apache.org/bugzilla/show_bug.cgi?id=30382 ) you can use the
patch suggested there and recompile lucene.
Aviran
http://www.aviransplace.com
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Friday, December
You should not tokenize the file name instead you should use
doc.add(new Field(name, value,
true, true, true));
Or
Doc.add(Field.keyword(name,value));
Aviran
-Original Message-
From: Rob Outar [mailto:[EMAIL PROTECTED]
Sent: Thursday, April 03, 2003 5:27 PM
To: Lucene
I think I saw a solution for this in the past. Try to search the mailing
list.
Anyway you can always use the SearchBean which is in lucene sandbox to
sort by any field.
-Original Message-
From: news [mailto:[EMAIL PROTECTED] On Behalf Of David Weitzman
Sent: Tuesday, May 27, 2003 8:26 PM
You can also index the file names with a leading character. For instance
index file1.exe will be indexed as _file1.exe and always add the
leading character to the search term.
So if the user input is *.exe your query should be _*.exe and if the
user input fi* you'll change it to _fi*
Aviran
In is probably a STOP word in your analyzer
-Original Message-
From: Ryan Clifton [mailto:[EMAIL PROTECTED]
Sent: Wednesday, June 11, 2003 3:13 PM
To: Lucene Users List
Subject: query question in trouble
Hello,
Upon reviewing the results of some queries recently I noticed that the
Use RangeQuery to search on the date field
-Original Message-
From: host unknown [mailto:[EMAIL PROTECTED]
Sent: Friday, June 27, 2003 10:39 AM
To: [EMAIL PROTECTED]
Subject: date ranges.
Hi all
Here's my scenario
I'm building a calendaring application and using Lucene (one
You'll probably need to optimize the index more often. This will reduce
the number of files lucene open. Also if you can merge several fields
into one, it will also reduce the number of files.
Aviran
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Monday, June
You'll need to sort the results after you collected them. There is a
project called SortedField in lucene's contribution or sandbox (I don't
remember exactly) which will help you sort by any field.
-Original Message-
From: Wilton, Reece [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 02,
You can add as many fields with the same name as your heart desire on
the same document. This will give you multiple values
Aviran
-Original Message-
From: Olivier Cochet [mailto:[EMAIL PROTECTED]
Sent: Thursday, July 10, 2003 10:43 AM
To: Lucene Users List
Subject: Maybe a stupid
If you are searching on keyword you might need to use TermQuery in order
to have an exact match
-Original Message-
From: Jan Agermose [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 16, 2003 1:04 PM
To: [EMAIL PROTECTED]
Subject: keyword indexing
I'm having some problems with chars in
use Lucene 1.3 RC1.
Aviran
1. You need to use MultiFieldQueryParser
2. I think you should use PorterStemFilter instead of fuzzy query
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/analysis/Por
terStemFilter.html
-Original Message-
From: Mark Woon [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 26,
You can find RC2 in CVS
-Original Message-
From: Jan Agermose [mailto:[EMAIL PROTECTED]
Sent: Friday, August 29, 2003 6:32 AM
To: Lucene Users List
Subject: Re: RC2 requires reindexing?
Ok, on the first posting about RC2 i looked for et, but as I did not
find any RC2 I guessed he was
48 matches
Mail list logo