Thank you.
From: dawid.we...@gmail.com
Date: Thu, 19 Jul 2012 13:34:26 +0200
Subject: Re: RAM or SSD...
To: java-user@lucene.apache.org
Read this:
http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html
Dawid
On Thu, Jul 19, 2012 at 1:32 PM, Dragon Fly dragon-fly
, 2012 at 10:39 PM, Toke Eskildsen t...@statsbiblioteket.dk
wrote:
On Wed, 2012-07-18 at 17:50 +0200, Dragon Fly wrote:
If I want to improve performance, which of the following is better and
why?
1. Buy a machine with a lot of RAM and use a RAMDirectory for the index.
As others has
Hi,
If I want to improve performance, which of the following is better and why?
1. Buy a machine with a lot of RAM and use a RAMDirectory for the index.
2. Put the index on a solid state drive.
By the way, my index is about 30 GB. Thank you.
variant
that takes a Sort parameter?
Best
Erick
On Sun, Feb 26, 2012 at 8:30 AM, Dragon Fly dragon-fly...@hotmail.com wrote:
Hi,
Let's say I have 6 documents and each document has 2 fields (i.e.
CustomerName and OrderDate). For example:
Doc 1John20120115
Doc 2Mary
Hi,
Let's say I have 6 documents and each document has 2 fields (i.e. CustomerName
and OrderDate). For example:
Doc 1John20120115
Doc 2Mary20120113
Doc 3Peter 20120117
Doc 4Kate20120208
Doc 5John20120211
Doc 6Alan20110423
Is there a way to
recent.
In the 2.9 both IndexReader and IndexWriter have commit() methods.
Best
Erick
On Tue, Jan 3, 2012 at 8:35 AM, Dragon Fly dragon-fly...@hotmail.com wrote:
Hi, I'm using Lucene 2.0 and was wondering how to flush/commit index data
to disk. It doesn't look like there is a flush
Hi, I'm using Lucene 2.0 and was wondering how to flush/commit index data to
disk. It doesn't look like there is a flush() or commit() method in the 2.0
IndexWriter. Is there a way to flush the data without calling close()? Thank
you.
Hi,
Which of the following method actually loads the document from disk?
(1) Document document = searcher.doc (docId);
OR
(2) string value = document.get (FirstNameField);
It's probably searcher.doc but I just want to be sure. Thank you.
Hi,
I have a question regarding RAMDirectory. I have a 5 GB index on disk and it
is opened like the following:
searcher = new IndexSearcher (new RAMDirectory (indexDirectory));
Approximately how much memory is needed to load the index? 5GB of memory or
10GB because of Unicode? Does the
: +49 421 218 65505
http://www.pangaea.de/
E-mail: uschind...@pangaea.de
-Original Message-
From: Dragon Fly [mailto:dragon-fly...@hotmail.com]
Sent: Thursday, July 23, 2009 3:38 PM
To: java-user@lucene.apache.org
Subject: Loading an index into memory
Hi
Let's say I have 3 fields in a document (Type, FirstName, and LastName). For
example:
Document 0
--
Type: Public
FirstName: John
LastName: Deere
If I execute the following boolean query, document 0 is returned.
Type:Public OR FirstName:Candy OR LastName:Deere
, and bookmark_title_35:Another
bookmark in the same document, and I can search bookmark titles by specific
users.
On Wed, Feb 11, 2009 at 3:55 PM, Dragon Fly dragon-fly...@hotmail.comwrote:
Hi,
Let's say I have a single document with 2 fields (namely Field1 and
Field2). 2 values are added
Hi,
Let's say I have a single document with 2 fields (namely Field1 and Field2). 2
values are added to each field like below.
// Add 2 values to Field1.
doc.Add (new Field (Field1, A, Field.Store.YES, Field.Index.UN_TOKENIZED));
doc.Add (new Field (Field1, B, Field.Store.YES,
...@syr.edu
To: java-user@lucene.apache.org
Hi Dragon Fly,
You could split the original document into multiple Lucene Documents,
one for each array index, all sharing the same DocID field value.
Then your queries just work. But you'd have to do result
consolidation, removing duplicate original
http://www.gossamer-threads.com/lists/lucene/java-user/
Date: Fri, 7 Nov 2008 14:27:38 -0700
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
Subject: searchable archives
Hey,
Is this list available somewhere that you can search the entire archives at
one time?
Thanks,
Chad
Thank you both for your help.
Date: Fri, 31 Oct 2008 09:06:50 +0100
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
Subject: Re: Read all the data from an index
Erick Erickson wrote:
I'm not sure what *could* be easier than looping with IndexSearcher.doc(),
looping from 1 to
Hi,
I have an old index that was built a few months ago. The data that I used to
build the index has been deleted from the database. I'd like to read all the
data from the old index to build a new index. Which Lucene API calls should I
use to read all the data from the old index? Thank you
Well, that's trickier than you might think. You can easily get
all the STORED data just by getting doc IDs 1-MaxDoc(). But
reconstructing the data from data that is NOT stored is more
difficult. Luke tries, but it may be a lossy process.
Best
Erick
On Thu, Oct 30, 2008 at 3:24 PM, Dragon
Let's say my index has two fields (Type and Description). Type is either 0 or
1 and Description is a string (up to 250 characters). I'd like to execute the
following search:
+Description:Honda* +Type:0
Would the query run faster if I specify the Type first:
+Type:0
I'd like to delete some documents from my index. Should I use the
DeleteDocument method in the IndexReader class or the IndexModifier class? Does
it make a difference which one I use? Thank you.
_
Get more from your digital life.
IndexReader back and forth to do the deletions.
IndexModifier is deprecated.
Mike
Dragon Fly wrote:
I'd like to delete some documents from my index. Should I use the
DeleteDocument method in the IndexReader class or the IndexModifier
class? Does it make a difference which one I use
Perhaps I didn't explain myself clearly so please let me try it again. I'm
happy with the search/indexing performance. However, my index gets fully
optimized every 4 hours and the time it takes to fully optimize the index is
longer than I like. Is there anything that I can do to speed up
is made active.
Date: Wed, 30 Jul 2008 14:54:03 +0100
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
Subject: Re: Index optimization ...
Why do you run an optimize every 4 hours?
--
Ian.
On Wed, Jul 30, 2008 at 2:46 PM, Dragon Fly [EMAIL PROTECTED] wrote:
Perhaps I didn't
2008 15:03:37 +0100
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
Subject: Re: Index optimization ...
OK, but why do you need to optimize before every swap? Have you tried
with less frequent optimizes?
--
Ian.
On Wed, Jul 30, 2008 at 3:00 PM, Dragon Fly [EMAIL PROTECTED
, at 9:46 AM, Dragon Fly wrote:
Perhaps I didn't explain myself clearly so please let me try it
again. I'm happy with the search/indexing performance. However,
my index gets fully optimized every 4 hours and the time it takes to
fully optimize the index is longer than I like
I'd like to shorten the time it takes to optimize my index and am willing to
sacrifice search and indexing performance. Which parameters (e.g. merge
factor) should I change? Thank you.
_
Stay in touch when you're away with Windows
Thanks.
Date: Mon, 24 Mar 2008 21:03:13 -0700
From: [EMAIL PROTECTED]
To: java-user@lucene.apache.org
Subject: RE: Field values ...
: The Id and Phone fields are stored. So I can just do a MatchAllQuery as
: you suggested. I have read about field selectors on this mailing list
:
The Id and Phone fields are stored. So I can just do a MatchAllQuery as you
suggested. I have read about field selectors on this mailing list but have
never used it. Does anyone know where I can find some sample code? Thank you.
Date: Sat, 22 Mar 2008 16:03:54 -0700
From: [EMAIL PROTECTED]
What's the easiest way to extract the values of 2 fields from each document in
the index. For example, each document has 5 fields:
Id Name Address Phone Preference
I'd like to extract the values for the Id and Phone fields for each document in
the index. Thank you.
, 2008 at 9:55 AM, Dragon Fly [EMAIL PROTECTED]
wrote:
What's the easiest way to extract the values of 2 fields from each
document in the index. For example, each document has 5 fields:
Id Name Address Phone Preference
I'd like to extract the values for the Id and Phone
20, 2008 at 9:55 AM, Dragon Fly [EMAIL PROTECTED]
wrote:
What's the easiest way to extract the values of 2 fields from each
document in the index. For example, each document has 5 fields:
Id Name Address Phone Preference
I'd like to extract the values for the Id
Hi Robert,
Did you run into any performance issues (because multiple searchers accessed a
single index on a shared directory)? Also, did you employ some redundancy
scheme to ensure that the shared directory is always available? Thank you.
To: java-user@lucene.apache.org
Subject: Re: Lucene
Hi,
I'd like to find out if I can do the following with Lucene (on Windows).
On server A:
- An index writer creates/updates the index. The index is physically stored on
server A.
- An index searcher searches against the index.
On server B:
- Maps to the index directory.
- An index searcher
Dragon Fly wrote:
Hi,
I'd like to find out if I can do the following with Lucene (on
Windows).
On server A:
- An index writer creates/updates the index. The index is
physically stored on server A.
- An index searcher searches against the index.
On server B:
- Maps
Hi,
My application needs to close/open the index searcher periodically so that
newly added documents are visible. Is there a way to determine if there are
any pending searches running against an index searcher or do I have to do my
own reference counting? Thank you.
@lucene.apache.org
Subject: Re: Sort by date with Lucene 2.2.0 ...
Date: Fri, 19 Oct 2007 19:50:46 +0200
On Thursday 18 October 2007 21:35, Dragon Fly wrote:
I'm am trying to sort a date field in my index but I'm seeing strange
results. I have searched the Lucene user mail archive for Datetools
the obvious (a specialty of mine)...
Erick
On 10/18/07, Dragon Fly [EMAIL PROTECTED] wrote:
Hi,
I'm am trying to sort a date field in my index but I'm seeing strange
results. I have searched the Lucene user mail archive for Datetools but
still couldn't figure out the problem
Hi,
I'm am trying to sort a date field in my index but I'm seeing strange results.
I have searched the Lucene user mail archive for Datetools but still couldn't
figure out the problem.
The date field is indexed as follows (i.e. DateTools is used, date field is
stored and untokenized):
Mike, which version of Lucene supports lazy loading? Thanks.
From: Michael McCandless [EMAIL PROTECTED]
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Re: Field compression too slow
Date: Fri, 11 Aug 2006 06:59:58 -0400
I can share the data.. but it would be
My index gets rebuilt every night so I probably can afford
to construct the filters right after the index is rebuilt. How
do I check each document (for empty fields) though? Would
I use an IndexReader to loop through the documents? If so,
which method(s) in the IndexReader class should I use?
Thank you very much.
From: Erick Erickson [EMAIL PROTECTED]
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Re: Empty fields ...
Date: Wed, 19 Jul 2006 09:48:04 -0400
Try something like
TermDocs termDocs = reader.termDocs();
termDocs.seek(new
Thanks for the quick reply, Erick. A couple of follow-up questions though.
If I had 200 fields (instead of 10), would you still recommend the same
approach? Would 200 filters use up too much memory? Would 200 filters
be too slow to construct/search?
From: Erick Erickson [EMAIL PROTECTED]
Hi,
I have an index that contains 3 fields: Book Id, Book Title, and Related
Book Ids.
For example:
=
Book Id Book Title Related Book Ids
A0001 Title 1 A0003, A0004
A0002 Title 2
A0003 Title 3 A0001, A0002
A0004 Title
I'll give it a try, thanks.
From: Yonik Seeley [EMAIL PROTECTED]
Reply-To: java-user@lucene.apache.org
To: java-user@lucene.apache.org
Subject: Re: Performance ...
Date: Mon, 22 May 2006 11:40:46 -0400
On 5/22/06, Dragon Fly [EMAIL PROTECTED] wrote:
The search results of my Lucene application
Hi,
The search results of my Lucene application are always sorted
alphabetically.
Therefore, score and relevance are not needed. With that said, is there
anything that I can disable to:
(a) Improve the search performance
(b) Reduce the size of the index
(c) Shorten the indexing time
Thank
Hi,
What is the best way to implement the following?
Document 1 contains the following text:
THE CZECH REPUBLIC ORGANIZATION
Document 2 contains the following text:
THE CZE ORGANISATION
Synonym rules:
(1) CZECH REPUBLIC -- CZE
(2) CZE -- CZECH REPUBLIC
(3) ORGANIZATION -- ORG,
46 matches
Mail list logo