RE: RAM or SSD...

2012-07-20 Thread Dragon Fly
Thank you. From: dawid.we...@gmail.com Date: Thu, 19 Jul 2012 13:34:26 +0200 Subject: Re: RAM or SSD... To: java-user@lucene.apache.org Read this: http://blog.thetaphi.de/2012/07/use-lucenes-mmapdirectory-on-64bit.html Dawid On Thu, Jul 19, 2012 at 1:32 PM, Dragon Fly dragon-fly

RE: RAM or SSD...

2012-07-19 Thread Dragon Fly
, 2012 at 10:39 PM, Toke Eskildsen t...@statsbiblioteket.dk wrote: On Wed, 2012-07-18 at 17:50 +0200, Dragon Fly wrote: If I want to improve performance, which of the following is better and why? 1. Buy a machine with a lot of RAM and use a RAMDirectory for the index. As others have

RAM or SSD...

2012-07-18 Thread Dragon Fly
Hi, If I want to improve performance, which of the following is better and why? 1. Buy a machine with a lot of RAM and use a RAMDirectory for the index. 2. Put the index on a solid state drive. By the way, my index is about 30 GB. Thank you.

RE: Most recent document within a group ...

2012-02-27 Thread Dragon Fly
variant that takes a Sort parameter? Best Erick On Sun, Feb 26, 2012 at 8:30 AM, Dragon Fly dragon-fly...@hotmail.com wrote: Hi, Let's say I have 6 documents and each document has 2 fields (i.e. CustomerName and OrderDate). For example: Doc 1: John, 20120115; Doc 2: Mary

Most recent document within a group ...

2012-02-26 Thread Dragon Fly
Hi, Let's say I have 6 documents and each document has 2 fields (i.e. CustomerName and OrderDate). For example: Doc 1: John, 20120115; Doc 2: Mary, 20120113; Doc 3: Peter, 20120117; Doc 4: Kate, 20120208; Doc 5: John, 20120211; Doc 6: Alan, 20110423. Is there a way to
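
As a rough illustration of the goal named in the subject line (the most recent document per customer), here is a minimal plain-Java sketch. It is not Lucene code and not an answer taken from the thread: `LatestPerGroup` and `latestPerCustomer` are hypothetical names, and the rows are the six sample documents from the message. It assumes OrderDate is always a fixed-width yyyyMMdd string, so string comparison agrees with date order.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only: plain Java, no Lucene API involved.
class LatestPerGroup {

    // Sample rows from the message: {doc number, CustomerName, OrderDate}.
    static final String[][] SAMPLE = {
        {"1", "John", "20120115"},
        {"2", "Mary", "20120113"},
        {"3", "Peter", "20120117"},
        {"4", "Kate", "20120208"},
        {"5", "John", "20120211"},
        {"6", "Alan", "20110423"},
    };

    // For each customer name, keep the row with the greatest OrderDate.
    static Map<String, String[]> latestPerCustomer(String[][] rows) {
        Map<String, String[]> latest = new LinkedHashMap<>();
        for (String[] row : rows) {
            String[] current = latest.get(row[1]);
            // Fixed-width yyyyMMdd strings compare in date order.
            if (current == null || row[2].compareTo(current[2]) > 0) {
                latest.put(row[1], row);
            }
        }
        return latest;
    }

    public static void main(String[] args) {
        for (String[] row : latestPerCustomer(SAMPLE).values()) {
            System.out.println("Doc " + row[0] + " " + row[1] + " " + row[2]);
        }
    }
}
```

With the sample data, the two John rows collapse into one entry and the other four customers keep their single row.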

RE: Commit data to disk ...

2012-01-05 Thread Dragon Fly
recent. In the 2.9 both IndexReader and IndexWriter have commit() methods. Best Erick On Tue, Jan 3, 2012 at 8:35 AM, Dragon Fly dragon-fly...@hotmail.com wrote: Hi, I'm using Lucene 2.0 and was wondering how to flush/commit index data to disk. It doesn't look like there is a flush

Commit data to disk ...

2012-01-03 Thread Dragon Fly
Hi, I'm using Lucene 2.0 and was wondering how to flush/commit index data to disk. It doesn't look like there is a flush() or commit() method in the 2.0 IndexWriter. Is there a way to flush the data without calling close()? Thank you.

Document loading

2009-10-06 Thread Dragon Fly
Hi, Which of the following methods actually loads the document from disk? (1) Document document = searcher.doc(docId); OR (2) String value = document.get(FirstNameField); It's probably searcher.doc but I just want to be sure. Thank you.

Loading an index into memory

2009-07-23 Thread Dragon Fly
Hi, I have a question regarding RAMDirectory. I have a 5 GB index on disk and it is opened like the following: searcher = new IndexSearcher (new RAMDirectory (indexDirectory)); Approximately how much memory is needed to load the index? 5GB of memory or 10GB because of Unicode? Does the

RE: Loading an index into memory

2009-07-23 Thread Dragon Fly
: +49 421 218 65505 http://www.pangaea.de/ E-mail: uschind...@pangaea.de -Original Message- From: Dragon Fly [mailto:dragon-fly...@hotmail.com] Sent: Thursday, July 23, 2009 3:38 PM To: java-user@lucene.apache.org Subject: Loading an index into memory Hi

Boolean query ...

2009-03-19 Thread Dragon Fly
Let's say I have 3 fields in a document (Type, FirstName, and LastName). For example: Document 0 -- Type: Public FirstName: John LastName: Deere If I execute the following boolean query, document 0 is returned. Type:Public OR FirstName:Candy OR LastName:Deere
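
A minimal plain-Java sketch of why document 0 is returned: an OR of term clauses matches as soon as any single clause matches. This only models the semantics and is not Lucene's query API; `OrQuerySketch`, `document0` and `matchesAny` are hypothetical names, and exact string equality stands in for term matching.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical model of OR semantics; not Lucene code.
class OrQuerySketch {

    // Document 0 from the message, as a field-to-value map.
    static Map<String, String> document0() {
        Map<String, String> doc = new LinkedHashMap<>();
        doc.put("Type", "Public");
        doc.put("FirstName", "John");
        doc.put("LastName", "Deere");
        return doc;
    }

    // True when at least one {field, value} clause equals the document's value.
    static boolean matchesAny(Map<String, String> doc, String[][] clauses) {
        for (String[] clause : clauses) {
            if (clause[1].equals(doc.get(clause[0]))) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String[][] query = {
            {"Type", "Public"}, {"FirstName", "Candy"}, {"LastName", "Deere"},
        };
        System.out.println(matchesAny(document0(), query));
    }
}
```

Here the Type and LastName clauses match, so the FirstName:Candy clause failing does not exclude the document.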

RE: Fields with multiple values...

2009-02-12 Thread Dragon Fly
, and bookmark_title_35:Another bookmark in the same document, and I can search bookmark titles by specific users. On Wed, Feb 11, 2009 at 3:55 PM, Dragon Fly dragon-fly...@hotmail.com wrote: Hi, Let's say I have a single document with 2 fields (namely Field1 and Field2). 2 values are added

Fields with multiple values...

2009-02-11 Thread Dragon Fly
Hi, Let's say I have a single document with 2 fields (namely Field1 and Field2). 2 values are added to each field like below. // Add 2 values to Field1. doc.Add(new Field("Field1", "A", Field.Store.YES, Field.Index.UN_TOKENIZED)); doc.Add(new Field("Field1", "B", Field.Store.YES,

RE: Fields with multiple values...

2009-02-11 Thread Dragon Fly
...@syr.edu To: java-user@lucene.apache.org Hi Dragon Fly, You could split the original document into multiple Lucene Documents, one for each array index, all sharing the same DocID field value. Then your queries just work. But you'd have to do result consolidation, removing duplicate original

RE: searchable archives

2008-11-07 Thread Dragon Fly
http://www.gossamer-threads.com/lists/lucene/java-user/ Date: Fri, 7 Nov 2008 14:27:38 -0700 From: [EMAIL PROTECTED] To: java-user@lucene.apache.org Subject: searchable archives Hey, Is this list available somewhere that you can search the entire archives at one time? Thanks, Chad

RE: Read all the data from an index

2008-11-03 Thread Dragon Fly
Thank you both for your help. Date: Fri, 31 Oct 2008 09:06:50 +0100 From: [EMAIL PROTECTED] To: java-user@lucene.apache.org Subject: Re: Read all the data from an index Erick Erickson wrote: I'm not sure what *could* be easier than looping with IndexSearcher.doc(), looping from 1 to

Read all the data from an index

2008-10-30 Thread Dragon Fly
Hi, I have an old index that was built a few months ago. The data that I used to build the index has been deleted from the database. I'd like to read all the data from the old index to build a new index. Which Lucene API calls should I use to read all the data from the old index? Thank you

RE: Read all the data from an index

2008-10-30 Thread Dragon Fly
Well, that's trickier than you might think. You can easily get all the STORED data just by getting doc IDs 1-MaxDoc(). But reconstructing the data from data that is NOT stored is more difficult. Luke tries, but it may be a lossy process. Best Erick On Thu, Oct 30, 2008 at 3:24 PM, Dragon

Wildcard query ...

2008-10-08 Thread Dragon Fly
Let's say my index has two fields (Type and Description). Type is either 0 or 1 and Description is a string (up to 250 characters). I'd like to execute the following search: +Description:Honda* +Type:0 Would the query run faster if I specify the Type first: +Type:0

Deleting documents ...

2008-08-10 Thread Dragon Fly
I'd like to delete some documents from my index. Should I use the DeleteDocument method in the IndexReader class or the IndexModifier class? Does it make a difference which one I use? Thank you.

RE: Deleting documents ...

2008-08-10 Thread Dragon Fly
IndexReader back and forth to do the deletions. IndexModifier is deprecated. Mike Dragon Fly wrote: I'd like to delete some documents from my index. Should I use the DeleteDocument method in the IndexReader class or the IndexModifier class? Does it make a difference which one I use

RE: Index optimization ...

2008-07-30 Thread Dragon Fly
Perhaps I didn't explain myself clearly so please let me try it again. I'm happy with the search/indexing performance. However, my index gets fully optimized every 4 hours and the time it takes to fully optimize the index is longer than I like. Is there anything that I can do to speed up

RE: Index optimization ...

2008-07-30 Thread Dragon Fly
is made active. Date: Wed, 30 Jul 2008 14:54:03 +0100 From: [EMAIL PROTECTED] To: java-user@lucene.apache.org Subject: Re: Index optimization ... Why do you run an optimize every 4 hours? -- Ian. On Wed, Jul 30, 2008 at 2:46 PM, Dragon Fly [EMAIL PROTECTED] wrote: Perhaps I didn't

RE: Index optimization ...

2008-07-30 Thread Dragon Fly
2008 15:03:37 +0100 From: [EMAIL PROTECTED] To: java-user@lucene.apache.org Subject: Re: Index optimization ... OK, but why do you need to optimize before every swap? Have you tried with less frequent optimizes? -- Ian. On Wed, Jul 30, 2008 at 3:00 PM, Dragon Fly [EMAIL PROTECTED

RE: Index optimization ...

2008-07-30 Thread Dragon Fly
, at 9:46 AM, Dragon Fly wrote: Perhaps I didn't explain myself clearly so please let me try it again. I'm happy with the search/indexing performance. However, my index gets fully optimized every 4 hours and the time it takes to fully optimize the index is longer than I like

Index optimization ...

2008-07-28 Thread Dragon Fly
I'd like to shorten the time it takes to optimize my index and am willing to sacrifice search and indexing performance. Which parameters (e.g. merge factor) should I change? Thank you.

RE: Field values ...

2008-03-25 Thread Dragon Fly
Thanks. Date: Mon, 24 Mar 2008 21:03:13 -0700 From: [EMAIL PROTECTED] To: java-user@lucene.apache.org Subject: RE: Field values ... : The Id and Phone fields are stored. So I can just do a MatchAllQuery as : you suggested. I have read about field selectors on this mailing list :

RE: Field values ...

2008-03-24 Thread Dragon Fly
The Id and Phone fields are stored. So I can just do a MatchAllQuery as you suggested. I have read about field selectors on this mailing list but have never used it. Does anyone know where I can find some sample code? Thank you. Date: Sat, 22 Mar 2008 16:03:54 -0700 From: [EMAIL PROTECTED]

Field values ...

2008-03-20 Thread Dragon Fly
What's the easiest way to extract the values of 2 fields from each document in the index? For example, each document has 5 fields: Id Name Address Phone Preference I'd like to extract the values for the Id and Phone fields for each document in the index. Thank you.

RE: Field values ...

2008-03-20 Thread Dragon Fly
, 2008 at 9:55 AM, Dragon Fly [EMAIL PROTECTED] wrote: What's the easiest way to extract the values of 2 fields from each document in the index? For example, each document has 5 fields: Id Name Address Phone Preference I'd like to extract the values for the Id and Phone

RE: Field values ...

2008-03-20 Thread Dragon Fly
20, 2008 at 9:55 AM, Dragon Fly [EMAIL PROTECTED] wrote: What's the easiest way to extract the values of 2 fields from each document in the index? For example, each document has 5 fields: Id Name Address Phone Preference I'd like to extract the values for the Id

RE: Lucene on a cluster environment

2008-03-19 Thread Dragon Fly
Hi Robert, Did you run into any performance issues (because multiple searchers accessed a single index on a shared directory)? Also, did you employ some redundancy scheme to ensure that the shared directory is always available? Thank you. To: java-user@lucene.apache.org Subject: Re: Lucene

Search against an index on a mapped drive ...

2008-03-14 Thread Dragon Fly
Hi, I'd like to find out if I can do the following with Lucene (on Windows). On server A: - An index writer creates/updates the index. The index is physically stored on server A. - An index searcher searches against the index. On server B: - Maps to the index directory. - An index searcher

RE: Search against an index on a mapped drive ...

2008-03-14 Thread Dragon Fly
Dragon Fly wrote: Hi, I'd like to find out if I can do the following with Lucene (on Windows). On server A: - An index writer creates/updates the index. The index is physically stored on server A. - An index searcher searches against the index. On server B: - Maps

Closing index searchers ...

2007-11-29 Thread Dragon Fly
Hi, My application needs to close/open the index searcher periodically so that newly added documents are visible. Is there a way to determine if there are any pending searches running against an index searcher or do I have to do my own reference counting? Thank you.
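
If you do end up doing your own reference counting, a minimal sketch could look like the following. It is plain Java with hypothetical names (`RefCounted`, `acquire`, `release`); a generic `Closeable` stands in for the index searcher, and nothing here is a Lucene API. The count starts at 1 for the owner, each search acquires and releases around its work, and the resource is closed when the last reference goes away.

```java
import java.io.Closeable;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical wrapper: "resource" stands in for an index searcher.
class RefCounted<T extends Closeable> {
    private final T resource;
    // Starts at 1: the reference held by the owner that created the wrapper.
    private final AtomicInteger refs = new AtomicInteger(1);

    RefCounted(T resource) {
        this.resource = resource;
    }

    // Call before a search starts. Returns null if the resource is already closed.
    T acquire() {
        while (true) {
            int n = refs.get();
            if (n == 0) {
                return null; // too late: caller should pick up the newer searcher
            }
            if (refs.compareAndSet(n, n + 1)) {
                return resource;
            }
        }
    }

    // Call when a search finishes, and once by the owner when it swaps in a
    // replacement. Closes the resource when the count reaches zero.
    void release() {
        if (refs.decrementAndGet() == 0) {
            try {
                resource.close();
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }

    public static void main(String[] args) {
        Closeable fake = () -> System.out.println("closed");
        RefCounted<Closeable> rc = new RefCounted<>(fake);
        rc.acquire();  // a search begins
        rc.release();  // owner swaps in a new searcher; the old one stays open
        rc.release();  // the search ends; "closed" is printed now
    }
}
```

The owner's `release()` can be issued at reopen time without waiting: any search still in flight keeps the old resource open until it finishes.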

RE: Sort by date with Lucene 2.2.0 ...

2007-10-23 Thread Dragon Fly
@lucene.apache.org Subject: Re: Sort by date with Lucene 2.2.0 ... Date: Fri, 19 Oct 2007 19:50:46 +0200 On Thursday 18 October 2007 21:35, Dragon Fly wrote: I am trying to sort a date field in my index but I'm seeing strange results. I have searched the Lucene user mail archive for DateTools

RE: Sort by date with Lucene 2.2.0 ...

2007-10-19 Thread Dragon Fly
the obvious (a specialty of mine)... Erick On 10/18/07, Dragon Fly [EMAIL PROTECTED] wrote: Hi, I am trying to sort a date field in my index but I'm seeing strange results. I have searched the Lucene user mail archive for DateTools but still couldn't figure out the problem

Sort by date with Lucene 2.2.0 ...

2007-10-18 Thread Dragon Fly
Hi, I am trying to sort a date field in my index but I'm seeing strange results. I have searched the Lucene user mail archive for DateTools but still couldn't figure out the problem. The date field is indexed as follows (i.e. DateTools is used, date field is stored and untokenized):
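
The indexing code from the original message is not preserved in this listing, and the block below is not a reconstruction of it. As a separate, minimal sketch of the idea behind storing dates as untokenized strings: a fixed-width yyyyMMdd key (the shape DateTools produces at day resolution) sorts chronologically when sorted as a plain string. `DateKeySketch`, `toKey` and `sortedKeys` are hypothetical names, and the code uses `java.time` rather than any Lucene class.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration; uses java.time, not Lucene's DateTools.
class DateKeySketch {

    // Fixed-width key: 4-digit year, 2-digit month, 2-digit day.
    static final DateTimeFormatter KEY = DateTimeFormatter.ofPattern("yyyyMMdd");

    static String toKey(LocalDate date) {
        return date.format(KEY);
    }

    // Converts dates to keys and sorts the keys as plain strings.
    static List<String> sortedKeys(List<LocalDate> dates) {
        List<String> keys = new ArrayList<>();
        for (LocalDate date : dates) {
            keys.add(toKey(date));
        }
        Collections.sort(keys);
        return keys;
    }

    public static void main(String[] args) {
        List<LocalDate> dates = Arrays.asList(
            LocalDate.of(2007, 10, 18),
            LocalDate.of(2006, 12, 31),
            LocalDate.of(2007, 1, 5));
        System.out.println(sortedKeys(dates));
    }
}
```

The property depends on every key having the same width; mixing resolutions or unpadded values in one field would break string ordering.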

Re: Field compression too slow

2006-08-11 Thread Dragon Fly
Mike, which version of Lucene supports lazy loading? Thanks. From: Michael McCandless [EMAIL PROTECTED] Reply-To: java-user@lucene.apache.org To: java-user@lucene.apache.org Subject: Re: Field compression too slow Date: Fri, 11 Aug 2006 06:59:58 -0400 I can share the data.. but it would be

Re: Empty fields ...

2006-07-19 Thread Dragon Fly
My index gets rebuilt every night so I probably can afford to construct the filters right after the index is rebuilt. How do I check each document (for empty fields) though? Would I use an IndexReader to loop through the documents? If so, which method(s) in the IndexReader class should I use?

Re: Empty fields ...

2006-07-19 Thread Dragon Fly
Thank you very much. From: Erick Erickson [EMAIL PROTECTED] Reply-To: java-user@lucene.apache.org To: java-user@lucene.apache.org Subject: Re: Empty fields ... Date: Wed, 19 Jul 2006 09:48:04 -0400 Try something like TermDocs termDocs = reader.termDocs(); termDocs.seek(new

Re: Empty fields ...

2006-07-18 Thread Dragon Fly
Thanks for the quick reply, Erick. A couple of follow-up questions though. If I had 200 fields (instead of 10), would you still recommend the same approach? Would 200 filters use up too much memory? Would 200 filters be too slow to construct/search? From: Erick Erickson [EMAIL PROTECTED]

Related documents ...

2006-06-12 Thread Dragon Fly
Hi, I have an index that contains 3 fields: Book Id, Book Title, and Related Book Ids. For example: = Book Id Book Title Related Book Ids A0001 Title 1 A0003, A0004 A0002 Title 2 A0003 Title 3 A0001, A0002 A0004 Title

Re: Performance ...

2006-05-23 Thread Dragon Fly
I'll give it a try, thanks. From: Yonik Seeley [EMAIL PROTECTED] Reply-To: java-user@lucene.apache.org To: java-user@lucene.apache.org Subject: Re: Performance ... Date: Mon, 22 May 2006 11:40:46 -0400 On 5/22/06, Dragon Fly [EMAIL PROTECTED] wrote: The search results of my Lucene application

Performance ...

2006-05-22 Thread Dragon Fly
Hi, The search results of my Lucene application are always sorted alphabetically. Therefore, score and relevance are not needed. With that said, is there anything that I can disable to: (a) Improve the search performance (b) Reduce the size of the index (c) Shorten the indexing time Thank

Synonyms ...

2006-04-21 Thread Dragon Fly
Hi, What is the best way to implement the following? Document 1 contains the following text: THE CZECH REPUBLIC ORGANIZATION Document 2 contains the following text: THE CZE ORGANISATION Synonym rules: (1) CZECH REPUBLIC -- CZE (2) CZE -- CZECH REPUBLIC (3) ORGANIZATION -- ORG,