When Lucene first issues a query, it caches a hash of sort values (one
value per document, plus a bit more if you are sorting on strings),
which takes a while. Therefore, when our application first starts up,
we issue one query per sort type. As I understand, it doesn't matter
what the query is
Anyone have any ballpark stats about sorting a single field versus sorting
multiple fields? I understand every implementation is different, but I'm
just trying to get a sense of what to expect before I revamp my index.
We need fairly fine-grained sorting of items, so I have a field with the
How often are you updating your index? Are you closing your old
IndexSearchers after switching over to the new index? You'll need to
close the searchers in order to release the file handle. This was the
same issue I was experiencing:
) and int sorting will involve caching an
int[]. Unique string values are shared in the array, but the String
values plus the String[] will always take up more room than the int[].
-Yonik
Now hiring -- http://forms.cnet.com/slink?231706
On 11/9/05, Monsur Hossain [EMAIL PROTECTED] wrote:
Hi
Ah, I got it. retArray is an array of ints; in order to return the string
value, it needs the mterms array to do the mapping. Thanks, Yonik!
Monsur
-Original Message-
From: Yonik Seeley [mailto:[EMAIL PROTECTED]
Sent: Thursday, November 10, 2005 1:33 PM
To:
Hi all. I have a question about sorting. Lucene in Action says: For
numeric types, each field being sorted for each document in the index
requires that four bytes be cached. For String types, each unique term is
also cached for each document.
I want to make sure I'm understanding this
fast.
How much data is being searched there?
Otis
--- Monsur Hossain [EMAIL PROTECTED] wrote:
Hey all. We just relaunched our search feature over here at
Xanga.com; the Blogs, Metros and Blogrings sections are powered by
Lucene.NET! You can
check it out here:
http
Hey all. We just relaunched our search feature over here at Xanga.com; the
Blogs, Metros and Blogrings sections are powered by Lucene.NET! You can
check it out here:
http://search.xanga.com/
This is only the beginning of what we want to do with search and Lucene. I
want to thank everyone on
I'm a little late to this thread. But is there any performance difference
between the compound index format and the multifile index format when
*searching*? The Lucene book mentions a performance difference when
*indexing*, but not when searching.
Monsur
-Original Message-
From:
Ramya. I don't have an answer to your specific lock file question, but
a couple thoughts.
You say you're using multiple threads to index 50,000 documents. Have
you tried a single thread version first? I'd try that, and then scale
out to multiple threads as needed. We index over ten times that
-Original Message-
From: Chris Lamprecht [mailto:[EMAIL PROTECTED]
Sent: Thursday, April 28, 2005 7:53 PM
Since the stored fields index would basically just be a
database, perhaps this is better served using a traditional
relational database (or even use the OS's file system).
I'm using Lucene.NET, but I had a similar issue with Visual Studio. With
Visual Studio open, my application would randomly crash with the same error
when I tried to run it from the command line. I'd recommend shutting down
all running apps and then see if the error happens in Ant. You could
Just tried this on my linux laptop - with IndexSearcher uncommented, I
still get a single .cfs file.
Hmmm, rereading this, I'm curious to know how/why this works in Linux.
Consider this scenario:
1) Create a new index
2) Create a new IndexSearcher pointing to that index.
3) Run an
I ran this test a little differently than letting the
IndexSearcher get garbage collected. Instead, I explicitly closed the
searcher (reader) and reopened it periodically.
Thanks Chuck, this is all really helpful. That explicit close() is what
allows the files stored up in deletable to
.
No problem at all! We have two index directories between
which we switch back and forth though?
Frank
-Original Message-
From: Monsur Hossain [mailto:[EMAIL PROTECTED]
Sent: Friday, April 29, 2005 12:11 AM
To: lucene-user@jakarta.apache.org
Subject: IndexSearcher hanging
files.
Thanks,
Monsur
-Original Message-
From: Chuck Williams [mailto:[EMAIL PROTECTED]
Sent: Thursday, April 28, 2005 10:09 PM
To: java-user@lucene.apache.org
Subject: Re: IndexSearcher hanging on to old index files in Windows
Monsur Hossain writes (4/28/2005 4:44 PM):
Hi
16 matches
Mail list logo