Is there a way to determine whether or not an index that was left locked
due to some improper system shutdown needs repair?
My code does the following as part of starting up and creating an
IndexWriter for an existing index that was created in a prior session:
if
I notice some duplicated entries in my index, my just looking at it,
and I suspect there might be more than those I found out. Is there a
way to detect duplicate documents in an index?
Emerson Cargnin
-
To unsubscribe, e-mail:
Hi Mark, thanks for your response.
Here are my thoughts on your suggestion:
I believe it would be a good idea to merge similar query expansion code.
I also agree that the situation of fuzzy query is similar to the synonym
query use-case, in the sense of having a root term and some related,
Thanks, Jens. Seems like this would be pretty complicated.
It seems the best way would be not have a separate daemon for indexing
modifiied documents, but just have the reindexing part in the backend itself
(it would know when any documents were modifiied), but since it would involve
some
Chris,
Should this patch work against the current code base? I'm getting this
error:
D:\lucene-1.9patch -b -p0 -i nio-lucene-1.9.patch
patching file src/java/org/apache/lucene/index/CompoundFileReader.java
patching file src/java/org/apache/lucene/index/FieldsReader.java
missing header for
Chris,
My apologies - this error was apparently caused by a file format mismatch
(probably line endings).
Thanks,
Peter
On 3/13/06, Peter Keegan [EMAIL PROTECTED] wrote:
Chris,
Should this patch work against the current code base? I'm getting this
error:
D:\lucene-1.9patch -b -p0 -i
I'm confused about how to set the COMMIT lock timeout since the version
1.9.1 release.
I note that this can't be set from system properties anymore
(CHANGES.txt, changes in run time behaviour 7), but am unable to find
the replacement setter method promised for IndexWriter.
Can anyone point
How does the information change in each of these customer's documents?
I would think if they were very dynamic then updates to the single index
would not be great for you. But if the updates were just now and then,
then given the performance of lucene that the single index would be just
fine.
On 3/13/06, emerson cargnin [EMAIL PROTECTED] wrote:
I notice some duplicated entries in my index, my just looking at it,
and I suspect there might be more than those I found out. Is there a
way to detect duplicate documents in an index?
Emerson Cargnin
If there is a field with a unique
Hi,
Can someone please explain how does IndexSearcher and IndexWriter works in
conjuction. As far as i know after reading all the posts in newgroup, it
seems everything works fine if we have one IndexWriter thread and multiple
IndexSearcher thread. But my doubt here is, looking at IndexSearcher
On 3/13/06, Chuck Williams [EMAIL PROTECTED] wrote:
Is there a way to determine whether or not an index that was left locked
due to some improper system shutdown needs repair?
Depends what you mean by repair. If there was a crash during index
modification, I think the index should normally
Hi Nikhil
We are using the index accessor contribution. For more information see:
http://www.nabble.com/Fwd%3A-Contribution%3A-LuceneIndexAccessor-t17416.html#a47049
This should help you to co-ordinate the IndexSearcher and IndexWriter.
Patrick
On 13/03/06, Nikhil Goel [EMAIL PROTECTED] wrote:
Hi Patrick,
thanks for writing back but my question is:- do we really need to write
something new to achieve what I want to achieve. By going thru Lucene
Tutorials, i dont think there is a need to do such a thing:-
http://blog.danbartels.com/archive/2004/09/09/186.aspx
Indexing and searching
On 3/13/06, Nikhil Goel [EMAIL PROTECTED] wrote:
Can someone please explain how does IndexSearcher and IndexWriter works in
conjuction.
The trick is that once segment files are written, they are never
modified (except for the segments file itself). New documents are
added to new segments, not
We, John Wiley Sons (http://www3.interscience.wiley.com/), are looking
for a Lucene expert to assist with our migration from Verity to Lucene
(up to six weeks work, starting this coming Monday, 20 March). The
candidate must be based in the UK, preferably in or close to London,
as we would like
On Montag 13 März 2006 15:50, Jim Bedford-roberts wrote:
I note that this can't be set from system properties anymore
(CHANGES.txt, changes in run time behaviour 7), but am unable to find
the replacement setter method promised for IndexWriter.
Seems these have been forgotten. They can easily
Daniel Naber ponders:
Seems these have been forgotten. They can easily be added, but I still
wonder what the use case is to set these values?
The default value isn't magic. The appropriate value is
context-specific. I've got some people using Lucene on machines with
slow disks, and we need
: The Searching process then would have to re-open it's RAMDirectory.
the key to all of this being that there are constructors for RAMDirectory
that make it very easy to load in the contents of an FSDirectory.
: Or you check the version of the fs-based index from time to time, to see
: when it
: The trick is that once segment files are written, they are never
: modified (except for the segments file itself). New documents are
: added to new segments, not existing segments. When segments are
: merged, a new bigger segment is created. This way, the view of the
: index for a specific
I am curious why the character / sorts before the space.
For example,
Apple/banana is good for you.
Sorts before
Apple banana is good for you
Is there something I can do to make it sort correctly?
Regards,
Bob
-
To
On 3/13/06, Bob Cheung [EMAIL PROTECTED] wrote:
I am curious why the character / sorts before the space.
For example,
Apple/banana is good for you.
Sorts before
Apple banana is good for you
Are you sure that the field is untokenized, and that you are sorting
in the correct direction?
---BeginMessage---
Hi all,
If I want to embed the index files into another file (say of
extension *.luc, so now all the index files are flattened inside this new file),
can I still use the index without having to extract out the index files to a
temp folder?
aditya
---End
RAMDirectory is indeed currently limited to 2GB. This would not be too
hard to fix. Please file a bug report. Better yet, attach a patch.
I assume you're running a 64bit JVM. If so, then MMapDirectory might
also work well for you.
Doug
z shalev wrote:
this is in continuation of a
I'm pretty sure. The other characters sorted according to the ASCII
sequence. It's only the slash sorted before the space. That's why I
wonder whether slash is treated differently.
Btw, this is the statement the sort field is added to the document.
doc.add(Field.UnIndexed(_s +
24 matches
Mail list logo