Name fields in a range between "A Balladeer*" TO "A
Perfect Circle*" and get only terms back which are starting with that
terms? Is there a way to accomplish that in Java and try it in luke?
And is there a way to sort resultsets in luke?
Cheers,
Thomas
--
Thomas Be
t Circle"] gives zero results. Tried it
also with braces around the term and such stupid things, even if they
shouldn't be needed in a range query.
I'm kinda clueless.
Cheers,
Thomas
Ian Lea wrote:
Hi
Are you sure your range queries should have wild card asterisks on the
end? Loo
Btw. I tried the wildcard since I found something on google, which noted
wildcards together with StartsWith queries.
Thomas Becker wrote:
Hi Ian,
no the wild cards should not be necessary. That was just the last try
out of some. I now the exact content of both fields in my range query.
The
Howdy all,
I have a issue with java running out of memory after the search
has been running for a while. We are using 1.9.1 release and I check
the indexreader's version to determine if I need to get a new searcher
for searches (so I pick up any changes to the index). I am seeing jumps
i
objects aren't
properly disposed.
-drj
On 10/26/06, Aigner, Thomas < [EMAIL PROTECTED]> wrote:
>
> Howdy all,
>
> I have a issue with java running out of memory after the
search
> has been running for a while. We are using 1.9.1 release and I check
> the indexrea
I have seen numerous posts on warming up a searcher, but was wondering
if someone could post their code that would spin off another thread to
warm up a searcher, then switch to the new one when it is warmed up?
-
To unsubscri
how to update a field in lucene?
lt stemmer or do I have to stem the texts before
indexing ?
Does a multi-language stemmer exists ?
(sorry if the answers are in the documentation, I didn't manage to
fully read it)
Thanks in advance !
Thomas Klein.
-
To
Howdy all,
I have a question on reading many documents and time to do this.
I have a loop on the hits object reading a record, then writing it to a
file. When there is only 1 user on the Index Searcher, this process to
read say 100,000 takes around 3 seconds. This is slow, but can
have it in the loop?
On Dec 7, 2006, at 1:14 PM, Aigner, Thomas wrote:
>
>
>
>
> Howdy all,
>
>
>
> I have a question on reading many documents and time to do this.
> I have a loop on the hits object reading a record, then writing it
> to a
> file
is.search(query, hc);
-Original Message-
From: Aigner, Thomas [mailto:[EMAIL PROTECTED]
Sent: Thursday, December 07, 2006 1:36 PM
To: java-user@lucene.apache.org
Subject: RE: Reading Performance
Thanks Grant and Erik for your suggestions. I will try both of them and
let you know i
How could I optimize my index creation?
// setUseCompoundFile(?);
// setMaxBufferedDocs(?);
// setMergeFactor(?);
How could I reduce the disk access because I work with more than 100
documents?
Thanks
ink.
Otis
- Original Message
From: WATHELET Thomas <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Monday, January 29, 2007 4:16:22 AM
Subject: Index creation
How could I optimize my index creation?
// setUseCompoundFile(?);
// setMaxBufferedDocs(?);
/
I'm using RAMDirectory
If the number of documents to index is less than the maxBufferedDocs
properties nothing is write into my index.
ex: RAMDirectory ramDir =new RAMDirectory ();
this.indexWriter.addIndexes(new Directory[] { ramDir });
ramWriter.close();
indexWriter.
P.S. At one point I tried doing an in-memory index using the
RAMDirectory and then merging it with an on-disk index and it didn't
work. The RAMDirectory never flushed to disk... leaving me with an
empty index.
Only when the number of documents is greather than the maxBufferedDocs
properties.
What
s was fixed a while back.
Running HEAD or at least 2.0.0?
Otis
- Original Message
From: WATHELET Thomas <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Tuesday, January 30, 2007 11:22:52 PM
Subject: RAMDirectory 2
P.S. At one point I tried doing an in-memory inde
do something like this:
public class Index extends IndexModifier { ...
public int deleteDocuments(String field, String value) throws
IOException {
return super.deleteDocuments(new Term(field, value));
}
use like this :
index.deleteDocuments(filed name, field value);
_
Parse your date with this classe DateTools.stringToDate to search and
DateTools.dateToString() to store into index.
-Original Message-
From: 李寻欢晕菜了 [mailto:[EMAIL PROTECTED]
Sent: 26 February 2007 11:17
To: java-user@lucene.apache.org
Subject: how to query range of Date by given date st
I don't really understand the difference between using the ramDirectory
and using IndexWriter.
What's the difference between using ramDirectory instead of using
IndexWriter with those properties set to:
setMergeFactor(1000);setMaxMergeDocs(1);setMaxBufferedDocs(1);
lto:[EMAIL PROTECTED]
Sent: 28 February 2007 16:29
To: java-user@lucene.apache.org
Subject: Re: RamDirectory vs IndexWriter
Le Mercredi 28 Février 2007 16:19, WATHELET Thomas a écrit :
> I don't really understand the difference between using the ramDirectory
> and using IndexWriter.
&
Walt,
I am no expert, but it sounds like you need to associate many
dates to a single record. Can this be handled as you would a synonym?
Basically add a token at the same offset as the row itself? i.e. you
would have a record that would also have a date field that has 3 offsets
that woul
How can I add the Package org.apache.lucene.search.highlight into my projects
because the standart Lucene api 2.1.0 do not content this package?
org.apache.lucene.search.highlight
The contrib/highlighter directory contains the jar file that is needed.
Ulf
On 04.03.2007, at 10:58, WATHELET Thomas wrote:
> How can I add the Package org.apache.lucene.search.highlight i
tting
and returning docs IDs.
I'm indexing a some date field like that :
20060612223456
2006061222
20060612
200606
2006
In your opinion, will range query be fast enough or will I need to
create, for exemple, one index per month and then open and parse them
regarding querys ?
R
process.
http://www.getopt.org/luke/
Stop words affect the performance. The seize of the index without stop
words is much small ( up to 40%), because they occur soo often.
Thomas
aslam bari wrote:
Ok, Thats fine. Thanks
Now what if i don't want to stop any word, means i want lucene not to ignore
ss (Analyzer), which is a simple copy of the
StandardAnalyzer, and expand it with the appropriate Filter in the
method "tokenStream".
Hope this helps...
Thomas
[EMAIL PROTECTED] wrote:
Hi,
Lucene provides a PorterStemFilter which uses PorterStemmer.
Is there any way I c
rrent document.
For the calculation of the idf, you can use the provided formula from
the "DefaultSimilarity".
To get the document frequency, which is necessary to calculate the idf,
you can call:
reader.docFreq(term)
Hope this helps...
Thomas
Sengly Heng wrote:
Hello Luceners,
//toDo: count the occurrence of the terms
}
}
}
Hope this helps.
Thomas
Sengly Heng wrote:
Hello all,
I would like to extract the term freq vector from the hit results as a
total
vector not by document.
I have searched the
complete JUnit test which behaves exactly as my manager. The main
methods to look at are testIndexManagement (the
test),getIndexWriter,addDocument,performExactSearchIntoIndex,performApproximativeSearchIntoIndex.
Thank you for your help,
Best regards,
T
complete JUnit test which behaves exactly as my manager. The main
methods to look at are testIndexManagement (the
test),getIndexWriter,addDocument,performExactSearchIntoIndex,performApproximativeSearchIntoIndex.
Thank you for your help,
Best regards,
T
High,
Is-it possible to update a document's field without deleting the
document and add it again into the index?
Witch analyser I have to use to find text like this ''?
Howdy all,
I am having a problem with inserting/updating records into my
index. I have approximately 1.5M records in the index taking about 2.5G
space when optimized.
If I want to update 1000 records, I delete the old item and insert the
new one. This is taking a LONG time to accomplis
wapped to a reader for reads. By keeping this
index
small speeds up the constant opening and closing. Searching is
straightforward using the MultiSearcher.
I don't know anything about the lock problem - it's not something I've
ever
seen (I'm using 1.4.3).
Regards
Pa
Howdy all, have a quick question for you...
I am seeing quite a difference between optimized index and one that is
not optimized. I have read a few papers that say that it shouldn't
matter, but I am seeing 7X speed or better when the index is optimized.
Is it possible that I am creating the origi
ber of segments, and hence speed searching.
-Yonik
Now hiring -- http://forms.cnet.com/slink?231706
On 11/16/05, Aigner, Thomas <[EMAIL PROTECTED]> wrote:
> Howdy all, have a quick question for you...
>
> I am seeing quite a difference between optimized index and one that is
>
Hi all,
Is there an index changed event that I can jump on that will
tell me when my index has been updated so I can close and reopen my
searcher to get the new changes?
I can't seem to find the event, but see some tools that might
accomplish this (DLESE DPC software components?).
the last time you checked. The index's version number
changes whenever the index is updated.
Peter
> -Original Message-
> From: Aigner, Thomas [mailto:[EMAIL PROTECTED]
> Sent: Monday, November 21, 2005 3:48 PM
> To: java-user@lucene.apache.org
> Subject: Lucene
Is there an error list for lucene errors? I am creating a project that
will do something specific for certain lucene errors that occur (Lock
file out there, etc).
Thanks,
Tom
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For
Hi all,
Is anyone experiencing possible memory problems on LINUX with
Lucene search? Here is our scenario, we have a service that lives on
LINUX that takes all incoming request through a port and does the
search. Only 1 IndexSearcher is instantiated to do this from our
service. When I r
s that the thread has X Mb of memory,
it's really the same physical block as all the others.
You see this all the time in a Tomcat app server box, where each Http
Connector is a thread, and appears as it's own process.
cheers,
Paul Smith
On 17/01/2006, at 7:11 AM, Aigner, Thomas wrote:
I'm still a bit new to Lucene, but the way I think you can handle it is
using a Boolean query.
//Create a bool Query
BooleanQuery boolQuery = new BooleanQuery();
Loop through each query and add it to the boolQuery
//Addend the boolean queries together Using the And syntax
boolQuery.add(queryX, t
I am curious what would be the difference between searching for a number
verses a character.
I have a large index consisting of a few fields (So index would look
something like: " 123123123 my description my catalog"
Searching for 12* is much slower than searching for de*
I don't have a
deas from that, you can impliment a much faster version of
PrefixQuery that doesn't score documents based on term frequency ...
which
may be ok depending on your needs.
: Date: Mon, 30 Jan 2006 13:51:15 -0500
: From: "Aigner, Thomas" <[EMAIL PROTECTED]>
: Reply-To: java-
Hello all,
I have a couple of questions for the community about the 1.9
Lucene version. As I understand it, this has not been released and I
can't find an approximate date for release (I know you can download the
development version and compile it). I see a nightly build going on
(http:/
Anyone have a comment on the below message?
-Original Message-
From: Aigner, Thomas
Sent: Wednesday, February 08, 2006 11:50 AM
To: java-user@lucene.apache.org
Subject: 1.9 lucene version
Hello all,
I have a couple of questions for the community about the 1.9
Lucene version
I believe that the files are actually deleted from lucene when the
optimize is run.
-Original Message-
From: Dan Armbrust [mailto:[EMAIL PROTECTED]
Sent: Monday, February 13, 2006 12:27 PM
To: java-user@lucene.apache.org
Subject: When do files in 'deleteable' get deleted?
If I am using l
. But with
this solution i get sometimes a "More than 32 required/prohibited
clauses in query." Exception. Anyone a solution for my that problem or
some sites for me where i could look at (with some examples)? Or is
there a way to generate (relativly fast) a subindex?
Thanks a lot
What is the disadvantage of doing that?
Michael D. Curtin schrieb:
Thomas Papke wrote:
i am a "newby" in usage of Apache Lucene. If have a relativly big
database indexed by lucene (about 300MB File). Up to now - all users
could search over the hole index. How to restrict the re
;
Or is there an better way to prohibite something? Filters maybe? Anyone
a short example how to implement a lucene filter?
Thanks a lot, Thomas
Michael D. Curtin schrieb:
Volodymyr Bychkoviak wrote:
This is not the case.
maxClauseCount limit number of terms that can fit into BooleanQuery
d
I made a multi search into my Lucene index. It's work properly but I
would like to know if it's possible to know in witch index de document
belong to.
Thanks a lot.
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 06 March 2006 16:25
To: java-user@lucene.apache.org
Subject: Re: Multisearch
On Mar 6, 2006, at 10:05 AM, WATHELET Thomas wrote:
> I made a multi search into my Lucene index. It's work proper
uot;Ab*". So i don't need the
hole document and i need this information realy fast.
Thx,
Thomas
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
I've created an index with the Lucene version 1.9 and when I try to open
this index I have always this error mesage:
java.lang.ArrayIndexOutOfBoundsException.
if I use an index built with the lucene version 1.4.3 it's working.
Wath's wrong?
High,
I have a trouble this the indexation process, sometimes I retrieve an
error like the file segments.new can't be rename or delete something
like that.
What's happened?
From: Patrick Kimber [mailto:[EMAIL PROTECTED]
Sent: woensdag 15 maart 2006 10:32
To: java-user@lucene.apache.org
Subject: Re: segments.new
Hi Thomas
I have been getting similar errors and am trying to investigate the
cause.
My current thinking is that it is caused by my virus checker opening
the files.
s) of Lucene for a few
months now...
Luc
-Original Message-
From: Patrick Kimber [mailto:[EMAIL PROTECTED]
Sent: woensdag 15 maart 2006 10:32
To: java-user@lucene.apache.org
Subject: Re: segments.new
Hi Thomas
I have been getting similar errors and am trying to investigate the
caus
How to replace this Expression Query query =
QueryParser.parse(this.searchvalue, "text", new StandardAnalyzer()); in
Lucene 1.9.1 because the method parse is deprecated?
quot;, new StandardAnalyzer());
Query query = qp.parse(this.searchvalue);
Cheers,
Tim.
-Original Message-
From: WATHELET Thomas [mailto:[EMAIL PROTECTED]
Sent: 22 March 2006 11:25
To: java-user@lucene.apache.org
Subject: Lucene 1.9.1 Query
How to replace this Expression Q
) );
Query query = qp.parse( searchvalue );
regards,
Koji
> -Original Message-
> From: WATHELET Thomas [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, March 22, 2006 8:25 PM
> To: java-user@lucene.apache.org
> Subject: Lucene 1.9.1 Query
>
>
> How to r
I use Lucene 1.9.1
How to parse an unc path like \\tom\share\5\tom.doc
in a query to search in the index key
field?
String key="\\tom\share\5\tom.doc ";
Ex: Hits hits = multisearch.search(new TermQuery(new Term("key",
QueryParser.escape(key;
I ask this question because this key exist int
Howdy all,
I am having a performance issue. When I do a search for items,
getting more information takes a long time.
Ex. If there are 1M hits (I know, why look for that many or even allow
it, but let's say we return 1M hits). When the user wants to see the
last 25, it takes a LONG time
Is it possible to write into the index and delete some documents in the same
time?
If yes could you show me me how to proceed.
I use Lucene 1.9.1
Thanks in advence.
, WATHELET Thomas wrote:
> Is it possible to write into the index and delete some documents in
the
> same time?
Yes, have a look at the IndexModifier class.
--
http://www.danielna
Ok thanks
-Original Message-
From: Tom Hill [mailto:[EMAIL PROTECTED]
Sent: Sat 3/25/2006 12:39 AM
To: java-user@lucene.apache.org
Cc:
Subject: Re: delte documents into index
Hi Thomas
use one IndexModifier object for all your process
-Original Message-
From: Daniel Naber [mailto:[EMAIL PROTECTED]
Sent: Sat 3/25/2006 12:07 PM
To: java-user@lucene.apache.org
Cc:
Subject: Re: delte documents into index
Why I retrive hits with this query :
+doccontent:avian +doctype:AM +docdate:[2005033122000 TO
2006062022000]
and not with this one
+doccontent:avian influenza +doctype:AM +docdate:[2005033122000 TO
2006062022000]
al-
De: WATHELET Thomas [mailto:[EMAIL PROTECTED]
Enviado el: miércoles, 21 de junio de 2006 15:40
Para: java-user@lucene.apache.org
Asunto: BooleanQuery
Why I retrive hits with this query :
+doccontent:avian +doctype:AM +docdate:[2005033122000 TO
2006062022000]
and not wit
I'm creating my index file and in the same time I try to do some
searches inside.
Sometimes I retrieve this error message:
"\\tradluxstmp01\JavaIndex\tra\index_FR\_335.fnm (The system cannot find
the file specified)"
What I have to do or what's happen?
Hi,
I have a question about using sorting in Lucene, because we experienced
some OutOfMemory errors. If I understand it correctly, each unique term
in a field is read into a cache, when I use Searcher.search(Query query,
Sort sort) with one SortField. So even if my query only finds 5
documents, Luc
That is exactly what I did when I started to realize the effects of using
Lucene sorting with millions of documents in the index. I used STORED fields
and sorted the results with a generic Comparator, which is configured for a
field and a search order. I only do this if the query did not return
When I create an index withe the class IndexModifier in Lucene 1.9.1there is a
lock file created on a temp folder.
My question is: Is it possible to disable this option?
If yes how to procede?
It's not possible to change lockDir because it's a final static
varriables?
Is it possible to change the lockDir?
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 29 June 2006 22:26
To: java-user@lucene.apache.org
Subject: Re: Lock File
> When I create an ind
Ok thanks I understand now.
Thanks a lot.
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 30 June 2006 16:10
To: java-user@lucene.apache.org
Subject: Re: Lock File
> It's not possible to change lockDir because it's a final static
> varriables?
> Is it possib
How to parse this kind of query?
COM(2006) 0001
How to parse this query COM(2005) 0123 in LukeAll?
I have this result cocnumber: com
ssage-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 11 July 2006 15:14
To: java-user@lucene.apache.org
Subject: Re: Query using parenthesis
On Jul 11, 2006, at 8:57 AM, WATHELET Thomas wrote:
> How to parse this query COM(2005) 0123 in LukeAll?
> I have this result cocnumber: com
&
d is:
SEC(2006) 0350
The resulting query that Lucene produces :
+docnumber:SEC\(2006\) 0350
I'm using the class simpleanalyser for the search.
And I have no result but this data SEC(2006) 0350 is present in the
index "docnumber field"
Thank's in advence
Wathelet
Thomas
-Origi
ownloading a copy of Luke so you can examine your index and see exactly
what got indexed. That's saved me a bunch of times
I wonder whether you want to store this UN_TOKENIZED instead?
Best
Erick
On 7/11/06, WATHELET Thomas <[EMAIL PROTECTED]> wrote:
>
> I have an index w
When the indexing process still running on a index and I try to search
something on this index I retrive this error message:
java.io.FileNotFoundException:
\\tradluxstmp01\JavaIndex\tra\index_EN\_2hea.fnm (The system cannot find
the file specified)
How can I solve this.
For the index process I use IndexModifier class.
That happens when I try to search something into the index in the same
time that the index process still running.
the code for indexing:
System.setProperty("org.apache.lucene.lockDir", System
.getProperty("user.dir"));
s possible that you're
writer
hasn't written anything yet.
3> (so I can't count). Have you used Luke to open your index to see if
that
works (and the file is in the place you expect)?
FWIW
Erick
On 8/1/06, WATHELET Thomas <[EMAIL PROTECTED]> wrote:
>
> For the ind
writer had closed before you looked.
Erick
On 8/1/06, WATHELET Thomas <[EMAIL PROTECTED]> wrote:
>
> It's the same when I try to open the index with luke
>
> -Original Message-
> From: Erick Erickson [mailto:[EMAIL PROTECTED]
> Sent: 01 August 2006 15:24
> T
uns on the index and
parallally search function is also running.
thx,
supriya
WATHELET Thomas wrote:
> I'm sure that it's the good location.
> When the index process is finished then I can access the index.
> I know why but I don't know how to solve it.
> When I in
Yes
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:10
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> I think its a directory access synchronisation problem, I have also
> posted about this before. The scenario can be
Ok if I well understood I have to put the lock file at the same place in
my indexing process and searching process.
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:14
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> Yes
Ok thanks a lot.
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:19
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> Ok if I well understood I have to put the lock file at the same place
in
> my indexing process and searc
Is it possible to update fields in an existing index.
If yes how to proceed.
Is it possible to change index structure in an existing index.
If yes how to proceed.
If I want to add a new field for exemple into an existing index
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 23 August 2006 11:57
To: java-user@lucene.apache.org
Subject: Re: Change index structure
On Aug 23, 2006, at 3:50 AM, WATHELET Thomas wrote:
> Is
Thanks a lot.
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 23 August 2006 14:26
To: java-user@lucene.apache.org
Subject: Re: Change index structure
On Aug 23, 2006, at 6:22 AM, WATHELET Thomas wrote:
> If I want to add a new field for exemple into an exist
Hi,
I have an index with a field 'content' (tokenized, stored, indexed)
using Lucene 1.9.1.
I tried to search this text in exact string: "european parliament
resolution on the Commission report on the regional meetings arranged by
the Commission in 1998-1999 on the common fisheries policy after 2
index and I wonder why my WildcardFilter sometimes returns the correct
search results and sometimes not. What is the difference between steping
through the term docs with termDocs.next() and using the read-method.
Can anybodey explain that?
Thanks in advance,
Thomas
dcard filters. But
filters really seem to be quite fast.
Thanks again,
Thomas
-Ursprüngliche Nachricht-
Von: Erick Erickson [mailto:[EMAIL PROTECTED]
Gesendet: Tuesday, September 19, 2006 3:59 PM
An: java-user@lucene.apache.org
Betreff: Re: Question about termDocs.read(docs, freqs)
I
Hello all,
I am VERY new to Lucene and we are trying out Lucene to see if
it will accomplish the vast majority of our search functions.
I have a question about a good way to index some of our product
description codes. We have description codes like 21-MA-GAB and other
punctuatio
on
On 6/28/05, Aigner, Thomas <[EMAIL PROTECTED]> wrote:
> Hello all,
>
> I am VERY new to Lucene and we are trying out Lucene to see if
> it will accomplish the vast majority of our search functions.
>
> I have a question about a good way to in
#x27;t matter.
Two other issues are code/data size (ICU can be big) and the
performance hit while indexing documents.
-- Ken
>Aigner, Thomas wrote:
>
>>Hello all,
>>
>> I am VERY new to Lucene and we are trying out Lucene to see if
>>it will accomplish the v
Hi all,
I am creating a punctuation filter to filter certain punctuation
out of the token stream. I am getting a "The field t.termText is not
visible" error. I'm not sure what I would need to include to make this
property visible (I am still new to Lucene and Java for that matter). I
copi
(IndexWriter.java:310)
at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:294)
The version of Lucene is 1.4.3. The Java class only has one main method
implemented. Any clue?
-Thomas
--- End Message ---
-
To
)
at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java:294)
The version of Lucene is 1.4.3. The Java class only has one main method
implemented. Any clue?
-Thomas
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For
101 - 200 of 231 matches
Mail list logo