Re: Warmup queries

2009-07-03 Thread Ian Lea
I think you should warm as much as possible: queries, filters, sorts, whatever. And I think that the warmup searches should match stuff that is available in the index. You'll get warming of internal lucene structures and also OS disk IO cache. Taking the last n queries executed on the previous s
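
One way to keep "the last n queries" around for replay is a small bounded buffer. This is only a sketch in plain Java: `RecentQueries` and its methods are hypothetical names, not part of Lucene, and how the saved strings get parsed and run against the new searcher is left to the application.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical helper, not a Lucene class: remembers the last n query strings
// so they can be replayed against a freshly opened searcher as warmup.
class RecentQueries {
    private final int capacity;
    private final Deque<String> queries = new ArrayDeque<String>();

    RecentQueries(int capacity) {
        if (capacity <= 0) {
            throw new IllegalArgumentException("capacity must be positive");
        }
        this.capacity = capacity;
    }

    // Record a query; the oldest one is dropped once the buffer is full.
    synchronized void record(String query) {
        if (queries.size() == capacity) {
            queries.removeFirst();
        }
        queries.addLast(query);
    }

    // Copy of the buffer in oldest-to-newest order, safe to iterate
    // while other threads keep recording.
    synchronized List<String> snapshot() {
        return new ArrayList<String>(queries);
    }

    public static void main(String[] args) {
        RecentQueries recent = new RecentQueries(2);
        recent.record("title:lucene");
        recent.record("body:warmup");
        recent.record("author:smith");
        System.out.println(recent.snapshot());
    }
}
```

Because the saved queries came from real traffic, they should match terms that actually exist in the index, which is the point made above.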

Insuffient resources during optimize merge

2009-07-03 Thread Carl Austin
Hi, I have an issue when optimizing a large index (40GB). Basically the optimize falls over with an IOException stating there are not enough system resources, within a RandomAccessFile.readBytes call followed back to the CompoundFileWriter close method. I assume this is when creating a new CFS fil

Re: Insuffient resources during optimize merge

2009-07-03 Thread Manish Joshi
I assume it must be a memory-related issue. Not sure, just a guess. -Manish On Fri, Jul 3, 2009 at 3:42 PM, Carl Austin wrote: > Hi, > > I have an issue when optimizing a large index (40GB). Basically the > optimize falls over with an IOException stating there are not enough > system resources, with

Re: Insuffient resources during optimize merge

2009-07-03 Thread Ian Lea
Hi Google throws up some hits for this error relating to obscure problems on Windows with UNC paths and shared rather than local storage. If that's not relevant it might help to know what OS you are on, and versions of lucene and java. As a workaround you could perhaps try not using the compoun

RE: Insuffient resources during optimize merge

2009-07-03 Thread Carl Austin
Thanks Ian, This is using Windows Server 2000 or 2003, however I believe that the storage is all local. I will of course check, but am pretty sure. The workaround is also something I may try, so thank you for that. Carl -Original Message- From: Ian Lea [mailto:ian@gmail.com] Sent:

RE: Insuffient resources during optimize merge

2009-07-03 Thread Carl Austin
Sorry, I forgot to include that I am using lucene 2.4.0 and Java 1.5 Thanks Carl -Original Message- From: Ian Lea [mailto:ian@gmail.com] Sent: 03 July 2009 11:51 To: java-user@lucene.apache.org Subject: Re: Insuffient resources during optimize merge Hi Google throws up some hits

NYC Apache Lucene/Solr/Nutch/etc. Meetup

2009-07-03 Thread Grant Ingersoll
Hi All, (sorry for the cross-post) For those in NYC, there will be a Lucene ecosystem (Lucene/Solr/Mahout/ Nutch/Tika/Droids/Lucene ports) Meetup on July 22, hosted by MTV Networks and co-sponsored with Lucid Imagination. For more info and to RSVP, see http://www.meetup.com/NYC-Apache-Lucene

Re: NYC Apache Lucene/Solr/Nutch/etc. Meetup

2009-07-03 Thread Brendan Grainger
Hi Grant, It's saying this meeting is closed to new attendees when I try to RSVP to the meeting. Any way around that? Thanks On Jul 3, 2009, at 8:11 AM, Grant Ingersoll wrote: Hi All, (sorry for the cross-post) For those in NYC, there will be a Lucene ecosystem (Lucene/Solr/ Mahout/Nutch

Punctuation in Whitespace Analyzer

2009-07-03 Thread Max Lynch
Hello, I am having an issue with analyzers. Right now, when I do a search, I am searching for a whole name. For example, if I have a document like this: "This is the document text. John Smith is mentioned right here, he is in the john. Smith is his last name. His full name is John Smith." If

How to use RegexTermEnum

2009-07-03 Thread Raf
Hi, I am trying to solve the following problem: In my index I have a "url" field added as Field.Store.YES, Field.Index.NOT_ANALYZED and I must use this field as a "key" to identify a document. The problem is that sometimes two urls can differ only because they contain a different session id: i.e.
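
A different technique from RegexTermEnum, offered only as a sketch: strip the session id before the url is indexed or looked up, so the stored key is the same for both variants. The `;jsessionid=...` form below is an assumption (the actual session id format is not shown in the message), and `UrlKey` is a hypothetical name.

```java
import java.util.regex.Pattern;

// Hypothetical helper: builds a session-independent key from a url.
class UrlKey {
    // Assumption: the session id is carried as a ";jsessionid=..." path
    // parameter. Adjust the pattern to whatever the real urls contain.
    private static final Pattern SESSION_ID =
            Pattern.compile(";jsessionid=[^?#]*", Pattern.CASE_INSENSITIVE);

    // Remove the session id part, leaving path, query string and fragment.
    static String normalize(String url) {
        return SESSION_ID.matcher(url).replaceAll("");
    }

    public static void main(String[] args) {
        System.out.println(normalize("http://example.com/page;jsessionid=ABC123?x=1"));
    }
}
```

With a normalized key in the NOT_ANALYZED field, an exact term lookup would be enough and no regular expression is needed at search time.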

Re: Insuffient resources during optimize merge

2009-07-03 Thread Simon Willnauer
Hey Carl, do you have any anti-virus running on that machine by any chance? If so could you try to turn it off and try again. simon On Fri, Jul 3, 2009 at 1:53 PM, Carl Austin wrote: > Sorry, I forgot to include that I am using lucene 2.4.0 and Java 1.5 > > Thanks > > Carl > > -Original Messa

Re: How to use RegexTermEnum

2009-07-03 Thread Erick Erickson
WARNING: I haven't actually tried using RegexTermEnum in a long time, but... I *think* that the constructor positions you at the first term that matches, without calling next(). At least there's nothing I saw in the documentation that indicates you need to call next() before calling term(). Assum

Storing a serialized object ?

2009-07-03 Thread MilleBii
I want to store a data structure in the index and load it back at search time. Is it safe to serialize the Java object, store it, and load it back later? Presumably I need to store it as binary, right? Otherwise I need to create my own store and load methods, which is a waste of time. -- -MilleBii-
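
For reference, a minimal sketch of the serialization round trip itself, using only standard Java serialization. `SerializedPayload` is a hypothetical name. The resulting byte array is what would go into a binary stored field, and that Lucene side is not shown here. The usual caveat applies: serialized bytes are only readable as long as the class stays compatible.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.Arrays;

// Hypothetical helper: converts a Serializable object to bytes and back.
class SerializedPayload {
    // Serialize the object into a byte array suitable for binary storage.
    static byte[] toBytes(Serializable value) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        ObjectOutputStream out = new ObjectOutputStream(buffer);
        try {
            out.writeObject(value);
        } finally {
            out.close();
        }
        return buffer.toByteArray();
    }

    // Rebuild the object from bytes produced by toBytes.
    static Object fromBytes(byte[] bytes) throws IOException, ClassNotFoundException {
        ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes));
        try {
            return in.readObject();
        } finally {
            in.close();
        }
    }

    public static void main(String[] args) throws Exception {
        ArrayList<String> original = new ArrayList<String>(Arrays.asList("a", "b"));
        byte[] bytes = toBytes(original);
        System.out.println(fromBytes(bytes).equals(original));
    }
}
```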

search for percent char with lucene

2009-07-03 Thread shbn
Hi, I am trying to find results with percent chars, without any success. For example, the word " bla bla 1%": when I type 'bla bla' I get results, but if I type '%' I get 0 results. Does anyone have any idea? Thanks -- View this message in context: http://www.nabble.com/search-for-perce

addIndexesNoOptimize

2009-07-03 Thread John Wang
Hi guys: Running into a question with IndexWriter.addIndexesNoOptimize: I am trying to expand a smaller index by replicating it into a larger index. So I am adding the same directory N times. I get an exception because noDupDirs(dirs) fails. For this call, is this check necessary?

Re: Storing a serialized object ?

2009-07-03 Thread Erick Erickson
H. I'm having trouble understanding what you want to accomplish and why you think storing a java object is appropriate to do in a Lucene index. Perhaps you could expand on your use case here. Best Erick On Fri, Jul 3, 2009 at 3:32 PM, MilleBii wrote: > I want to store in the index a data s

Re: search for percent char with lucene

2009-07-03 Thread Erick Erickson
You have to tell us what analyzers you are using. Many analyzers will throw out non-alphanumeric characters. Even better, a small, self-contained test case illustrating your problem would help us help you. Best Erick On Fri, Jul 3, 2009 at 5:11 PM, shbn wrote: > > Hi, > I am trying to find result
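
To illustrate why the analyzer matters, here is a plain-Java sketch of two tokenization strategies. These are simplified stand-ins written for this example, not Lucene's analyzers, and `TokenizeDemo` is a hypothetical name. Splitting on whitespace keeps "1%" as a token, while keeping only letters and digits drops the "%" entirely, so a search for "%" has nothing to match.

```java
import java.util.ArrayList;
import java.util.List;

// Simplified stand-ins for two tokenization strategies (not Lucene classes).
class TokenizeDemo {
    // Split on whitespace only; punctuation stays attached to the token.
    static List<String> whitespaceTokens(String text) {
        List<String> tokens = new ArrayList<String>();
        for (String token : text.trim().split("\\s+")) {
            if (token.length() > 0) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    // Keep only runs of letters and digits; every other character is
    // treated as a separator and discarded.
    static List<String> alphanumericTokens(String text) {
        List<String> tokens = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(whitespaceTokens("bla bla 1%"));
        System.out.println(alphanumericTokens("bla bla 1%"));
    }
}
```

Note that even with whitespace splitting the indexed token is "1%", so a query for "%" alone would still not be an exact term match.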