Re: ClassCastException when writing to index writer

2008-10-06 Thread Paul Chan
Hi Edwin, Would love to post the code, but I don't have the Terracotta source code at the moment as I only downloaded the binaries. But Terracotta is an open source project so you can check out their code base ( http://www.terracotta.org/web/display/orgsite/DownloadCatalog). I have no idea how th

Re: ClassCastException when writing to index writer

2008-10-06 Thread Edwin Lee
Hi Paul, now I'm curious to know how the overridden java.util.Vector implements the clone method. Any chance of posting a snippet? ;) Cheers, Edwin --- Paul Chan <[EMAIL PROTECTED]> wrote: > Seems like I found the culprit to the problem. Because I am using > Terracotta to do JVM clusteri

Re: ClassCastException when writing to index writer

2008-10-06 Thread Paul Chan
Seems like I found the culprit to the problem. Because I am using Terracotta to do JVM clustering, it has overridden many classes of java.util.*. If I take out Terracotta from my configuration, then I don't get the ClassCastException problem any longer. I will talk to the Terracotta folks to s
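One way a replaced base class could produce this kind of ClassCastException is sketched below. This is only an illustration under assumptions: the thread does not show how the overridden java.util.Vector implements clone(), and the classes `BrokenBase` and `Sub` are hypothetical stand-ins, not Terracotta or Lucene code. The sketch assumes the base class builds its copy by hand instead of calling Object.clone(), so a subclass that casts the result of super.clone() to its own type fails.

```java
import java.util.ArrayList;
import java.util.List;

public class CloneCastSketch {
    // Hypothetical stand-in for a replaced collection base class whose clone()
    // constructs a new instance by hand instead of delegating to Object.clone().
    static class BrokenBase implements Cloneable {
        final List<String> items = new ArrayList<>();

        @Override
        public Object clone() {
            BrokenBase copy = new BrokenBase(); // always the base type, never the subclass
            copy.items.addAll(items);
            return copy;
        }
    }

    // Hypothetical subclass that casts the result of super.clone() to its own
    // type, as many clone() implementations do.
    static class Sub extends BrokenBase {
        @Override
        public Object clone() {
            return (Sub) super.clone(); // throws: the runtime type is BrokenBase
        }
    }

    // Returns true if cloning the subclass fails with a ClassCastException.
    static boolean cloneThrowsClassCast() {
        try {
            new Sub().clone();
            return false;
        } catch (ClassCastException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("clone() threw ClassCastException: " + cloneThrowsClassCast());
    }
}
```

If the base class called Object.clone() instead, the copy would keep the subclass's runtime type and the cast would succeed, which would be consistent with the problem going away once the replaced classes are removed.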

Re-tokenized fields disappear

2008-10-06 Thread John Griffin
My previous question may be moot, but as it stands it is still a problem. Here's a little more info on my problem. The same named fields contain two pieces of information, a code "B05" and a value "1" as follows. The value can be a range such as 1 to 5 or 1 to 100. "codesearch", "B05 1" This field

advice on using Lucene for sorting based on payloads

2008-10-06 Thread Alexander Devine
Hi Luceners, I have a particular sorting problem and I wanted some advice on what the best implementation approach would be. We currently use Lucene as the searching engine for our vacation rental website. Each vacation rental property is represented by a single document in Lucene. We want to add

Re-tokenized fields disappear

2008-10-06 Thread John Griffin
Guys, I have documents with multiple stored, tokenized fields of the same name but different values in them such as: "codesearch", "B01" "codesearch", "B0105" "codesearch", "Q01" Etc; I receive a new code to add to the document so I create a copy of the document, call deleteFields

Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar

2008-10-06 Thread Edwin Smith
I upgraded to the latest, 3.3.2 and had the same problem, even though it was clearly a different lexer reading the text.   I did find some problems with the reader I was using, and it now reads some files that it didn't before, so it may still be some reader problem I haven't identified, but th

Re: bunch of newbie queries, PS

2008-10-06 Thread rolarenfan
Hoss -- Thanks for the reply -- >Paul: your subject line seems to indicate that your email is a followup >to previous questions, however I don't seem to have ever seen those >questions posted on the list, and I can't seem to find any previous emails >from you in any of the online archives -- so

Re: Field names with : in it.

2008-10-06 Thread samd
Hello and thanks for the response. Yes, the title should have been values. I'll take a look at the link you provided. Thanks again! Erick Erickson wrote: > > You have to escape it in a query. And did you mean to title your > post as field *names*? because I can't imagine how that would > work.

Re: Field names with : in it.

2008-10-06 Thread Erick Erickson
You have to escape it in a query. And did you mean to title your post as field *names*? Because I can't imagine how that would work. Field *values*, sure. See http://lucene.apache.org/java/docs/queryparsersyntax.html, search for "escape" On Mon, Oct 6, 2008 at 3:54 PM, samd <[EMAIL PROTECTED]>
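A minimal sketch of the escaping advice, assuming the user-entered value is spliced into a query string before parsing. The class `EscapeSketch` and its methods are hypothetical helpers written for illustration. They backslash-escape the special characters listed on the query parser syntax page, without depending on the Lucene JAR. Lucene's QueryParser also provides a static `escape(String)` method for this purpose, which is probably the better choice in real code.

```java
public class EscapeSketch {
    // Characters the query parser syntax page lists as special
    // (the two-character operators && and || are covered by & and |).
    private static final String SPECIAL = "+-&|!(){}[]^\"~*?:\\";

    // Prefix every special character in a user-entered value with a backslash.
    static String escapeQueryValue(String value) {
        StringBuilder sb = new StringBuilder(value.length() + 8);
        for (int i = 0; i < value.length(); i++) {
            char c = value.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    // Build a required clause such as +a.b.c:foo from a field name and raw user input.
    static String buildClause(String field, String userValue) {
        return "+" + field + ":" + escapeQueryValue(userValue);
    }

    public static void main(String[] args) {
        System.out.println(buildClause("a.b.c", "foo"));
        System.out.println(buildClause("a.b.c", "foo:bar"));
    }
}
```

With the value escaped, the colon in foo:bar is treated as part of the term rather than as a field separator, so the clause keeps a.b.c as its field.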

Field names with : in it.

2008-10-06 Thread samd
It appears that a ':' in an entered search field value causes some issues. If I have a field named a.b.c and the user enters a value of foo, the query constructed through the API is something like +a.b.c:foo. If they enter a value of foo:bar I get +foo:bar where foo is substitut

Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar

2008-10-06 Thread Edwin Smith
No particular reason. It is just what I had loaded last and hadn't upgraded. It sounds like there might be good reason to do that now.   Thanks for the tip.   Ed - Original Message From: Steven A Rowe <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Monday, October 6, 2008 3:1

RE: ArrayIndexOutOfBoundsException in FastCharStream.readChar

2008-10-06 Thread Steven A Rowe
Hi Edwin, I don't know specifically what's causing the exception you're seeing, but note that in Lucene 2.3.0+, the JavaCC-generated version of StandardTokenizer (where your exception originates) has been replaced with a JFlex-generated version - see

Re: Single searcher vs Multi Searcher

2008-10-06 Thread Anshum
Hi Ganesh, About the memory consumption while sorting, it would end up using similar amounts, perhaps even more.. like in the case of regular parallel programming algorithms (hoping that you intend to search using a parallel multi searcher). Would you have to query particular indexes only for a par

ArrayIndexOutOfBoundsException in FastCharStream.readChar

2008-10-06 Thread Edwin Smith
I am stumped and have not seen any other reference to this problem. I am getting the following exception on everything I try to index. Does anyone know what my problem might be? Thanks, Ed java.lang.ArrayIndexOutOfBoundsException at org.apache.lucene.analysis.standard.FastCharStream.readChar(

Re: ArrayIndexOutOfBoundsException in FastCharStream.readChar

2008-10-06 Thread Edwin Smith
Oh, and in case it matters, I'm using Lucene 2.2.0. Ed - Original Message I am stumped and have not seen any other reference to this problem. I am getting the following exception on everything I try to index. Does anyone know what my problem might be? Thanks, Ed java.lang.ArrayI

Re: bunch of newbie queries, PS

2008-10-06 Thread Chris Hostetter
: Subject: bunch of newbie queries, PS Paul: your subject line seems to indicate that your email is a followup to previous questions, however I don't seem to have ever seen those questions posted on the list, and I can't seem to find any previous emails from you in any of the online archives

Re: ClassCastException when writing to index writer

2008-10-06 Thread Paul Chan
Hi Edwin, Yes, I am running the TestVector against the same class path as my actual application. My class path contains the Lucene JAR files. This is very bizarre because my TestVector class extends Vector and it works, yet SegmentInfos also extends Vector and for some reason it doesn't work. I d

Re: ClassCastException when writing to index writer

2008-10-06 Thread Edwin Lee
Hi Paul, When you say that your dummy TestVector class works, are you running it using the same classpaths that you used to run your actual application? Does it contain the Lucene JAR files? Regards, Edwin --- Paul Chan <[EMAIL PROTECTED]> wrote: > I have tried to recompile Lucene 2.3.2 und

RE: Memory eaten up by String, Term and TermInfo?

2008-10-06 Thread Edwin Lee
Hi, Probably off-topic, but I'd just like to plug my blog post here: http://tinyurl.com/4vytcc :p (incidentally, Java GC is one of my favourite topics) It's not very detailed, but I would like to think it's a good place to start reading... Just like to point out a couple of things: 1. If yo

RE: Memory eaten up by String, Term and TermInfo?

2008-10-06 Thread Peter Cheng
Oh yes, I forgot to mention that MaxPermSize is very useful, and may be another key that solves my problem. I haven't tried UseConcMarkSweepGC and the other two parameters, and I will try them instead of my own GC thread to see whether the problem can also be solved. Thanks Brian! Regards, Gong

Re: Document larger than setRAMBufferSizeMB()

2008-10-06 Thread Michael McCandless
OK, it sounds like you need to increase the RAM your JVM is allowed to use, or make your documents smaller. Mike Aditi Goyal wrote: Thanks for showing interest Mike. The OOME comes in the middle of setting a value of one of the fields in the doc. That field has a fairly large value. May b

RE: Memory eaten up by String, Term and TermInfo?

2008-10-06 Thread Beard, Brian
I played around with GC quite a bit in our app and found the following java settings to help a lot (Used with jboss, but should be good for any jvm). set JAVA_OPTS=%JAVA_OPTS% -XX:MaxPermSize=512M -XX:+UseConcMarkSweepGC -XX:+CMSPermGenSweepingEnabled -XX:+CMSClassUnloadingEnabled While these set

Re: Document larger than setRAMBufferSizeMB()

2008-10-06 Thread Aditi Goyal
Thanks for showing interest Mike. The OOME comes in the middle of setting a value of one of the fields in the doc. That field has a fairly large value. Maybe that could have been the reason? On Fri, Oct 3, 2008 at 4:57 PM, Michael McCandless < [EMAIL PROTECTED]> wrote: > > Note that large stor

RE: Advise for Mediabase with Lucene

2008-10-06 Thread Ard Schrijvers
Hello Mathias, IMHO sounds like you are planning to re-invent the wheel while all things you want (AFAICS) are already largely available as open source projects, and perhaps more important, open standards. Your hierarchical data storage sounds like jsr-170 and jsr-283 are the open standard solu