Re: Query without Analyzer

2007-09-11 Thread Erick Erickson
Sure, but it's probably not what you want to do. You can use a PerFieldAnalyzerWrapper to analyze different fields with different analyzers. So your category field could be both indexed and analyzed with, say, WhitespaceAnalyzer or some such. Or, as testn suggests, store it UN_TOKENIZED. Each

Re: Storing Host and IP Information in Lucene

2007-09-11 Thread Erick Erickson
What kind of analyzers are you using for this? Might it work for you to put it in a special field and analyze and search with, say, WhitespaceAnalyzer (or an analyzer that lowercases and breaks on whitespace)? PerFIeldAnalyzerWrapper is your friend. Best Erick On 9/10/07, AnkitSinghal [EMAIL

Re: Searching Exact Word from Index

2007-09-11 Thread Erick Erickson
No, but you can create your own custom analyzer that does whatever you want it to. Best Erick On 9/11/07, Jawahar Lal [EMAIL PROTECTED] wrote: Can we made StandardAnalyzer not to cut the word on '-' by setting some property or something like that ? On 11/09/2007, Laxmilal Menaria [EMAIL

SV: SV: OutOfMemoryError tokenizing a boring text file

2007-09-11 Thread Per Lindberg
Från: Chris Hostetter [mailto:[EMAIL PROTECTED] : Setting writer.setMaxFieldLength(5000) (default is 1) : seems to eliminate the risk for an OutOfMemoryError, that's because it now gives up after parsing 5000 tokens. : To me, it appears that simply calling :new Field(content,

Re: Searching Exact Word from Index

2007-09-11 Thread Jae Joo
Use the string type for the field. !-- The StrField type is not analyzed, but indexed/stored verbatim. - StrField and TextField support an optional compressThreshold which limits compression (if enabled in the derived fields) to values which exceed a certain size (in

Span queries and complex scoring

2007-09-11 Thread melix
Hi, I'm working on an application which requires a complex scoring (based on semantics analysis). The scoring must be highly configurable, and I've found ways to do that, but I'm facing a discrete but annoying problem. All my queries are, basically, complex span queries. I mean for example a

Re: Questions Lucene

2007-09-11 Thread Steven Rowe
Hi Durga, I have moved this discussion to the java-user list, since the java-dev list is devoted to development of the Java Lucene library, and not to questions about its capabilities. My answers are inline below. [EMAIL PROTECTED] wrote: 1) What are the various languages supported by

How to get documents similar to other document ?

2007-09-11 Thread Ariel
Hi every body: My question is if there is an api function of lucene to obtain similar documents to other document comparing the term frequence vector of a field ??? I supposed a lot of people have asked this before but I haven't found the answer neither with google nor api lucene. This could be a

Re: Implement a filter to the search results

2007-09-11 Thread Erick Erickson
What version of Lucene are you using? I don't think the form b.add(query, false, false) is current. Be that as it may, I'm not quite sure what you mean by filter. If I'm reading your intent correctly, wouldn't requiring each non-empty field accomplish what you want? Which would be something like

Re: Implement a filter to the search results

2007-09-11 Thread M . K
Lucene 1.4.3 ignor the filter words. My question was: I have a search form which has an input area for key search and also three optional select boxs *Catagory, Department and Year. * How can I implement that? Is it correct the way I wrote in my last email. Thanks, MK On 9/11/07, Erick

Re: Implement a filter to the search results

2007-09-11 Thread Erick Erickson
Well, it depends upon how you indexed things G Your previous post was along the right lines as far as I can see though... Erick Assuming that you have On 9/11/07, M. K [EMAIL PROTECTED] wrote: Lucene 1.4.3 ignor the filter words. My question was: I have a search form which has an

Re: Implement a filter to the search results

2007-09-11 Thread M . K
Thanks Erick. MK On 9/11/07, Erick Erickson [EMAIL PROTECTED] wrote: Well, it depends upon how you indexed things G Your previous post was along the right lines as far as I can see though... Erick Assuming that you have On 9/11/07, M. K [EMAIL PROTECTED] wrote: Lucene 1.4.3

Re: How to get documents similar to other document ?

2007-09-11 Thread Grant Ingersoll
See the MoreLikeThis functionality in the contrib package, also search this archive for MoreLikeThis. On Sep 11, 2007, at 11:50 AM, Ariel wrote: Hi every body: My question is if there is an api function of lucene to obtain similar documents to other document comparing the term frequence

Re: Searching Exact Word from Index

2007-09-11 Thread Grant Ingersoll
Note, this is a Solr construct and does not apply to a custom Lucene application, even if the intent is correct. On Sep 11, 2007, at 9:34 AM, Jae Joo wrote: Use the string type for the field. !-- The StrField type is not analyzed, but indexed/stored verbatim. - StrField and

Re: Span queries and complex scoring

2007-09-11 Thread Paul Elschot
Cedric, In case your requirements allow this, try and use subclass of Spans that has a score() method that returns a value that is used together with the other span info to provide a score value to your own SpanScorer at the top level. This score value can summarize the influence of the

Re: How to get documents similar to other document ?

2007-09-11 Thread Ariel
Excuse me, Could you give more details ? Are you telling me that functionality exists ? Which class should I use for this ? I hope not being bothering you. Greetings On 9/11/07, Grant Ingersoll [EMAIL PROTECTED] wrote: See the MoreLikeThis functionality in the contrib package, also search this

Re: Searching Exact Word from Index

2007-09-11 Thread Jae Joo
Sorry. I was confused Solr and Lucene mailing list. Jae On 9/11/07, Grant Ingersoll [EMAIL PROTECTED] wrote: Note, this is a Solr construct and does not apply to a custom Lucene application, even if the intent is correct. On Sep 11, 2007, at 9:34 AM, Jae Joo wrote: Use the string type