how to control terms to be highlighted?

2005-11-30 Thread Harini Raghavan
Hi, I have a requirement to highlight search keywords in the results and display the matching fragment of the text with the results. I am using the Hits highlighting mentioned in Lucene in Action. Here is the search query(BooleanQuery) I am passing to the IndexSearcher and QueryScorer:

Why are tokens not being indexed?

2005-11-30 Thread Combs, Craig
I have a body of text which is being added to a document as unstored. All the words in the body text are coming through in the token stream for analyzing. For some reason I can search on some of tokens and others I can not. Take the following string: L'amministrazione di Uniface View consente

Re: how to control terms to be highlighted?

2005-11-30 Thread mark harwood
Is there anyway to restrict the highlighter to highlight only the values mentioned for the field 'Content'? The problem lies in the QueryTermsExtractor class which is typically used to provide the Highlighter with the list of strings to identify in the text. It currently has no filter for

Re: Why are tokens not being indexed?

2005-11-30 Thread Erik Hatcher
What Analyzer are you using? Have a look at the Analyzer demo with Lucene in Action's code, or from my java.net article so you can analyze your analyzer. Also try out Luke, it really is handy for seeing inside your index. And is your text really long ( 10,000 terms)? If so, you'll

A couple of questions regarding load balancing and failover

2005-11-30 Thread Daniel Pfeifer
Hi, I am working for a major Application Service Provider in Europe and we have now since a couple of months very successfully used Lucene 1.4. We are overall very pleased with it but as the load on the application which uses Lucene increased we were forced to invest in better hardware and also

RE: Why are tokens not being indexed?

2005-11-30 Thread Combs, Craig
I wrote my own analyzer: When I view the tokens returned from the StemFilter the words I'm searching for are returned from the Token Next() function. My terms are under 10,000. I have used luke before how can I make it use a custom analyzer? Thank you for the information. general code and

FunctionQuery

2005-11-30 Thread Yonik Seeley
I finally got around to updating FunctionQuery: http://issues.apache.org/jira/browse/LUCENE-446 Comments suggestions welcome. -Yonik Now hiring -- http://forms.cnet.com/slink?231706 - To unsubscribe, e-mail: [EMAIL PROTECTED]

Lucene in Action 2nd edition brainstorming

2005-11-30 Thread Erik Hatcher
On 29 Nov 2005, at 14:39, Malcolm wrote: Are you going to write another addition with lots of Servlet code? If that's the case put me down for an advance copy.Lucene and servlets is a direction I may be going in the future. To reiterate what Otis said, there wouldn't be much need to cover

Lucene + LSI

2005-11-30 Thread Chandana
Have any one implemented LSI in Lucene? Kindly let me know how hard/easy it is. thanks chandana

Re: Lucene + LSI

2005-11-30 Thread Lorenzo Viscanti
It depends on the kind of implementation you are thinking of. You can use Lucene to create the inputs to the LSI, and then use them in your own system. I've written that code and it works, for searches and clustering. But if you are figuring out an LSI enhanced Lucene search system (based on a

Re: Looking for Similarity Subclass Examples

2005-11-30 Thread Erik Hatcher
On 29 Nov 2005, at 19:19, Jack Xue wrote: Hello list, I am working on boosting query performance by writing my own Similarity Class (Rewrite tf(), idf(), boost(), lengthNorm()). I've gone through the defaultSimilarity class. Does anyone know other examples of Similarity Class

Re: Lucene + LSI

2005-11-30 Thread gekkokid
sorry have to ask - whats LSI - latent semantic indexing? _gk - Original Message - From: Lorenzo Viscanti [EMAIL PROTECTED] To: java-user@lucene.apache.org; [EMAIL PROTECTED] Sent: Thursday, December 01, 2005 12:02 AM Subject: Re: Lucene + LSI It depends on the kind of

Re: Lucene + LSI

2005-11-30 Thread rrshwrk
Yeah, its Latent Semantic Indexing On 12/1/05, gekkokid [EMAIL PROTECTED] wrote: sorry have to ask - whats LSI - latent semantic indexing? _gk - Original Message - From: Lorenzo Viscanti [EMAIL PROTECTED] To: java-user@lucene.apache.org; [EMAIL PROTECTED] Sent: Thursday, December

Re: how to control terms to be highlighted?

2005-11-30 Thread Harini Raghavan
Hi Mark, It would be great if you can make this change and send the QueryTermsExtractor class. I am invoking the QueryScorer(Query) contructor. Should I use QueryScorer(Query query, IndexReader reader, String fieldName) instead for this to work? Thanks, Harini mark harwood wrote: Is there

Re: Search Problem

2005-11-30 Thread wu fox
try to use mime format just as application/pdf and to see if it is feasible - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: how to control terms to be highlighted?

2005-11-30 Thread Chris Hostetter
I don't know what your application is, and I have no experience with the Highlighter code, so forgive me if this is a silly suggestion: It looks like you are building a query up programaticaly, which contains some words to search on, and some other stuff that's mainly being used to filter the