Re: Boost/Scoring question

2007-02-02 Thread Antony Bowesman
Thanks a lot for your answers Hoss. This list is really well supported! Antony Chris Hostetter wrote: : It's the index time boost, rather than query time boost. This short example : shows the behaviour of searches for A... index boosts! ... totally didn't occur to me that was what you

Re: Lucene Javadoc Exception - cause?

2007-02-02 Thread Michael McCandless
Josh Joy wrote: I was implementing some calls to Lucene, though was curious if there was some documentation I was missing that indicated why a method throws an exception. Example, IndexReader - deleteDocuments() - what is the root cause as to why it throws IOException? I'm trying to

Re: bad queryparser bug

2007-02-02 Thread Peter Keegan
(If I could go back in time and stop the AND/OR/NOT/&&/|| aliases from being added to the QueryParser -- I would) Yes, this is the cause of the confusion. Our users are accustomed to the boolean logic syntax from a legacy search engine (also common to many other engines). We'll have to convert

Adding headlines, path

2007-02-02 Thread DECAFFMEYER MATHIEU
Hi all, I have some simple questions that I can't find an answer to by googling: 1) I want to add headlines for a document: Field headlinesField = new Field("headlines", headlines, Field.Store.YES, Field.Index.TOKENIZED); But how do I separate the headlines from one another? Let's say I want to add

Re: bad queryparser bug

2007-02-02 Thread Erik Hatcher
On Feb 1, 2007, at 5:03 PM, Peter Keegan wrote: OK, I see that I'm not the first to discover this behavior of QueryParser. Can anyone vouch for the integrity of the PrecedenceQueryParser here: http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/

Problems with lucene-2.0 integration into an existing Java application

2007-02-02 Thread Mirko Kämpf
Hello, I am asking for help with the following problem: With lucene-2.0 I developed a module to do a text search in a folder on the hard disk. Everything works fine. The module gets the search string and returns a Vector with the filenames for further processing. The tester is in the same project

indexWriter

2007-02-02 Thread e.j.w.vanbloem
Hello, Does anyone know why this does not work? The fileDir is of the class File and points to an empty temp directory. I tried switching true and false. import org.apache.lucene.index.IndexWriter; File indexDir = new File("c:\temp"); IndexWriter writer = new IndexWriter(indexDir, new

RE : indexWriter

2007-02-02 Thread Virlouvet Olivier
Hi Erik, The \ must be escaped in the path, i.e.: ...new File("c:\\temp"); Regards, Olivier [EMAIL PROTECTED] wrote: Hello, Does anyone know why this does not work? The fileDir is of the class File and points to an empty temp directory. I tried switching true and false import
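
The escaping point above can be checked without Lucene. The following is a minimal sketch; the class and method names are illustrative and do not come from the thread. In a Java string literal, \t is a tab character, so "c:\temp" compiles but does not name the intended directory.

```java
public class PathEscapeDemo {
    // Unescaped: the compiler reads \t as a TAB, giving c:<TAB>emp (6 characters).
    static String unescaped() {
        return "c:\temp";
    }

    // Escaped: \\ is one literal backslash, giving c:\temp (7 characters).
    static String escaped() {
        return "c:\\temp";
    }

    public static void main(String[] args) {
        System.out.println(unescaped().length());       // 6
        System.out.println(escaped().length());         // 7
        System.out.println(unescaped().contains("\t")); // true
        System.out.println(escaped().indexOf('\\'));    // 2
    }
}
```

Either string can be handed to new File(...), but only the escaped one refers to c:\temp.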

Re: Multiword Highlighting

2007-02-02 Thread Mark Miller
I have been away from this for a week, but my interest has started building again. The whole spans implementation seems to work great for finding the actual hits but there is a somewhat annoying limitation: because I am using Spans it seems I can only either highlight the entire found span or

RE: RE : indexWriter

2007-02-02 Thread e.j.w.vanbloem
I already tried that, but it does not help. Do I also have to create a file in that dir? Or is it some other problem? import org.apache.lucene.index.IndexWriter; File indexDir = new File("c:\\temp"); IndexWriter writer = new IndexWriter(indexDir, new StandardAnalyzer(), false);

RE: RE : indexWriter

2007-02-02 Thread e.j.w.vanbloem
Got it, thanks. From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Fri 2-2-2007 16:59 To: java-user@lucene.apache.org Subject: RE: RE : indexWriter I already tried that but does not help do I also have to create a file in that dir? Or is it some

Re: Multiword Highlighting

2007-02-02 Thread mark harwood
Hi Mark, Have you looked at the returned spans from any other potential problem scenarios (other than the 3 word one you suggest) e.g. complex nested SpanOr or SpanNot logic? Or there just may not be much interest There's certainly interest on my part on seeing this merged with the existing

Re: Adding headlines, path

2007-02-02 Thread Otis Gospodnetic
1) Field headlinesField = new Field("headlines", headline1, Field.Store.YES, Field.Index.TOKENIZED); Field headlinesField = new Field("headlines", headline2, Field.Store.YES, Field.Index.TOKENIZED); ... 2) path + url - you are probably looking at the demo. That is *just* a demo and those

RE: Adding headlines, path

2007-02-02 Thread DECAFFMEYER MATHIEU
Thank you, Otis! Last question about this: In what way will this differ? A) Field headlinesField = new Field("headlines", "hello", Field.Store.YES, Field.Index.TOKENIZED); Field headlinesField = new Field("headlines", "world", Field.Store.YES, Field.Index.TOKENIZED); B) Field headlinesField = new

Re: Adding headlines, path

2007-02-02 Thread Otis Gospodnetic
It won't differ, I believe. Should be easy to test. Otis - Original Message From: DECAFFMEYER MATHIEU [EMAIL PROTECTED] To: java-user@lucene.apache.org Sent: Friday, February 2, 2007 12:07:37 PM Subject: RE: Adding headlines, path Thank u Otis ! Last question about this : In what way

how to store and reuse query

2007-02-02 Thread Ng Vinny
Dear List, After processing some text, I have constructed a Boolean query, e.g. title:Lucene author:Otis, and I would like to store the query to reuse when the same piece of text is issued (to avoid re-processing of the same text). I thought of storing the string resulting from Query.toString()

IDFrequency

2007-02-02 Thread DECAFFMEYER MATHIEU
Hi, The score depends on 1. the query 2. the matched document 3. the index. I don't really understand why the index must influence the score (why it has been implemented that way). Let's say I have this page Logistics.htm in which the word experience appears just once. It will get a high

Re: how to store and reuse query

2007-02-02 Thread Erick Erickson
I doubt that this is a worthwhile place to spend your time. You're introducing possibilities for error and saving very, very, very little time I suspect. I'd only try this if you have *proof* that constructing the query takes enough time to matter. Have you timed query construction to see if it's
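
Timing the construction step, as suggested above, can be sketched in plain Java. This is only an illustration under stated assumptions: buildQuery is a hypothetical stand-in for the poster's text processing, not a Lucene API, and the query string is the one quoted in the thread.

```java
import java.util.function.Supplier;

public class QueryTimingDemo {
    // Hypothetical stand-in for the text-processing step that yields a query string.
    static String buildQuery(String title, String author) {
        return "title:" + title + " author:" + author;
    }

    // Average nanoseconds per call, measured after an equally long warm-up pass.
    static double averageNanos(Supplier<String> work, int iterations) {
        long sink = 0; // consume the results so the JIT cannot discard the work
        for (int i = 0; i < iterations; i++) {
            sink += work.get().length();
        }
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            sink += work.get().length();
        }
        long elapsed = System.nanoTime() - start;
        if (sink < 0) {
            throw new IllegalStateException("unreachable; keeps sink live");
        }
        return elapsed / (double) iterations;
    }

    public static void main(String[] args) {
        double nanos = averageNanos(() -> buildQuery("Lucene", "Otis"), 100_000);
        System.out.println("average ns per construction: " + nanos);
    }
}
```

If the measured figure is negligible next to the search itself, storing and reloading queries is unlikely to pay off.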

Re: IDFrequency

2007-02-02 Thread Soeren Pekrul
DECAFFMEYER MATHIEU wrote: The score depends on 1. the query 2. the matched document 3. the index. I don't really understand why the index must influence the score (why it has been implemented that way). The score should be the similarity (inverse distance) between the query and the matched
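
One way to see why the index enters the score: a term that occurs in few documents distinguishes a match better than a term that occurs in most of them. The sketch below assumes an inverse document frequency of the form 1 + ln(numDocs / (docFreq + 1)), which appears to match Lucene's default similarity of this period; treat the exact formula, and the class and method names, as assumptions.

```java
public class IdfDemo {
    // Assumed idf form: 1 + ln(numDocs / (docFreq + 1)).
    // docFreq = number of documents containing the term; numDocs = documents in the index.
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    public static void main(String[] args) {
        int numDocs = 1000;
        double rare = idf(1, numDocs);     // term found in a single document
        double common = idf(500, numDocs); // term found in half the index
        System.out.println("rare term idf:   " + rare);
        System.out.println("common term idf: " + common);
        System.out.println(rare > common); // true
    }
}
```

The same document and query therefore score differently as the rest of the index changes, because docFreq and numDocs change with it.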

Re: Multiword Highlighting

2007-02-02 Thread Mark Miller
mark harwood wrote: Hi Mark, Have you looked at the returned spans from any other potential problem scenarios (other than the 3 word one you suggest) e.g. complex nested SpanOr or SpanNot logic? Nothing super intense, but I have looked at some semi-complex nesting and it all looks great if

Re: how to store and reuse query

2007-02-02 Thread Erik Hatcher
On Feb 2, 2007, at 1:02 PM, Ng Vinny wrote: After processing some text, I have constructed a Boolean query, e.g. title:Lucene author:Otis, and I would like to store the query to reuse when the same piece of text is issued (to avoid re-processing of the same text). Solr (can't help but

adding keywords to a field

2007-02-02 Thread S Edirisinghe
Hi, Is it possible to add keywords to a field? For example, if I add a field to a document and then save it to the index, and after that I find it again to edit and want to add another keyword to the field. What is the new API call to add to a stored keyword field? Thanks

Re: adding keywords to a field

2007-02-02 Thread Erick Erickson
No. Update-in-place has been on the wish list for some time. To modify a document, you must delete and re-add it, which can be a problem if the document has fields that are not stored. Best, Erick On 2/2/07, S Edirisinghe [EMAIL PROTECTED] wrote: Hi, Is it possible to add keywords to a

Stefan Raspl/Germany/IBM is out of the office.

2007-02-02 Thread Stefan Raspl
I will be out of the office starting 02/03/2007 and will not return until 02/12/2007. I will respond to your message when I return.