Re: SpanFirstQuery is not taking wildcard characters (like *) as a logical operator for the preffix

2008-11-28 Thread naveen.a
Hi, Thanks for your reply, The actual problem is when the user is entering data in a textfield, the suggession list should come below the textfield, so when the character 'l' is entered, the suggessions should come as starts with 'l' and if he entered the full word say 'library ' and then he giv

Re: how to search for starts with multiple words in lucene

2008-11-28 Thread naveen.a
Hi, Thanks for your replies, please go to this link for the actual problem http://www.nabble.com/SpanFirstQuery-is-not-taking-wildcard-characters-(like-*)-as-a-logical-operator-for-the-preffix-td20719556.html#a20719556 http://www.nabble.com/SpanFirstQuery-is-not-taking-wildcard-characters-(like

Latest stable release?

2008-11-28 Thread Chris Bamford
Hi Can anyone tell me what the latest stable release is? http://lucene.apache.org/java/docs/index.html doesn't say. Thanks, - Chris - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECT

Re: Latest stable release?

2008-11-28 Thread Ian Lea
Ummm ... I know it because there was an email sent to this list on 11-Oct saying "Release 2.4.0 of Lucene is now available!". Doesn't explicily say stable, but that is the implication. I'm not sure about any convention but it seems a fair bet. -- Ian. On Fri, Nov 28, 2008 at 11:32 AM, Chris

Maintain last indexed information in a file or DB

2008-11-28 Thread Ganesh
I am using Lucene v2.4. I am indexing files from various folder and i have to maintain a bookmark of what i have last indexed in each folder. Initially i thought to save the state in each respective folder. Index Wrtier always has documents in memory and it commits in a intervals. In an unexp

Re: Maintain last indexed information in a file or DB

2008-11-28 Thread Ian Lea
I'm a bit confused about what exactly is stored in folder and index and database, but how about you store the bookmark information in the same lucene index that you are using for the file data. One lucene document per folder, with fields something like folder: /some/dir/somewhere bookmark: some_b

Re: calculating score - implementing your own 'Scorer' - how to?..

2008-11-28 Thread Grant Ingersoll
Hi Vlad, I believe you can achieve this using the function package (i.e. Function Queries, org.apache.lucene.search.function).The other thing that comes to mind is, if you don't truly need the bit mask for other things, is to encode it into the Field as a boost value. This may be com

Re: Maintain last indexed information in a file or DB

2008-11-28 Thread Ganesh
My application is similar to google or msn desktop but the data would be voluminous. Some set of files are there in each folder and new files could be added to this folder. I have to pick the new one and index it. I could very well add some fields like folder name, filename, modified datetime et

Re: Transforming german umlaute like ö,ä,ü,ß into oe, ae, ue, ss

2008-11-28 Thread csantos
Hi, I'm a newbie with Lucene and I started some testing with Lucene 2.2.0. I developed my own Analyser and my own Filter based on examples found here, that is: public class DiacriticAnalyser extends GermanAnalyzer { @Override public TokenStream tokenStream(String fieldName, Reader reade

Re: Latest stable release?

2008-11-28 Thread Ian Lea
2.4.0. -- Ian. On Fri, Nov 28, 2008 at 11:16 AM, Chris Bamford <[EMAIL PROTECTED]> wrote: > Hi > > Can anyone tell me what the latest stable release is? > http://lucene.apache.org/java/docs/index.html doesn't say. > > Thanks, > > - Chris ---

Re: Maintain last indexed information in a file or DB

2008-11-28 Thread Michael McCandless
Not sure if it's a fit here, but the 2.9 (not yet released) release of Lucene allows you to specify metadata when you call commit, ie commit(String userData). This way each commit point can record "something" application specific to describe it. Mike Ganesh wrote: My application is si

Re: Latest stable release?

2008-11-28 Thread Chris Bamford
Thanks Ian. Is that the convention - the top of the list on http://lucene.apache.org/java/docs/index.html is always the latest stable release - or do you know that by some other means? Cheers, - Chris Ian Lea wrote: 2.4.0. -- Ian. On Fri, Nov 28, 2008 at 11:16 AM, Chris Bamford <[EMAI

controlled indexing with Lucene

2008-11-28 Thread Amir Hossein Jadidinejad
Hi, I'm going to index some documents only with known phrases. Let me describe: Suppose that I have a controlled vocabulary of phrases (A list of some candidate phrases). I intend to index ONLY these phrases within my documents and have a retrieval model (for example simple VS-TF.IDF) that each

PhraseQuery and non-letter characters

2008-11-28 Thread Ng Vinny
Hi, I'm having an issue with PhraseQuery in which a query for the phrase "information technology" has among of its matches the strings "information, technology" and "information. Technology", which should not be considered as matches. Both StopAnalyzer StandardAnalyzer removes non-letter charact

Re: PhraseQuery and non-letter characters

2008-11-28 Thread Ian Lea
I suggest you write your own analyzer that doesn't remove non-letter characters at index time. There might be one out there already, but not that I can think of off hand. Instead of leaving the non-letters in place you might consider doing something with position increments. I think that would pr

Controlled Indexing -New Feature

2008-11-28 Thread Amir Hossein Jadidinejad
Hi, I'm going to index some documents only with known phrases. Let me describe: Suppose that I have a controlled vocabulary(a list of some known phrases/words). I intend to index ONLY these phrases within my documents and have a retrieval model (for example simple VS-TF.IDF). In way, each index

Deleting from Index by URL field: is it safe?

2008-11-28 Thread Niels Ott
Hi all, I want to safely delete documents from my index. There is an URL field that specifies where the document came from. I'm using something like this: indexwriter.deleteDocuments(new Term("URL", myURL)); (inspired by the Lucene in Action Book, page 35.) I'm uncertain whether this is

Re: SpanFirstQuery is not taking wildcard characters (like *) as a logical operator for the preffix

2008-11-28 Thread Karl Wettin
There has been several discussions on this list on how to create an "auto complete" feature. What solution fits best for you depends on your needs and the behavior of your users, i.e. what sort of documents it is your index contains and what part of the documents your users are searching fo

Re: Deleting from Index by URL field: is it safe?

2008-11-28 Thread German Kondolf
It works exactly as it does when you search of that term. Review in your index creation, if you store it without analyzing it (Index.UN_TOKENIZED), it will only match that document when you have an exact URL. It's possible that the URL is not unique enought in your domain, there is no other uniqu

Re: Controlled Indexing -New Feature

2008-11-28 Thread German Kondolf
You could use a "reverse" stop-word filter. The straight "StopFilter" actually removes the keywords that match with a given Set of words, you could do the reverse logic of that an remove ALL keywords that doesn't match that Set. Take a look at StopFilter and StandardAnalyzer ;) On Fri, Nov 28, 2

Re: Controlled Indexing -New Feature

2008-11-28 Thread Amir Hossein Jadidinejad
Yes, It's good to filter words but suppose that my controlled vocabulary contains synonyms and etc.Exactly, I want to supervise on indexing process. Thank you very much. From: German Kondolf <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, Novem