Re: Atomicity in Lucene operations

2004-10-19 Thread Nader Henein
As soon as I've cleaned up the code, I'll publish it, it needs a little more documentation as well. Nader Roy Shan wrote: Maybe you can contribute it to sandbox? On Mon, 18 Oct 2004 08:31:30 -0700 (PDT), Yonik Seeley [EMAIL PROTECTED] wrote: Hi Nader, I would greatly appreciate it if you

Range Query

2004-10-19 Thread Karthik N S
Hi Guys Apologies. I have a Field Type Text 'ItemPrice' , Using it to Store Price Factor in numeric such as 10, 25.25 , 50.00 If I am suppose to Find the Range factor between 2 prices ex - Contents:shoes +ItemPrice:[10.00 TO 50.60] I get results

RE: Range Query

2004-10-19 Thread Chuck Williams
Range queries use a lexicographic (dictionary) order. So, assuming all your values are positive, you need to ensure that the integer part of each number has a fixed number of digits (pad with leading 0's). The fractional part should be fine, although 1.0 will follow 1. If you have negative

Search Help in word doc

2004-10-19 Thread Natarajan.T
Hi FFI, I am indexing multiple documents like (word,excel,html,ppt,pdf) at the time of indexing there is no problem. My search results contents(description) comes with small Boxes(this is happening only word documents) I think this is happening because of some special characters

RE: QueryParsing

2004-10-19 Thread Rupinder Singh Mazara
hi erik and everyone else ok i will buy the book ;) but this still does not solve the problem of why String x = \jakarta apache\~100; is being transalted as a PhraseQuery FULL_TEXT:jakarta apache~100 is the correct query beining formed ? or is there something wrong with the Proximity

RE: Search Help in word doc

2004-10-19 Thread Cocula Remi
Seen that. I use the Character.isISOControl() function to identify and remove these characters. -Message d'origine- De : Natarajan.T [mailto:[EMAIL PROTECTED] Envoyé : mardi 19 octobre 2004 10:37 À : [EMAIL PROTECTED] Objet : Search Help in word doc Hi FFI, I am indexing multiple

RE: Search Help in word doc

2004-10-19 Thread Natarajan.T
Hi Remi, Thanks for your response... Pls send me the jar name with sample code. Thanks, Natarajan. -Original Message- From: Cocula Remi [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 2:26 PM To: Lucene Users List Subject: RE: Search Help in word doc

RE: QueryParsing

2004-10-19 Thread Morus Walter
Rupinder Singh Mazara writes: hi erik and everyone else ok i will buy the book ;) but this still does not solve the problem of why String x = \jakarta apache\~100; is being transalted as a PhraseQuery FULL_TEXT:jakarta apache~100 is the correct query beining formed ? or is

RE: Search Help in word doc

2004-10-19 Thread Cocula Remi
This sample code changes undesired characters into underscores. Document doc = char[] cs = doc.get(content).toCharArray(); StringBuffer sb = new StringBuffer(); for (int j=0;j Array.getLength(cs);j++) { if (!Character.isISOControl(cs[j])) { sb.append(cs[j]);

RE: QueryParsing

2004-10-19 Thread Rupinder Singh Mazara
thank you Morus this makes things very clear to me Regards Rupinder -Original Message- From: Morus Walter [mailto:[EMAIL PROTECTED] Sent: 19 October 2004 10:05 To: Lucene Users List Subject: RE: QueryParsing Rupinder Singh Mazara writes: hi erik and everyone else ok i will buy

RE: Search Help in word doc

2004-10-19 Thread Natarajan.T
Ok Thanks Remi -Original Message- From: Cocula Remi [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 2:37 PM To: Lucene Users List Subject: RE: Search Help in word doc This sample code changes undesired characters into underscores. Document doc = char[] cs =

RE: Search Help in word doc

2004-10-19 Thread Natarajan.T
Are you doing this functionality under indexing part or search part -Original Message- From: Cocula Remi [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 2:37 PM To: Lucene Users List Subject: RE: Search Help in word doc This sample code changes undesired characters into

RE: Search Help in word doc

2004-10-19 Thread Natarajan.T
Ok, Thanks a lot... -Original Message- From: Cocula Remi [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 3:14 PM To: Lucene Users List Subject: RE: Search Help in word doc In my case, search. But probably that the best is to do it at indexing time. -Message

Re: Arabic analyzer

2004-10-19 Thread Pierrick Brihaye
Hi, Scott Smith a écrit : Is anyone aware of an open source (non-GPL; i.e.., free for commercial use) Arabic analyzer for Lucene? Unfortunately (for you), my Arabic Analyzer for Java (http://savannah.nongnu.org/projects/aramorph) is GPL-ed. Does Arabic really require a stemmer as well (some of

using optimize and addDocument concurrently.

2004-10-19 Thread Stephen Halsey
Hi, My basic question is whether it is possible to continue to add documents to an index in one Thread while running a long running optimization of the index (approx 30 mins) in another thread. I'm using Lucene version 1.4.2. The concurrency matrix at

Null or no analyzer

2004-10-19 Thread Rupinder Singh Mazara
Hi All I have a question regarding selection of Analyzer's during query parsing i have three field in my index db_id, full_text, subject all three are indexed, however while indexing I specified to lucene to index db_id and subject but not tokenize them I want to give a single search

MultiSearcher object question

2004-10-19 Thread Jeff Munson
I've just indexed over 600,000 documents (index size = 12GB) and have a simple servlet to search the index. I am using the MultiSearcher object (I will add more indexes in the future) in a servlet to test searching. I have noticed that the instantiation of my MulitSearcher object is taking about

RE: using optimize and addDocument concurrently.

2004-10-19 Thread Aad Nales
Steve, The behavior that you descibe is as expected. I have tackled a similar problem to yours by creating a proxy object that acts as a gatekeeper to all IndexReader, IndexSearcher and IndexWriter operations. With fully synchronized access to all methods of the proxy you will not run into any

Thesaurus ...

2004-10-19 Thread Patricio Galeas
Hello, I'm a new user of Lucene, and a would like to use it to create a Thesaurus. Do you have any idea to do this? Thanks! kind regards P.Galeas - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands,

RE: Null or no analyzer

2004-10-19 Thread Aviran
You can use WhiteSpaceAnalyzer Aviran http://aviran.mordos.com -Original Message- From: Rupinder Singh Mazara [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 11:23 AM To: Lucene Users List Subject: Null or no analyzer Hi All I have a question regarding selection of

Downloading Full Copies of Web Pages

2004-10-19 Thread Luciano Barbosa
Hi folks, I want to download full copies of web pages and storage them locally as well the hyperlink structures as local directories. I tried to use Lucene, but I've realized that it doesn't have a crawler. Does anyone know a software that make this? Thanks,

Re: Range Query

2004-10-19 Thread Jonathan Hager
That is exactly right. It is searching the ASCII. To solve it I pad my price using a method like this: /** * Pads the Price so that all prices are the same number of characters and * can be compared lexigraphically. * @param price * @return */ public static String

Re: Thesaurus ...

2004-10-19 Thread David Spencer
Erik Hatcher wrote: Have a look at the WordNet contribution in the Lucene sandbox repository. It could be leveraged for part of a solution. It's something I contributed. Relevant links are: http://jakarta.apache.org/lucene/docs/lucene-sandbox/ http://www.tropo.com/techno/java/lucene/wordnet.html