Re: Need advice: what Word/Excel/PowerPoint lib to use?

2004-10-26 Thread iouli . golovatyi
Many thanks to everybody for the interesting info. Regards, and have a nice day. J.

Re: Need advice: what pdf lib to use?

2004-10-26 Thread iouli . golovatyi
OK, but even in this case parsing the doc would not be a violation, because what we actually need for Lucene is just a collection of terms. It has nothing to do with printing or copying of _text_ pieces. As long as you provide a method returning just a Document (I mean a Lucene document) permissions

Large number of documents

2004-10-26 Thread Gard Arneson Haugen
Hi, I have just started looking at Lucene and am not an experienced user of Java, but from what I've been reading this search tool should manage large amounts of documents. I'm wondering if anyone has any experience using Lucene on a large number of documents. I need to be able to index and

Re: BooleanQuery - TooManyClauses

2004-10-26 Thread Erik Hatcher
On Oct 25, 2004, at 6:35 PM, Angelov, Rossen wrote: Why there is a limit on the number of clauses? and is there any harm in setting MaxClauseCount to Integer.MAX_VALUE? The harm is in performance and resource utilization. Rather than do this, though, read on... I'm using a Range Query on a

Re: Large number of documents

2004-10-26 Thread Otis Gospodnetic
Hello Gard, This is certainly doable, it just depends on your hardware, complexity of queries, frequency of queries, and such. There is a benchmark page on the Lucene site that you may want to check to get some ideas. Otis --- Gard Arneson Haugen [EMAIL PROTECTED] wrote: Hi, I have just

RE: BooleanQuery - TooManyClauses

2004-10-26 Thread Angelov, Rossen
On Oct 25, 2004, at 6:35 PM, Angelov, Rossen wrote: Why there is a limit on the number of clauses? and is there any harm in setting MaxClauseCount to Integer.MAX_VALUE? The harm is in performance and resource utilization. Rather than do this, though, read on... I'm using a Range Query on a

Re: BooleanQuery - TooManyClauses

2004-10-26 Thread Terry Steichen
I think what Erik's asking is whether you can live with expressing your indexed date in the form of MMDD, without the hour and minute extension. That will sharply reduce the number of range query expansion terms. If you're using the timestamp as a unique identifier, you might consider
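
To make the point about expansion terms concrete, here is a minimal sketch in plain Java (no Lucene dependency). It assumes the worst case where every minute in the range has at least one indexed document, so each distinct minute (or day) is one term the range query could expand to. The class name RangeTermCount and the method maxTerms are hypothetical names used only for illustration.

```java
import java.time.LocalDateTime;
import java.time.temporal.ChronoUnit;

class RangeTermCount {
    // Upper bound on the number of distinct terms an inclusive range
    // [from, to] can cover when values are indexed at the given granularity.
    // Assumption: at most one distinct term per unit of time.
    static long maxTerms(LocalDateTime from, LocalDateTime to, ChronoUnit unit) {
        return unit.between(from.truncatedTo(unit), to.truncatedTo(unit)) + 1;
    }

    public static void main(String[] args) {
        LocalDateTime from = LocalDateTime.of(2004, 10, 1, 0, 0);
        LocalDateTime to = LocalDateTime.of(2004, 10, 26, 23, 59);
        // Minute granularity: 26 days * 1440 minutes
        System.out.println(maxTerms(from, to, ChronoUnit.MINUTES)); // prints 37440
        // Day granularity: one term per day
        System.out.println(maxTerms(from, to, ChronoUnit.DAYS));    // prints 26
    }
}
```

Under these assumptions a range of less than a month can cover tens of thousands of terms at minute granularity but only a few dozen at day granularity, which is why the coarser field stays well under the clause limit.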

Re: Exception in thread main java.lang.NoClassDefFoundError

2004-10-26 Thread chandrakant gopalan
Hi Rob, I noticed that you are using org.apache.lucene.demos where it's just demo. Regards, CG. On Mon, 25 Oct 2004 21:54:38 +0100, Rob Hailey [EMAIL PROTECTED] wrote: I am using lucene version 1.4.2 but am consistently getting an error when I run this: java -verbose -classpath

RE: BooleanQuery - TooManyClauses

2004-10-26 Thread Angelov, Rossen
OK, I got that part: to limit the clause count, limit the range. In my case replace the timestamp with date and if it gets too big again replace the MMDD with MM and later with . And that of course includes fixing the old files every time so they have the new field. I was actually looking

RE: Aliasing problem

2004-10-26 Thread Chuck Williams
Looks like you produced a PhraseQuery rather than a BooleanQuery. You want +GAME:(doom3 3 doom) Chuck

Re: Aliasing problem

2004-10-26 Thread Daniel Naber
On Tuesday 26 October 2004 19:22, Abhay Saswade wrote: I tried the following but no luck. I have written an alias filter which returns 2 more tokens for doom3, namely 3 and doom. I construct the query +GAME:doom3. QueryParser returns +GAME:doom3 3 doom. Your approach is correct, but QueryParser doesn't yet

Re: BooleanQuery - TooManyClauses

2004-10-26 Thread Erik Hatcher
On Oct 26, 2004, at 1:55 PM, Angelov, Rossen wrote: OK, I got that part: to limit the clause count, limit the range. In my case replace the timestamp with date and if it gets too big again replace the MMDD with MM and later with . And that of course includes fixing the old files

RE: BooleanQuery - TooManyClauses

2004-10-26 Thread Vanlerberghe, Luc
Even if you need to be able to search on ranges that include the time, you could benefit from adding a few extra fields to your documents. For example: add a year field and an hour field: If the user then specifies a range between 2001-08-10 11:00 and 2004-10-11 13:00, you break it up behind the
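
The snippet above is cut off before it says how the range is broken up, so the following is only a sketch of one plausible decomposition, not the author's actual method. It assumes a hypothetical coarse field named year next to the full timestamp field: whole years in the middle of the range become single year terms, and only the two partial years at the edges remain as timestamp ranges. The class RangeSplit, the method split, and both field names are illustrative assumptions; the output strings only resemble query syntax and are not produced by Lucene.

```java
import java.time.LocalDateTime;
import java.util.ArrayList;
import java.util.List;

class RangeSplit {
    // Splits the inclusive range [from, to] into clause descriptions:
    // a timestamp range for the first partial year, one "year" term per
    // whole year in between, and a timestamp range for the last partial year.
    // Field names "timestamp" and "year" are hypothetical.
    static List<String> split(LocalDateTime from, LocalDateTime to) {
        List<String> clauses = new ArrayList<>();
        if (from.getYear() == to.getYear()) {
            // Nothing to gain from the year field within a single year.
            clauses.add("timestamp:[" + from + " TO " + to + "]");
            return clauses;
        }
        LocalDateTime endOfFirstYear = LocalDateTime.of(from.getYear(), 12, 31, 23, 59);
        clauses.add("timestamp:[" + from + " TO " + endOfFirstYear + "]");
        for (int y = from.getYear() + 1; y < to.getYear(); y++) {
            clauses.add("year:" + y); // one term covers the whole year
        }
        LocalDateTime startOfLastYear = LocalDateTime.of(to.getYear(), 1, 1, 0, 0);
        clauses.add("timestamp:[" + startOfLastYear + " TO " + to + "]");
        return clauses;
    }

    public static void main(String[] args) {
        LocalDateTime from = LocalDateTime.of(2001, 8, 10, 11, 0);
        LocalDateTime to = LocalDateTime.of(2004, 10, 11, 13, 0);
        // Four clauses: two edge ranges plus year:2002 and year:2003.
        split(from, to).forEach(System.out::println);
    }
}
```

The two edge ranges still expand at full timestamp granularity in this sketch. They could presumably be split further in the same way using additional coarse fields, such as the hour field the post mentions, but the source does not show that step.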