Re: pdfboxhelp

2004-08-23 Thread Santosh
Hi natarajan, I kept log4j.properties in the classpath my new classpath is .;..;C:\j2sdk1.4.1\lib;C:\j2sdk1.4.1\lib\jndi.jar;C:\j2sdk1.4.1\lib\webclien t.ja r;C:\j2sdk1.4.1\lib\mail.jar;C:\j2sdk1.4.1\lib\activation.jar;C:\j2sdk1.4.1\ lib\

Re: pdfboxhelp

2004-08-23 Thread Santosh
I kept the file in the classpath .;..;C:\j2sdk1.4.1\lib;C:\j2sdk1.4.1\lib\jndi.jar;C:\j2sdk1.4.1\lib\webclien t.ja r;C:\j2sdk1.4.1\lib\mail.jar;C:\j2sdk1.4.1\lib\activation.jar;D:\JAVAPRO;E:\ Prog ram Files\Apache Tomcat 4.0\common\lib\servlet.jar;C:\j2sdk1.4.1\lib\classes12.z

RE: memory leek in lucene?

2004-08-23 Thread iouli . golovatyi
Yes Terence, it's exactly what I do Terence Lai [EMAIL PROTECTED] 21.08.2004 01:50 Please respond to Lucene Users List To: Lucene Users List [EMAIL PROTECTED] cc: Subject:RE: memory leek in lucene? Category: Are you calling

Re: Lucene Search Applet

2004-08-23 Thread Simon mcIlwaine
Thanks Jon that works by putting the jar file in the archive attribute. Now im getting the disablelock error cause of the unsigned applet. Do I just comment out the code anywhere where System.getProperty() appears in the files that you specified and then update the JAR Archive?? Is it possible you

RE: Lucene with English and Spanish Best Practice?

2004-08-23 Thread Chad Small
Thanks for the info Grant. As for indexes, do you anticipate adding more fields later in Spanish? Is the content just a translation of the English, or do you have separate conetent in Spanish? Are your users querying in only one language (cross-lingual) or are the Spanish speakers only

Re: pdfboxhelp

2004-08-23 Thread Stephane James Vaucher
Your classpath should point to a directory that contains log4j.properties, not the file directly, see below. sv On Mon, 23 Aug 2004, Santosh wrote: Hi natarajan, I kept log4j.properties in the classpath my new classpath is C:\j2sdk1.4.1\lib\log4j.properties; should be C:\j2sdk1.4.1\lib\

RE: memory leek in lucene?

2004-08-23 Thread Otis Gospodnetic
Iouli Terence, Could you create a self-sufficient test case that demonstrates the memory leak? If you can do that, please open a new bug entry in Bugzilla (the link to it is on Lucene's home page), and then attach your test case to it. Thanks! Otis --- [EMAIL PROTECTED] wrote: Yes Terence,

Re: Lucene for Indian Languages

2004-08-23 Thread Satish Kagathare
Hi,Srinivasa, Use StandardAnaylzer for indexing and parsing query for Indian Lang. docs. It will work. Right now we r searching on Hindi,Marathi but without specific stemmers and filters. We r plannig to develop Marathi Morphological Analyzer. Thanks, Satish. On Sun, 22 Aug 2004, srinivasa

Re: Lucene Search Applet

2004-08-23 Thread Stephane James Vaucher
Hi Simon, Does this work? From FSDirectory api: If the system property 'disableLuceneLocks' has the String value of true, lock creation will be disabled. Otherwise, I think there was a Read-Only Directory hack: http://www.mail-archive.com/[EMAIL PROTECTED]/msg05148.html HTH, sv On Mon, 23

spanish stemmer

2004-08-23 Thread Ernesto De Santis
Hello I use the Snowball jar for implement my SpanishAnalyzer. I found that the words finished in 'bol' are not stripped. For example: In spanish for say basketball, you can say basquet or basquetbol. But for SpanishStemmer are different words. Idem with voley and voleybol. Not idem with futbol

NegativeArraySizeException when creating a new IndexSearcher

2004-08-23 Thread Sven
Hi Doug! Thank you very much for your answer! It solved the problem. I found an 1.3-version next to the 1.4-version and after removing the old one it works for fine now, as you said. Thanks again! Sven Date: Fri, 20 Aug 2004 14:08:57 -0700 From: Doug Cutting [EMAIL PROTECTED] Subject:

Re: Lucene Search Applet

2004-08-23 Thread Simon mcIlwaine
Hi Stephane, A bit of a stupid question but how do you mean set the system property disableLuceneLocks=true? Can I do it from a call from FSDirectory API or do I have to actually hack the code? Also if I do use RODirectory how do I go about using it? Do I have to update the Lucene JAR archive

integration of lucene with pdfbox

2004-08-23 Thread Santosh
I have downloaded pdfbox and lucene and kept jar files in the class path, I am able to work with both of them independently but how can I integrate both regards Santosh kumar ---SOFTPRO DISCLAIMER-- Information contained in this E-MAIL and any

Re: Lucene Search Applet

2004-08-23 Thread Simon mcIlwaine
Hi, Just used the RODirectory and I'm now getting the following error: java.security.AccessControlException: access denied (java.util.PropertyPermission user.dir read) I'm reckoning that this is what Jon was on about with System.getProperty() within certain files because im using an applet. Is

Re: integration of lucene with pdfbox

2004-08-23 Thread Ben Litchfield
If you can use lucene on its own then you already know how to add a lucene Document to the index. So you need to be able to take a PDF and get a lucene Document. org.pdfbox.searchengine.lucene.LucenePDFDocument.getDocument() does that for you. Ben On Mon, 23 Aug 2004, Santosh wrote: I

Re: Lucene Search Applet

2004-08-23 Thread Stephane James Vaucher
I haven't used it, and I'm a little confused from the code: /** ... * pIf the system property 'disableLuceneLocks' has the String value of * true, lock creation will be disabled. */ public final class FSDirectory extends Directory { private static final boolean DISABLE_LOCKS =

Re: Lucene Search Applet

2004-08-23 Thread Erik Hatcher
On Aug 23, 2004, at 10:48 AM, Stephane James Vaucher wrote: I haven't used it, and I'm a little confused from the code: /** ... * pIf the system property 'disableLuceneLocks' has the String value of * true, lock creation will be disabled. */ public final class FSDirectory extends Directory {

Re: Lucene Search Applet

2004-08-23 Thread Stephane James Vaucher
Thanks Erik for correcting me, I feel a bit stupid: I actually looked at the api to make sure that I wasn't in left field, but I trusted common-sense and stopped at the constructor ;) Should this property be changed in the next major release of lucene to org.apache...disableLuceneLocks? sv

Re: Lucene Search Applet

2004-08-23 Thread Erik Hatcher
On Aug 23, 2004, at 11:36 AM, Stephane James Vaucher wrote: Should this property be changed in the next major release of lucene to org.apache...disableLuceneLocks? Yes, that makes sense to put an org.apache.lucene prefix. If that is the case, it should be changed to disableLocks - no point in

RE: Lucene Search Applet

2004-08-23 Thread Jon Schuster
Hi all, The changes I made to get past the System.getProperty issues are essentially the same in the three files org.apache.lucene.index.IndexWriter, org.apache.lucene.store.FSDirectory, and org.apache.lucene.search.BooleanQuery. Change the static initializations from a form like this: public

Re: Lucene for Indian Languages

2004-08-23 Thread Praveen Peddi
Infact CJK analyzer also works well with indian languages. Since CJKAnalyzer considers the multi byte characters as special case, it works with most asian multi byte characters. I introduced CJKAnalyzer for japanese text search and we also tested with hindi and telugu languages. All our search

RE: spanish stemmer

2004-08-23 Thread Chad Small
Do you mind sharing how you implemented your SpanishAnalyzer using Snowball? Sorry I can't help with your question. I am trying to implement Snowball Spanish or a Spanish Analyzer in Lucene. thanks, chad. -Original Message- From: Ernesto De Santis [mailto:[EMAIL PROTECTED] Sent:

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
Yes, is too easy. You need do a wrapper for spanish Snowball initilization. analyzer = new SnowballAnalyzer(Spanish, SPANISH_STOP_WORDS); above the complete code. Bye, Ernesto. -- public class SpanishAnalyzer extends Analyzer { private static

Re: spanish stemmer

2004-08-23 Thread Grant Ingersoll
Ernesto, http://snowball.tartarus.org/texts/introduction.html might help w/ your understanding. The link provides basic info on why stemmer's are valuable (not necessarily any insight on how the Spanish version works). Of course, they don't solve every problem and in some cases may make

RE: spanish stemmer

2004-08-23 Thread Chad Small
Excellent Ernesto. Was there a reason you used your own stop word list and not just the default constructor SnowballAnalyzer(Spanish)? thanks, chad. -Original Message- From: Ernesto De Santis [mailto:[EMAIL PROTECTED] Sent: Monday, August 23, 2004 2:03 PM To: Lucene Users List

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
Because the SnowballAnalyzer, and SpanishStemmer donĀ“t have a default stopword set. SnowballAnalyzer constructor: /** Builds the named analyzer with no stop words. */ public SnowballAnalyzer(String name) { this.name = name; } Note the comment. Bye, Ernesto. - Original Message

RE: spanish stemmer

2004-08-23 Thread Chad Small
One more question to the group. From what I have gathered, my choices for indexing and querying Spanish content are: 1. StandardAnalyzer (I read that this analyzer could be used for European languages) 2. SnowballAnalyzer(Spanish, SPANISH_STOP_WORDS); --custom stop words from Ernesto class

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
Hi Chad One more question to the group. From what I have gathered, my choices for indexing and querying Spanish content are: 1. StandardAnalyzer (I read that this analyzer could be used for European languages) The StandardAnalyzer not is for European languages, is like a generic analyzer.

Re: Lucene for Indian Languages

2004-08-23 Thread srinivasa raghavan
Hi Satish, Thank you satish for the pointers. Actually, I am able to search Indian Language data by storing the content in the index in ISCII encoding. When I search, the search word(s) is also converted into ISCII encoded word(s) and hit the lucene index for search. It works pretty fine. But