Re: Zip Files

2005-03-01 Thread Ernesto De Santis
? Thanks, Luke - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] -- Ernesto De Santis - Colaborativa.net Córdoba 1147 Piso 6 Oficinas 3 y 4 (S2000AWO) Rosario, SF, Argentina

Re: Optimize not deleting all files

2005-02-04 Thread Ernesto De Santis
Hi all We have the same problem. We guess that the problem is that windows lock files. Our enviroment: Windows 2000 Tomcat 5.5.4 Ernesto. [EMAIL PROTECTED] escribió: Hi, When I run an optimize in our production environment, old index are left in the directory and are not deleted. My

Re: Disk space used by optimize - non space in disk corrupts index.

2005-02-04 Thread Ernesto De Santis
Hi all We have a big index and a little space in disk. When optimize and all space is consumed, our index is corrupted. segments file point to nonexistent files. Enviroment: java 1.4.2_04 W2000 SP4 Tomat 5.5.4 Bye, Ernesto. Yura Smolsky escribió: Hello, Otis. There is a big difference when you use

Re: Lucene and multiple languages

2005-01-20 Thread Ernesto De Santis
Hi Aurora I develop a tool with this multiple languages issue. I found very useful an nuke library language-identifier. This jar have nuke dependencies, but I delete all unnecessary code (for me obvious). This language-identifier that I use work fine and is very simple: For example:

Re: where is the SnowBallAnalyzer?

2004-09-08 Thread Ernesto De Santis
Is in snowball-1.0.jar I sent you it in private email. Bye Ernesto. - Original Message - From: Wermus Fernando [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Wednesday, September 08, 2004 1:12 PM Subject: where is the SnowBallAnalyzer? I have to look better, but why the

spanish stemmer

2004-08-23 Thread Ernesto De Santis
Hello I use the Snowball jar for implement my SpanishAnalyzer. I found that the words finished in 'bol' are not stripped. For example: In spanish for say basketball, you can say basquet or basquetbol. But for SpanishStemmer are different words. Idem with voley and voleybol. Not idem with futbol

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
in Lucene. thanks, chad. -Original Message- From: Ernesto De Santis [mailto:[EMAIL PROTECTED] Sent: Monday, August 23, 2004 8:30 AM To: Lucene Users List Subject: spanish stemmer Hello I use the Snowball jar for implement my SpanishAnalyzer. I found that the words finished in 'bol

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
. -Original Message- From: Ernesto De Santis [mailto:[EMAIL PROTECTED] Sent: Monday, August 23, 2004 2:03 PM To: Lucene Users List Subject: Re: spanish stemmer Yes, is too easy. You need do a wrapper for spanish Snowball initilization. analyzer = new SnowballAnalyzer(Spanish

Re: spanish stemmer

2004-08-23 Thread Ernesto De Santis
of the language (spanish for us). caminar, caminantes, camino are stored as camin. (Camin not exist in spanish). This improvement the quality of hits thanks, chad. Bye, Ernesto. -Original Message- From: Ernesto De Santis [mailto:[EMAIL PROTECTED] Sent: Monday, August 23, 2004 3:31 PM

Re: Index and Search question in Lucene.

2004-08-21 Thread Ernesto De Santis
Hi Dimitri What analyzer you use? You need take carefully with Keyword fields and analyzers. When you index a Document, the fields that have set tokenized = false, like Keyword, are not analyzed. In search time you need parse the query with your analyzer but not analyze the untokenized fields,

javadoc api

2004-08-17 Thread Ernesto De Santis
Hello Lucene developers A litle issue about a Field documentation. In Field class on getBoost() method it says: Returns the boost factor for hits on any field of this document. I think that this comment are copied from Document class and forgot change it. Bye Ernesto. --- Outgoing mail is

parce Query

2004-08-05 Thread Ernesto De Santis
Hello What is the best practice to parce a Query object.? QueryParcer only work with String, but if I have a Query? I want that anothers applications build yours lucene Query´s, and I want parse this when this applications do search with my server application. In my server application I store

Re: Weighting database fields

2004-07-21 Thread Ernesto De Santis
Hi Erik On Jul 21, 2004, at 11:40 AM, Anson Lau wrote: Is there any benefit to set the boost during indexing rather than set it during query? It allows setting each document differently. For example, TheServerSide is using field-level boosts at index time to control ordering by date,

Re: languages lucene can support

2004-07-01 Thread Ernesto De Santis
Hi Praveen You can develope your SpanishAnalyzer easily (or another language)with SnowballAnalyzer. I send you my SpanishAnalyzer. Bye, Ernesto. - Original Message - From: "Praveen Peddi" [EMAIL PROTECTED] To: "lucenelist" [EMAIL PROTECTED] Sent: Thursday, July 01, 2004 6:13 PM

Re: syntax of queries.

2003-12-19 Thread Ernesto De Santis
for toString on that page: http://today.java.net/pub/a/today/2003/11/07/QueryParserRules.html On Friday, December 12, 2003, at 10:38 PM, Ernesto De Santis wrote: Thanks Otis, I don´t resolve my problem. I see the Query sintaxis page, and the FAQ´s search section. I proof too many alternatives

Re: syntax of queries.

2003-12-12 Thread Ernesto De Santis
. - Original Message - From: Otis Gospodnetic [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED]; Ernesto De Santis [EMAIL PROTECTED] Sent: Friday, December 12, 2003 7:18 PM Subject: Re: syntax of queries. Maybe it's the spaces after title:? Try title:importar ... instead. Maybe it's

Re: Index pdf files with your content in lucene.

2003-11-12 Thread Ernesto De Santis
Hello well, not work zip the files. I can send files, if somebody won, to personal email. And if somebody can post this in a web site, very cool. I don´t post in a web site. Ernesto. - To unsubscribe, e-mail: [EMAIL

Index pdf files with your content in lucene.

2003-11-11 Thread Ernesto De Santis
Classes for index Pdf and word files in lucene. Ernesto. - Original Message - From: Ernesto De Santis [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Wednesday, October 29, 2003 12:04 PM Subject: Re: [opencms-dev] Index pdf files with your content in lucene. Hello all, Thans very much

Re: Index pdf files with your content in lucene.

2003-11-11 Thread Ernesto De Santis
and word files, with content. Is pure java. Use TextExtraction library. tm-extractors-0.2.jar Use POI and PDFBox. Ernesto Sorry for my bad English. Thanks, Otis --- Ernesto De Santis [EMAIL PROTECTED] wrote: Classes for index Pdf and word files in lucene. Ernesto. - Original Message

Index pdf files with your content in lucene.

2003-10-23 Thread Ernesto De Santis
Hello I am new in opencms and lucene tecnology. I won index pdf files, and index de content of this files. I work in this way: Make a PDFDocument class like JspDocument class. use org.textmining.text.extraction.PDFExtractor class, this class work fine out of vfs. and write my registry.xml