?
Thanks,
Luke
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--
Ernesto De Santis - Colaborativa.net
Córdoba 1147 Piso 6 Oficinas 3 y 4
(S2000AWO) Rosario, SF, Argentina
Hi all
We have the same problem.
We guess that the problem is that windows lock files.
Our enviroment:
Windows 2000
Tomcat 5.5.4
Ernesto.
[EMAIL PROTECTED] escribió:
Hi,
When I run an optimize in our production environment, old index are
left in the directory and are not deleted.
My
Hi all
We have a big index and a little space in disk.
When optimize and all space is consumed, our index is corrupted.
segments file point to nonexistent files.
Enviroment:
java 1.4.2_04
W2000 SP4
Tomat 5.5.4
Bye,
Ernesto.
Yura Smolsky escribió:
Hello, Otis.
There is a big difference when you use
Hi Aurora
I develop a tool with this multiple languages issue. I found very useful
an nuke library language-identifier. This jar have nuke dependencies,
but I delete all unnecessary code (for me obvious).
This language-identifier that I use work fine and is very simple:
For example:
Is in snowball-1.0.jar
I sent you it in private email.
Bye
Ernesto.
- Original Message -
From: Wermus Fernando [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Sent: Wednesday, September 08, 2004 1:12 PM
Subject: where is the SnowBallAnalyzer?
I have to look better, but why the
Hello
I use the Snowball jar for implement my SpanishAnalyzer. I found that the
words finished in 'bol' are not stripped.
For example:
In spanish for say basketball, you can say basquet or basquetbol. But for
SpanishStemmer are different words.
Idem with voley and voleybol.
Not idem with futbol
in Lucene.
thanks,
chad.
-Original Message-
From: Ernesto De Santis [mailto:[EMAIL PROTECTED]
Sent: Monday, August 23, 2004 8:30 AM
To: Lucene Users List
Subject: spanish stemmer
Hello
I use the Snowball jar for implement my SpanishAnalyzer. I found that the
words finished in 'bol
.
-Original Message-
From: Ernesto De Santis [mailto:[EMAIL PROTECTED]
Sent: Monday, August 23, 2004 2:03 PM
To: Lucene Users List
Subject: Re: spanish stemmer
Yes, is too easy.
You need do a wrapper for spanish Snowball initilization.
analyzer = new SnowballAnalyzer(Spanish
of the
language (spanish for us).
caminar, caminantes, camino are stored as camin. (Camin not exist in
spanish).
This improvement the quality of hits
thanks,
chad.
Bye, Ernesto.
-Original Message-
From: Ernesto De Santis [mailto:[EMAIL PROTECTED]
Sent: Monday, August 23, 2004 3:31 PM
Hi Dimitri
What analyzer you use?
You need take carefully with Keyword fields and analyzers. When you
index a Document, the fields that have set tokenized = false, like
Keyword, are not analyzed.
In search time you need parse the query with your analyzer but not
analyze the untokenized fields,
Hello Lucene developers
A litle issue about a Field documentation.
In Field class on getBoost() method it says:
Returns the boost factor for hits on any field of this document.
I think that this comment are copied from Document class and forgot change
it.
Bye
Ernesto.
---
Outgoing mail is
Hello
What is the best practice to parce a Query object.?
QueryParcer only work with String, but if I have a Query?
I want that anothers applications build yours lucene Query´s, and I want
parse this when this applications do search with my server application. In
my server application I store
Hi Erik
On Jul 21, 2004, at 11:40 AM, Anson Lau wrote:
Is there any benefit to set the boost during indexing rather than set
it
during query?
It allows setting each document differently. For example,
TheServerSide is using field-level boosts at index time to control
ordering by date,
Hi Praveen
You can develope your SpanishAnalyzer easily (or
another language)with SnowballAnalyzer.
I send you my SpanishAnalyzer.
Bye, Ernesto.
- Original Message -
From: "Praveen Peddi" [EMAIL PROTECTED]
To: "lucenelist" [EMAIL PROTECTED]
Sent: Thursday, July 01, 2004 6:13 PM
for toString on that page:
http://today.java.net/pub/a/today/2003/11/07/QueryParserRules.html
On Friday, December 12, 2003, at 10:38 PM, Ernesto De Santis wrote:
Thanks Otis, I don´t resolve my problem.
I see the Query sintaxis page, and the FAQ´s search section.
I proof too many alternatives
.
- Original Message -
From: Otis Gospodnetic [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]; Ernesto De
Santis [EMAIL PROTECTED]
Sent: Friday, December 12, 2003 7:18 PM
Subject: Re: syntax of queries.
Maybe it's the spaces after title:?
Try title:importar ... instead.
Maybe it's
Hello
well, not work zip the files.
I can send files, if somebody won, to personal email.
And if somebody can post this in a web site, very cool.
I don´t post in a web site.
Ernesto.
-
To unsubscribe, e-mail: [EMAIL
Classes for index Pdf and word files in lucene.
Ernesto.
- Original Message -
From: Ernesto De Santis [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Sent: Wednesday, October 29, 2003 12:04 PM
Subject: Re: [opencms-dev] Index pdf files with your content in lucene.
Hello all,
Thans very much
and word files, with
content.
Is pure java.
Use TextExtraction library.
tm-extractors-0.2.jar
Use POI and PDFBox.
Ernesto
Sorry for my bad English.
Thanks,
Otis
--- Ernesto De Santis [EMAIL PROTECTED] wrote:
Classes for index Pdf and word files in lucene.
Ernesto.
- Original Message
Hello
I am new in opencms and lucene tecnology.
I won index pdf files, and index de content of this files.
I work in this way:
Make a PDFDocument class like JspDocument class.
use org.textmining.text.extraction.PDFExtractor class, this class work fine out of vfs.
and write my registry.xml
20 matches
Mail list logo