Hi, I'm the author of the following italian document posted on this mailing 
list from Tun Lin the 3th December 2003.

Sorry for the huge delay of this reply, but I've just come back here after a 
very long time.
That document is referred to Lucy, a Java application I developed using Lucene 
and other useful open source libraries.

Lucy can index txt, html, pdf, doc, ppt, xls documents written in English 
and/or in Italian, with automatic language categorization and suitable stemming 
and filtering procedures.
Unfortunately I haven't translated the documentation to English yet, but if 
someone needs help, like Tun Lin did, please feel free to write to my e-mail 
address.
If the requests will be enough, I will post something like a FAQ document on 
this mailing list.

The last release of Lucy (1.2) can be downloaded from this webpage:
http://www.nsw2001.com/nsw2001/php/software.php
otherwise directly from this URL:
http://www.nsw2001.com/kenshir/lucy/lucy1.2.exe

Cheers! :)
Gimmy Pegoraro





From: Tun Lin <[EMAIL PROTECTED]>
Subject: Translation.
Date: Wed, 3 Dec 2003 09:42:02 +0800
Content-Type: multipart/alternative;
boundary="----=_NextPart_000_0007_01C3B981.B4EE1F10"

> Hi,
> 
> Can anyone translate this text for me? I cannot understand the
> instructions.
> Please help!
> 
> Thanks.
> 
> ===========
>  ____________
> |            |
> | LUCY 1.1   |   readme.txt    Ultimo aggiornamento: 18/03/2003
> |____________|
> 
> 
> 
> 
> 
> STRUTTURA
> 
> 
> Lucy 1.1      -> Lucene 1.2
>               -> HTMLParser 1.2
>               -> PdfBox 0.5.6
>               -> wvWare 0.7.2-3
>               -> xlhtml 0.4.9
>               -> antiword 0.33
>               -> Xpdf 2.01 
>               -> Snowball 0.1
>               -> NGramJ 01.12.11
>               -> it.corila.lucy       -> IndexAll.java
>                                       -> SearchIndex.java
>                                       -> HTMLDocument.java
>                                       -> PDFDocument.java
>                                       -> ExternalParser.java
>                                       -> ItalianStemFilter.java
>                                       -> EnglishStemFilter.java
>                                       -> ApostropheFilter.java
>                                       -> IndexAnalyzer.java
>                                       -> SearchAnalyzer.java
>                                       -> LanguageCategorizer
>                                       -> NgramjCategorizer.java
> 
> 
> 
> 
> 
> DESCRIZIONE
> 
> Lucy e' in grado di indicizzare tutti i files con estensione txt,
> html, pdf,
> doc, ppt, xls contenuti in una cartella base e nelle sue
> sottocartelle. Consente
> ricerche da linea di comando DOS oppure mediante interfaccia web.
> Gestisce testi
> in Italiano e Inglese con procedure di elaborazione lessicale
> specifiche.
> 
> (...)
-- 
___________________________________________________________
Sign-up for Ads Free at Mail.com
http://promo.mail.com/adsfreejump.htm


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to