Hi Eder,

If you are using Java 5, take a look at java.util.Scanner to read your lines,
then use String.split(String regex)
<http://java.sun.com/j2se/1.5.0/docs/api/java/lang/String.html>
to split each line on the colon, and read the first element of the
resulting array to decide which field you have.
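
Here is a rough sketch of that idea, assuming every line has the form
"Name: value" as in your example files. The class name FieldFileParser and
the parse method are made up for illustration; they are not part of Lucene
or the JDK. It only collects the name/value pairs. Turning each pair into a
Lucene Field is left out, since the Field constructor depends on the Lucene
version you use.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Scanner;

// Hypothetical helper, not part of Lucene: collects "Name: value" lines.
public class FieldFileParser {

    // Reads "Name: value" lines and returns the pairs in file order.
    static Map<String, String> parse(Readable source) {
        Map<String, String> fields = new LinkedHashMap<String, String>();
        Scanner scanner = new Scanner(source);
        try {
            while (scanner.hasNextLine()) {
                String line = scanner.nextLine();
                // A limit of 2 keeps any further colons inside the value.
                String[] parts = line.split(":", 2);
                if (parts.length < 2) {
                    continue; // blank line or line without a colon
                }
                // parts[0] is the field name, parts[1] is its text.
                fields.put(parts[0].trim(), parts[1].trim());
            }
        } finally {
            scanner.close();
        }
        return fields;
    }

    public static void main(String[] args) {
        // Sample content taken from File1.txt in the original question.
        String sample = "Author: Eder\n"
                + "Description: Indexing txt files in Lucene Tutorial\n"
                + "Category: Software Development\n";
        Map<String, String> fields = parse(new StringReader(sample));
        for (Map.Entry<String, String> e : fields.entrySet()) {
            System.out.println(e.getKey() + " = " + e.getValue());
        }
    }
}
```

To read a real file, pass new java.io.FileReader("File1.txt") instead of the
StringReader. Each map entry then gives you the name and the text for one
field of your Document.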

Hope this helps.

Patrick


On 10/27/06, Eder <[EMAIL PROTECTED]> wrote:


Hi, Grant

Sorry for writing to you directly... I'm a newbie with Lucene. Could you
give me a practical example of parsing a file? I tried to understand the
luceneweb demo, but it's very complicated.

Thanks a lot!

Eder


----- Original Message -----
From: "Grant Ingersoll" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Friday, October 27, 2006 10:43 AM
Subject: Re: How to create fields from a txt file for Lucene indexing?


You need to read in the file and parse it according to your business
rules (just like you would read in any file in your system) and then
create the appropriate Fields.

-Grant
On Oct 26, 2006, at 11:56 PM, Eder wrote:

> Hi all
>
> I'd like to create fields based on a .txt file, like the following
> example:
>
> File1.txt
> Author: Eder
> Description: Indexing txt files in Lucene Tutorial
> Category: Software Development
>
> File2.txt
> Author: Cecilia
> Title: Preventioning Fever
> Category: Health y Wellness
>
> So, I'd like to create the fields "Author", "Description", "Title" and
> "Category" by reading the files. If I had the text, I would do something
> like:
>
> Document doc = new Document( );
> doc.add(new Field("Author", "Eder"));
>
> But this info is in txt files, so how can I read the file and get the
> data?
>
>
> Big hug,
>
> Eder Rebouças dos Santos
> Salvador / BA - Brasil

--------------------------
Grant Ingersoll
Sr. Software Engineer
Center for Natural Language Processing
Syracuse University
335 Hinds Hall
Syracuse, NY 13244
http://www.cnlp.org



