As far as I know, you can do this.

You can either add things in the Metadata objects, but it is limited to
String values.

Or you can extend the Parse object, have a different OutputFormat for it
that would read/write your information from the segments.

The fetcher/parser would have to be modified slightly, but nothing hard
to do.
We did something around those lines, and it works perfectly in Nutch
0.8.



Any other way?




-----Original Message-----
From: cybercouf [mailto:[EMAIL PROTECTED] 
Sent: Friday, February 23, 2007 8:23 AM
To: nutch-dev@lucene.apache.org
Subject: How to add data into segment with my own plugin ?

I'm making a plugin to parse specific data from web page, and after I
will export dump segments into a specific SQL database. So I wont use
the indexing/lucense part I think.

I tried to do my plugin using the wiki tutorial, It works well for the
parse part, (and see it in the log).
(I'm just adding some content into the parseData object) But I don't see
how to add my parsed data in a segment.

I see that a segment have this kind of data for a page (correct me if
i'm
worng):
CrawlDatnum
Content
ParseData
ParseText

for exemple I would like to add some informations in "ParseData" but I
can't figure out how.

thanks for your help, i'm quite new using nutch.
--

View this message in context:
http://www.nabble.com/How-to-add-data-into-segment-with-my-own-plugin---
tf3279715.html#a9121761
Sent from the Nutch - Dev mailing list archive at Nabble.com.



-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to