I'd consider the following options:
1. If the database has an HTML web interface that allows you to crawl
the records, you can just point Nutch to it with an appropriate seed
URL and urlfilters
2. You could also have a look at SOLR, in which case you could dump
your database to a set of XML documents and use post.jar to upload
those documents to the SOLR index (see http://lucene.apache.org/solr/
tutorial.html for shortest path to quickly trying this out) or you
could modify post.jar to cycle through your database using a database
connection.
Hope this helps,
Jasper
On Mar 4, 2008, at 9:36 AM, payo wrote:
hi to all
i can index a database with nutch?
i am use nutch 0.8.1
thanks
--
View this message in context: http://www.nabble.com/indexing-
database-tp15832696p15832696.html
Sent from the Nutch - User mailing list archive at Nabble.com.