I'd consider the following options:

1. If the database has an HTML web interface that allows you to crawl the records, you can just point Nutch to it with an appropriate seed URL and urlfilters

2. You could also have a look at SOLR, in which case you could dump your database to a set of XML documents and use post.jar to upload those documents to the SOLR index (see http://lucene.apache.org/solr/ tutorial.html for shortest path to quickly trying this out) or you could modify post.jar to cycle through your database using a database connection.

Hope this helps,

Jasper

On Mar 4, 2008, at 9:36 AM, payo wrote:


hi to all

i can index a database with nutch?

i am use nutch 0.8.1

thanks
--
View this message in context: http://www.nabble.com/indexing- database-tp15832696p15832696.html
Sent from the Nutch - User mailing list archive at Nabble.com.



Reply via email to