I am working on a boosting solutiong where I am having to create more
binary databases than just the linkdb, crawldb, etc. For example I
create one for uncommon words in a page. Then I want to use these
database objects inside of the indexing process, in the filters, by key
along with the linkdb, parse text ,parse data and so on.
The link database and parse text and data are passed into the filters
directly through the filter interface. I can't pass other databases
alongside because I would have to change the interface which means I
would have to refactor all existing indexing filters. The easiest way I
found right now in modifying the parse interface to also hold the
database objects that I need, but that doesn't feel like a good long
term solution.
Is there a better way to pass other keyed values (database) objects into
the indexing filters? Should we start a discussion about if we need
this functionality in Nutch and how best to implement it. I would be
happy to implement it but I want some discussion and opinions first.
Dennis
-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers