I have an example of Nutch running as a local search engine at http://www.searchmitchell.com

 

There are a few issues I'm initially concerned about, and I'm wondering if you could provide any comments or suggestions:

- It doesn't seem to index sites that use frames (e.g., http://www.dicefinancial.com).

- It also doesn't seem to index sites whose start page is a redirect (e.g., http://www.cornerstonescareer.com).

- It sometimes has problems with query strings (e.g., it got caught looping through a calendar on http://www.focusag.us).

- Grouping results by host. (I already posted on this issue, and it looks like you are working towards a solution. I'm excited to try it out once it is implemented.)
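
As a rough illustration of the query-string problem, the check below sketches one way a crawler could skip URLs that look like calendar loops. It is not Nutch code. The function name, the date-like parameter names, the parameter limit, and the example calendar path are all assumptions made up for this example.

```python
import re
from urllib.parse import urlsplit, parse_qsl

# Hypothetical limit, chosen only for illustration.
MAX_QUERY_PARAMS = 3

# Hypothetical parameter names that often drive endless calendar pages.
DATE_PARAM = re.compile(r"^(year|month|day|date)$", re.IGNORECASE)


def looks_like_crawler_trap(url: str) -> bool:
    """Return True if the URL's query string suggests an endless link loop."""
    query = urlsplit(url).query
    params = parse_qsl(query, keep_blank_values=True)

    # Many query parameters often indicate dynamically generated pages.
    if len(params) > MAX_QUERY_PARAMS:
        return True

    # Date-like parameters are typical of "next month" calendar links.
    return any(DATE_PARAM.match(name) for name, _ in params)


if __name__ == "__main__":
    # The calendar path and parameters below are invented for the example.
    for candidate in (
        "http://www.focusag.us/",
        "http://www.focusag.us/calendar?month=5&year=2004",
    ):
        print(candidate, looks_like_crawler_trap(candidate))
```

A filter like this trades coverage for safety: it would also skip legitimate pages that happen to use date parameters, so the limits would need tuning per site.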

 

By the way, I do have a business plan for this concept that has attracted a good deal of interest, so if I can execute the plan, I would be interested in committing a percentage of the returns back to Nutch development. I'm not stating this to entice you to help me, but rather to be open and honest about my intentions for using Nutch. (This seems like a good place to be "open" with people, right? :)

 

Thanks,

Eric Holman

Chamber Centric

 

 
