Re: The Future of Nutch, reactivated

2009-05-23 Thread Julien Nioche
Hi, Am joining the conversation a bit late but nevermind... In my views the main targets should be (2). As you pointed out, SOLR covers (3) and (4) quite well (or will progressively do so). As for (1), there is definitely an audience even if it is small but would certainly benefit from the work

Re: The Future of Nutch, reactivated

2009-05-15 Thread Raymond Balmès
I 'm still a new user so although I found it rather easy to get going and build my own plugin's I have some suggestions. Yes one thing that I'd like to see is a kind of way to estimate how long will a certain step (fetch, ...) will take... something like a progress bar. Because you launch a step

Re: The Future of Nutch, reactivated

2009-05-15 Thread consultas
Keep it simple. Many people, it seems to me, use nutch to exercise, in some way their programming expertise and talents. I am just a user, and I think that users just want something thant can index the web and find results, when they search. I don't want to deal with complicated application

Re: The Future of Nutch, reactivated

2009-05-14 Thread AJ Chen
Andrzej, great summary. I played with nutch before for web search engine, but has not used it for a while because it has become too complicated. based on my experience in building semantic search engine for healthcare vertical, it think it would be benefitial to separate crawling from search

Re: The Future of Nutch, reactivated

2009-05-14 Thread Mattmann, Chris A
Hi Andrzej, Great summary. My general feeling on this is similar to my prior comments on similar threads from Otis and from Dennis. My personal pet projects for Nutch2: * refactored Nutch core data structures, modeled as POJOs * refactored Nutch architecture where