Hi,
I am joining the conversation a bit late, but never mind...
In my view the main target should be (2). As you pointed out, SOLR covers
(3) and (4) quite well (or will progressively do so). As for (1), there is
definitely an audience, even if it is small, that would certainly benefit
from the work
I'm still a new user, so although I found it rather easy to get going and
build my own plugins, I have some suggestions.
Yes, one thing that I'd like to see is a way to estimate how long a
certain step (fetch, ...) will take, something like a progress
bar. Because you launch a step
Keep it simple.
Many people, it seems to me, use Nutch to exercise, in some way, their
programming expertise and talents.
I am just a user, and I think that users just want something that can index
the web and find results when they search. I don't want to deal with
complicated application
Andrzej, great summary. I played with Nutch before for a web search engine,
but have not used it for a while because it has become too complicated. Based
on my experience in building a semantic search engine for the healthcare
vertical, I think it would be beneficial to separate crawling from search
Hi Andrzej,
Great summary. My general feeling on this is similar to my prior comments on
similar threads from Otis and from Dennis. My personal pet projects for
Nutch2:
* refactored Nutch core data structures, modeled as POJOs
* refactored Nutch architecture where