I may want to follow you there, not sure where there is less work : + making a webapp for Nutch 1.3 + or upgrading all my web site with/into Solr, quite some work + learning of Solr
Although I have no experience in contributing to code development. 2011/7/6 Mattmann, Chris A (388J) <[email protected]> > Also of note: > > If I get any free cycles, my #1 priority is to re-enable the default Nutch > web-app through > JAX-RS or the recent work that Andrzej was doing with REST services and > Restlet. I don't think it would be hard, just haven't had the time yet. > > Cheers, > Chris > > On Jul 6, 2011, at 12:15 PM, Julien Nioche wrote: > > > Hi, > > > > The decision to drop the legacy Nutch search web apps was motivated by > the > > following facts : > > 1 - most people tend to use SOLR anyway > > 2 - the number of active developers / committers is very limited which > means > > that we couldn't maintain / improve the Nutch searcher and work on other > > aspects of Nutch > > 3 - SOLR has more functionality, a larger community - trying to catch up > and > > match the functionalities in Nutch would have been a duplication of > efforts > > and IMHO a waste of time > > > > This is part of a larger attempt in Nutch to delegate to external > components > > : Tika, SOLR and shortly crawler-commons. It makes sense to contribute to > > these projects and benefits from external contributions. We can then > focus > > on more specific crawling-related issues and make Nutch better, the move > > towards 2.0 is a good example of this. This is a change in the scope of > the > > project from being an open source large scale search engine to an open > > source crawler indeed. We should make this clearer on the website. > > > > While I agree that moving your search to SOLR represents an investment in > > the short term, you will benefit from the efforts from the SOLR/Lucene > > community, for instance the field collapsing in 3.3 is something that > should > > be of interest to a lot of Nutch users. > > > > Hope it makes sense > > > > Julien > > > > On 6 July 2011 18:18, MilleBii <[email protected]> wrote: > > > >> I'm not too happy either that 1.3 forces to use Solr. > >> Which means I have to redevelop all my search web sites, didn't need > that > >> extra work/burden. > >> > >> I will start the process to upgrade to 1.2 but 1.3 is a much bigger > issue > >> for me. > >> > >> 2011/7/6 Markus Jelsma <[email protected]> > >> > >>> I'd still recommend to upgrade. Modifying Solr's example Velocity > >> template > >>> is > >>> really simple. You'll get a complete HTML driven search engine for > free! > >>> > >>> On Wednesday 06 July 2011 18:57:26 [email protected] wrote: > >>>> So, I assume that, for now, I and people like me, should stick with > >> older > >>>> versions of Nutch, until something comes along that allows us to have > >>> just > >>>> a regular safety engine. > >>>> Thank you for your prompt reply. > >>>> > >>>>> Hi, > >>>>> > >>>>> With Solr's Velocity integration you can easily get an interface > >>> without > >>>>> a lot > >>>>> of work. It hasn't been tuned to work with Nutch' data yet. It's on > >> the > >>>>> todo > >>>>> list. > >>>>> > >>>>> Cheers, > >>>>> > >>>>> On Wednesday 06 July 2011 18:29:59 [email protected] wrote: > >>>>>> Ok, I could do the integration of Nutch-1.3 and Solr 3.3, but the > >>>>>> interface of Solr seems quite complicated to me. I just want to > >> have > >>> a > >>>>>> search engine with a customizable front and results pages, in full > >>> html > >>>>>> (not xml) like Nutch provided, until the last version. It is clear > >> to > >>>>>> me > >>>>>> that I have a lot do learn about Solr, or, may be, Nutch deviated > >>> from > >>>>>> the original purpose? I also noticed that the clustering-carrot2 is > >>> not > >>>>>> included and this one was very important to me. > >>>>>> Is there any tutorial that clarify those questions? > >>>>>> > >>>>>> Thank you. > >>>>> > >>>>> -- > >>>>> Markus Jelsma - CTO - Openindex > >>>>> http://www.linkedin.com/in/markus17 > >>>>> 050-8536620 / 06-50258350 > >>> > >>> -- > >>> Markus Jelsma - CTO - Openindex > >>> http://www.linkedin.com/in/markus17 > >>> 050-8536620 / 06-50258350 > >>> > >> > >> > >> > >> -- > >> -MilleBii- > >> > > > > > > > > -- > > * > > *Open Source Solutions for Text Engineering > > > > http://digitalpebble.blogspot.com/ > > http://www.digitalpebble.com > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Chris Mattmann, Ph.D. > Senior Computer Scientist > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 171-266B, Mailstop: 171-246 > Email: [email protected] > WWW: http://sunset.usc.edu/~mattmann/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Adjunct Assistant Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > -- -MilleBii-

