Thank you, everybody, for all your replies on this. We are trying RSS parsing with parse-rss enabled.
Dima. ----- Original Message ----- From: "Chris Mattmann" <[EMAIL PROTECTED]> To: <[email protected]> Sent: Monday, August 28, 2006 9:55 AM Subject: Re: RSS search by nutch > Hi there Dima, > > I'm not exactly sure what you mean by "real time", but there is an RSS > Parsing plugin in Nutch that can parse RSS feeds that Nutch encounters > during its crawl. You can enable parse-rss by opening up > $NUTCH_HOME/conf/nutch-site.xml, and searching for the property > "plugin.includes". For the value of "plugin.includes", ensure that there is > an entry for "parse-rss" somewhere in that property value. > > HTH, > Chris > > > On 8/28/06 10:44 AM, "Dima Gritsenko" <[EMAIL PROTECTED]> wrote: > > > Hi, > > > > Does nutch have a class for searching incoming RSS feeds in real time? > > Thank you. > > Dima. > > ______________________________________________ > Chris A. Mattmann > [EMAIL PROTECTED] > Staff Member > Modeling and Data Management Systems Section (387) > Data Management Systems and Technologies Group > > _________________________________________________ > Jet Propulsion Laboratory Pasadena, CA > Office: 171-266B Mailstop: 171-246 > _______________________________________________________ > > Disclaimer: The opinions presented within are my own and do not reflect > those of either NASA, JPL, or the California Institute of Technology. > > > > ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
