Hi folks :
What’s I want to do is to separate a rss file into several pages .
Just as what has been discussed before. I want fetch a rss page and index
it as different documents in the index. So the searcher can search the
Item’s info as a individual hit.
What’s my opinion create a
Hi there,
I could most likely be of assistance, if you gave me some more information.
For instance: I'm wondering if the use case you describe below is already
supported by the current RSS parse plugin?
The current RSS parser, parse-rss, does in fact index individual items that
are pointed to
thx for ur reply .
mybe i didn't tell clearly .
I want to index the item as a individual page .then when i search the some
thing for example nutch-open source, the nutch return a hit which contain
title : nutch-open source
description : nutch nutch nutch nutch nutch
url :
Hi there,
On 1/30/07 7:00 PM, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
Chris,
I saw your name associated with the rss parser in nutch. My understanding is
that nutch is using feedparser. I had two questions:
1. Have you looked at vtd as an rss parser?
I haven't in fact; what are
1. Claims to be faster
2. Asynchronous should take care of sitting and waiting for one fetch to
return before you do the next.
Ps I am not sure if you checked out tailrank.com for that branch of feedparser
(I think its in code.tailrank.com/feedparser)
Thanks
-Original
why when I changed the nutch/conf/log4j.properties
I just changed the first line
Log4j.rootLogger=info,drfa to log4j.rootLogger=debug,drfa
Like this:
*** **
# RootLogger - DailyRollingFileAppender
#log4j.rootLogger=INFO,DRFA
Hi there,
With the explanation that you give below, it seems like parse-rss as it
exists would address what you are trying to do. parse-rss parses an RSS
channel as a set of items, and indexes overall metadata about the RSS file,
including parse text, and index data, but it also adds each item