Hi there.

While I am trying to create the protocol-foo, an implementation for the example 
protocol with URLs like foo://something I see difficulty in distinguishing when 
to tell nutch to search for more URLs and when not to. It would be something 
like a directory listing, or no directory listing but content.

It is possible that a protocol-plugin cannot do much without a parser-plugin? 
And if I were to implement such a parser-plugin, would I then have to implement 
the directory listing plus all the content parsing like Tika?

Hiran


Hiran Chaudhuri
Principal Support Engineer
Service Reliability Engineering - Custom
Amadeus Data Processing GmbH
Berghamer Strasse 6
85435 Erding
T: +49-8122-43x3662
[email protected]
http://amadeus.com<http://amadeus.com/>
[cid:[email protected]]

Reply via email to