Absolutely...

There is a short (old) thread here on this topic [1], from what I can see
this issue has not been addressed. Therefore it looks like implementing your
own parser plugin is what's required.

[1]
http://www.lucidimagination.com/search/document/a8d53fac1caa578c/nutch_with_nsf_files

2011/7/3 Alexander Aristov <[email protected]>

> Hi
>
> If it is a text file then you can simply associate the extension with text
> parser. But if I understand you right it's a lotus Db file then I suspect
> you have no other choice than implementing your own parser. I haven't heard
> of lotus files support in nutch.
>
> Best Regards
> Alexander Aristov
>
>
> 2011/7/3 丛云牙之主 <[email protected]>
>
> > Hello, I am using nutch-1.2 has encountered a problem.The site is
> > writtenwith lotus domino, I use the browser to enter, click on the
> emergence
> > of thoseconnections have not changed the site URL, unlike some sites have
> a
> > lot of suffixes.Then there is a web site is buptoa.bupt.edu.cn /
> > student_broad.nsf, I wanted to climbwill take. Nsf file. But nutch does
> not
> > support. Nsf file crawl, I should write my ownplugin or should solve this
> > problem from the other side?
> > Extremely grateful for your help
>



-- 
*Lewis*

Reply via email to