Hi:

i guess the urls you mentioned are all directed to the same jsp or servlet,
apparently they all begin with
http://app02.laopdr.gov.la/ePortal/news/detail.action<http://app02.laopdr.gov.la/ePortal/news/detail.action?id=10110&from=ePortal_NewsDetail_FromHome>.
the difference is the request_locale parameter. I have no idea how these two
urls with different request_locale parameters are generated, but I guess
nutch just don't know this request_locale parameters because this parameter
may be added by javascript or backend content management system. Maybe u can
write these links in a page that can be crawled by nutch. The point is that
these links must can be found somewhere in your whole website pages. if not,
they can not be found by nutch.

good luck

yanky



2009/3/19 陈琛 <kylin.chc...@gmail.com>

> please help me, it is Urgent and Important, thanks
>
> ---------- Forwarded message ----------
> From: 陈琛 <kylin.chc...@gmail.com>
> Date: 2009/3/19
> Subject: index web
> To: nutch-user@lucene.apache.org
>
>
> hi, all:
>
> i can get index url like
>
> http://app02.laopdr.gov.la/ePortal/news/detail.action?id=10110&from=ePortal_NewsDetail_FromHome
>
> but  cannot get index like
>
> http://app02.laopdr.gov.la/ePortal/news/detail.action?request_locale=en_US&id=10110&from=ePortal_NewsDetail_FromHome
> &<http://app02.laopdr.gov.la/ePortal/news/detail.action?request_locale=en_US&id=10110&from=ePortal_NewsDetail_FromHome%0A&;>
> and
>
> http://app02.laopdr.gov.la/ePortal/news/detail.action?request_locale=lo_LA&id=10110&from=ePortal_NewsDetail_FromHome
> &<http://app02.laopdr.gov.la/ePortal/news/detail.action?request_locale=lo_LA&id=10110&from=ePortal_NewsDetail_FromHome%0A&;>
>
>
> why not index ?
> the web have any different?
>
> please notice "request_locale="
>
>
> thanks
>

Reply via email to