because?
you mean urls which contain a query part?

they can be crawled.
the default nutch configuration excludes them by this filter rule in
conf/crawl-urlfilter.txt

# skip URLs containing certain characters as probable queries, etc.
-[...@=]


Zaihan schrieb:
> Hi All,
>
> I'm sure I've read somewhere before that URLs that is made like
> http://www.site.com/categories.asp?cid=25&page=9 
>
> Can't be crawled. Is that true?
>
> Warmest Regards,
> Zaihan
>
>
>
>
>   

Reply via email to