Hi

Firstly I would suspect character encoding issues. Turn on tracsing on web
server and check which senetence is searched.

Next thing is dedup. It can reduce number of results and it's turned on by
default. But of cause it should not reduce to 0.

Best Regards
Alexander Aristov


On 28 February 2011 05:53, Jason Shi <[email protected]> wrote:

> hi guys,I'm using nutch-1.0 for Chinese web search,I changed the
> NutchDocumentAnalyzer.java to use imdict-chinese-analyzer,which is
> dedicated
> to chinese word segmentation,after successfully crawled my computer
> department's website,and deployed nutch-1.0.war,I found that nutch web
> search returns much less results than command search.for example, this
> command "bin/nutch org.apache.nutch.searcher.NutchBean 计算机",returns
> 265hits,but the web search returns 0 result.
> any help would be greatly appreciated.
>

Reply via email to