to add to this...

please try Solr for search  funtionality. Solr.war

Thank you Lewis

________________________________________
From: Alexander Aristov [[email protected]]
Sent: 28 February 2011 09:20
To: [email protected]
Cc: Jason Shi; [email protected]
Subject: Re: web search returns less results than command search

Hi

Firstly I would suspect character encoding issues. Turn on tracsing on web
server and check which senetence is searched.

Next thing is dedup. It can reduce number of results and it's turned on by
default. But of cause it should not reduce to 0.

Best Regards
Alexander Aristov


On 28 February 2011 05:53, Jason Shi <[email protected]> wrote:

> hi guys,I'm using nutch-1.0 for Chinese web search,I changed the
> NutchDocumentAnalyzer.java to use imdict-chinese-analyzer,which is
> dedicated
> to chinese word segmentation,after successfully crawled my computer
> department's website,and deployed nutch-1.0.war,I found that nutch web
> search returns much less results than command search.for example, this
> command "bin/nutch org.apache.nutch.searcher.NutchBean 计算机",returns
> 265hits,but the web search returns 0 result.
> any help would be greatly appreciated.
>
Email has been scanned for viruses by Altman Technologies' email management 
service - www.altman.co.uk/emailsystems

Glasgow Caledonian University is a registered Scottish charity, number SC021474

Winner: Times Higher Education’s Widening Participation Initiative of the Year 
2009 and Herald Society’s Education Initiative of the Year 2009.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html

Winner: Times Higher Education’s Outstanding Support for Early Career 
Researchers of the Year 2010, GCU as a lead with Universities Scotland partners.
http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,15691,en.html

Reply via email to