Michael, 

Did you remember to specify the -filter option when you ran nutch/mergesegs?

-filter         filter out URL-s prohibited by current URLFilters
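
For illustration, a minimal sketch of what such an invocation might look like. The directory names below are hypothetical placeholders, and the argument order follows the mergesegs usage message as I recall it (output directory first, then -dir with the segments directory), so please check it against the usage text printed by your own Nutch version.

```shell
# Hypothetical paths: adjust to your own crawl layout.
SEGMENTS_DIR="crawl/segments"          # assumed: directory holding the existing segments
MERGED_DIR="crawl/segments_merged"     # assumed: new output directory for the merged segment

# Merge all segments and apply the currently configured URLFilters,
# so that URLs rejected by the filter rules are dropped from the output.
bin/nutch mergesegs "$MERGED_DIR" -dir "$SEGMENTS_DIR" -filter
```

The index would then presumably need to be rebuilt from the merged output rather than from the original segments, otherwise the search page keeps serving the unfiltered data.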



Feng Ji wrote:
> 
> Hi there,
> 
> I want to filter out particular URLs from the search results.
> 
> I am trying to use the segment merger to do it.
> 
> First, I put the target URLs in regex-urlfilter.txt and
> automaton-urlfilter.txt,
> as "-http://abc.com/".
> 
> Then I ran "nutch/mergesegs" and "nutch/index", but the search page still
> shows the URLs I want to filter out.
> 
> Any idea which step I missed?
> 
> Thanks,
> 
> Michael,
> 
> 
