Michael, Did you remember to specify the -filter option when you ran nutch/mergesegs ?
-filter filter out URL-s prohibited by current URLFilters Feng Ji wrote: > > Hi there, > > I want to filter out particular ursl from search result. > > And I try to use segement merger to do it; > > Firstly, I put target urls in regex-urlfiter.txt and > automaton-urlfiter.txt, > as "-http://abc.com/". > > then, run "nutch/mergesegs" and "nutch/index", but the search page still > show the urls I want to filter out. > > Any idea and which step I missed? > > thanks, > > Michael, > > -- View this message in context: http://www.nabble.com/filter-urls-from-search-result-tf2224405.html#a6169863 Sent from the Nutch - User forum at Nabble.com. ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
