Hello, I am working with Nutch 1.2 to crawl a site. A few of its URLs look like
www.example/(sndjnc22e3r3r))/abc.com. I want to strip out the part inside the
brackets to normalize these URLs. For this I wrote a regex rule in my regex
normalizer to substitute it away. I have crawled again since then, but I am
still not getting the expected results.
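
To make the intent concrete, here is a minimal Java sketch of the kind of substitution I am after. The class name, the method name and the pattern itself are illustrative assumptions, not necessarily the rule in my normalizer configuration. It uses plain java.util.regex and only shows what the result should look like for the example URL above.

```java
import java.util.regex.Pattern;

public class StripBracketSegment {
    // Assumed pattern: one path segment that starts with "(" and ends with ")",
    // together with the slashes on both sides of it.
    private static final Pattern BRACKET_SEGMENT = Pattern.compile("/\\([^/]*\\)/");

    // Replaces each bracketed segment matched by the pattern with a single slash.
    public static String normalize(String url) {
        return BRACKET_SEGMENT.matcher(url).replaceAll("/");
    }

    public static void main(String[] args) {
        // prints "www.example/abc.com"
        System.out.println(normalize("www.example/(sndjnc22e3r3r))/abc.com"));
    }
}
```

A URL without a bracketed segment passes through unchanged, which is the behaviour I would expect from the normalizer rule as well.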

Please guide me in solving this issue.
