Hi, I have recently been using Nutch 0.9 and is currently facing some problems. I know that Nutch will remove older segments, not deduplicating but removing segments that are old(with old information). Can anyone tell me where this part is located?
Also, it appear that Nutch 0.9 fetching is much faster..but can't seems to understand why is it so fast compare to the older version. Can anyone advise me on this? Thanks a lot -- View this message in context: http://www.nabble.com/Nutch-Removing-Segments-tp20696366p20696366.html Sent from the Nutch - User mailing list archive at Nabble.com.
