[
https://issues.apache.org/jira/browse/NUTCH-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13961211#comment-13961211
]
Sebastian Nagel commented on NUTCH-1747:
----------------------------------------
+1
Looks like inProgress was intended to hold more than the bare count of
FetchItems in progress. In doubt, we can get the in-progress FetchItems and
their associated queue from FetcherThreads (cf. NUTCH-1182).
> Use AtomicInteger as semaphore in Fetcher
> -----------------------------------------
>
> Key: NUTCH-1747
> URL: https://issues.apache.org/jira/browse/NUTCH-1747
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher
> Affects Versions: 1.8
> Reporter: Julien Nioche
> Priority: Minor
> Attachments: NUTCH-1747-trunk.patch
>
>
> In Fetcher we currently use
> Set<FetchItem> inProgress = Collections.synchronizedSet(new
> HashSet<FetchItem>());
> as semaphore within the FetchItemQueues to keep track of the URLs being
> fetched and prevent threads from pulling from them. It works fine but we
> could use AtomicIntegers instead as all we need is the counts, not the
> contents.
> This change would have little impact on the performance but would make the
> code a bit cleaner.
--
This message was sent by Atlassian JIRA
(v6.2#6252)