Soren Scott created NUTCH-2044:
----------------------------------

             Summary: Support for an expanded HttpHeaders list
                 Key: NUTCH-2044
                 URL: https://issues.apache.org/jira/browse/NUTCH-2044
             Project: Nutch
          Issue Type: Improvement
          Components: metadata
            Reporter: Soren Scott
            Priority: Minor


Is there currently any consideration for either a) expanding the current 
HttpHeaders list from 
[HttpHeaders.java|https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/metadata/HttpHeaders.java]
 to include at least the current permanent or provisional headers or b) 
revising that handler to iterate some unknown KVP for the headers? Either as a 
configurable widget or something along those lines?

I am mostly interested in the Accept headers to help inform some additional 
actions on the fetched responses but even from an accurate assessment of the 
crawls, the full set of headers provided by a request is important. I know that 
we frown on non-standard keys but, again, imperfect world :).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to