Soren Scott created NUTCH-2044:
----------------------------------
Summary: Support for an expanded HttpHeaders list
Key: NUTCH-2044
URL: https://issues.apache.org/jira/browse/NUTCH-2044
Project: Nutch
Issue Type: Improvement
Components: metadata
Reporter: Soren Scott
Priority: Minor
Is there currently any consideration for either a) expanding the current
HttpHeaders list from
[HttpHeaders.java|https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/metadata/HttpHeaders.java]
to include at least the current permanent or provisional headers or b)
revising that handler to iterate some unknown KVP for the headers? Either as a
configurable widget or something along those lines?
I am mostly interested in the Accept headers to help inform some additional
actions on the fetched responses but even from an accurate assessment of the
crawls, the full set of headers provided by a request is important. I know that
we frown on non-standard keys but, again, imperfect world :).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)