Marco Quaranta created TIKA-1206:
------------------------------------

             Summary: rfc822 standard headers
                 Key: TIKA-1206
                 URL: https://issues.apache.org/jira/browse/TIKA-1206
             Project: Tika
          Issue Type: Improvement
          Components: detector, mime
    Affects Versions: 1.4
            Reporter: Marco Quaranta
            Priority: Minor


Please add the following headers to message/rfc822 type:
{noformat}
<match value="Subject:" type="stringignorecase" offset="0" />   
<match value="Content-Type:" type="stringignorecase" offset="0">
  <match value="MIME-Version:" type="stringignorecase" offset="0:500"/>
</match>
{noformat}
The former is defined in [RFC822|http://www.ietf.org/rfc/rfc0822.txt], the 
latter is defined in [RFC2045|http://tools.ietf.org/html/rfc2045#page-10]. I've 
added a further check on the Content-type header just to be sure it's a MIME.
In RFC822 (part 4.1) are also defined other headers like: Resent-From, 
References, Keyword etc.. 

Please also consider to add the following Mozilla custom-headers:

{noformat}
<match value="X-Account-Key:" type="string" offset="0">
  <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
</match>
<match value="X-Mozilla-Status:" type="string" offset="0">
  <match value="X-Mozilla-Keys:" type="string" offset="0:500" />
</match>
{noformat}

Thanks



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)

Reply via email to