On Wed, 28 Mar 2018, Allison, Timothy B. wrote:
With the new mime patterns, we've gotten quite a few changes of
message/news being identified as message/rfc822. An example is:
http://162.242.228.174/docs/commoncrawl2/DA/DALFSFPD6FX4GGZ6EEJQA6RABA7OXIF5<http://162.242.228.174/docs/commoncrawl2/VG/VGXYD2ISNSDJAVMK6CK7DHB3KI6ZHB6L>
That looks like a regression to me, it's really news
We should correct this, right? Any recommendations?
I think it's the Message-ID header it's matching on. I'd suggest we bump
the news magics up from 50 (same as rfc822) to 60, so the news ones take
preference
Nick