https://issues.apache.org/bugzilla/show_bug.cgi?id=55641

--- Comment #10 from Mike Rumph <[email protected]> ---
Okay, I did a little research.

Believe it or not, the behavior you are seeing appears to be working as
intended.

See line 326 in modules/metadata/mod_mime_magic.c:

http://svn.apache.org/viewvc/httpd/httpd/branches/2.2.x/modules/metadata/mod_mime_magic.c?view=markup

322     {
323         "The", L_ENG
324     },
325     {
326         "the", L_ENG
327     },

When "The" or "the" appears as a token in a file, that file is considered
English text and is marked as type "text/plain".
In your other samples, "<b></b>(the )" and "<b></b>( the)", the word "the" is
attached to a parenthesis instead of being set off by whitespace on both sides,
so it is not recognized as a token.

So "<b></b>( the )" is considered English text.

Even more interesting are the following cases:

English:
<h1></h1>( the )

HTML:
<h1> </h1>( the )

English:
<b> </b>( the )

The httpd trunk version of mod_mime_magic.c appears to have the same code.
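
The cases above are consistent with a simple "split on whitespace, first
table hit wins" lookup. The following is only a minimal sketch of that idea,
not the module's actual code: guess_type() and the four-entry table are
hypothetical, the delimiter set is an assumption, and the "<h1>" entries are
inferred from the observed results rather than quoted from the source.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical, heavily trimmed stand-in for the module's token table.
 * Only the "The"/"the" entries are quoted in this comment; the "<h1>"
 * entries are assumed in order to reproduce the HTML case. */
struct name_entry {
    const char *name;
    const char *type;
};

static const struct name_entry names[] = {
    { "<h1>", "text/html"  },
    { "<H1>", "text/html"  },
    { "The",  "text/plain" },
    { "the",  "text/plain" },
};

/* Return the type of the first whitespace-delimited token that exactly
 * matches a table entry, or NULL if no token matches. */
static const char *guess_type(const char *text)
{
    static const char delims[] = " \t\n\r\f";  /* assumed delimiter set */
    char buf[256];
    char *token;
    size_t i;

    /* strtok() modifies its input, so work on a bounded copy. */
    strncpy(buf, text, sizeof(buf) - 1);
    buf[sizeof(buf) - 1] = '\0';

    for (token = strtok(buf, delims); token != NULL;
         token = strtok(NULL, delims)) {
        for (i = 0; i < sizeof(names) / sizeof(names[0]); i++) {
            if (strcmp(names[i].name, token) == 0)
                return names[i].type;
        }
    }
    return NULL;
}
```

Under these assumptions, "<h1> </h1>( the )" yields "<h1>" as its first
token and is reported as HTML, while "<h1></h1>( the )" and "<b> </b>( the )"
contain no matching token before "the" and are reported as English.
"<b></b>(the )" and "<b></b>( the)" never produce a bare "the" token, so
nothing in the table matches.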

Thanks,

Mike Rumph

