Please forgive this going to both lists but I'm not sure where things
are going wrong...
I have many website around the world that I need to index. They're
straight HTML pages rather than perl-served and thus the headers say the
content-type is 'text/html' .. without mentioning the encoding.
The
Hello,
Rick Measham wrote:
> My thought process is to convert them into utf8 and store that in the
> database. Then it's just a case of retrieving them later and outputting
> them all on one page marked as utf8.
>
> That being the case, I grab the charset and use Encode's decode function
> to tur