Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote: > W dniu 19.01.2013 o 13:25 Techlive Zheng pisze: > > >On 13-01-19, gg wrote: > >>W dniu 19.01.2013 o 12:22 Techlive Zheng > >> pisze: > >> > >>> Hey, I found that elinks now could display HTML file with Content-Type > >>> header set as "GBK" or "GB2312" properly, but how could I specify that > >>> to display part of a GBK encoded file with no Content-Type header set? > >>> > >>> The problem I have is that I intend to use elinks to dump HTML > >>email to > >>> text in mutt, and most of my emails are encoded in GBK or > >>GB2312 without > >>> the Content-Type header set, I can not find a way to tell elinks about > >>> the encoding of these HTML emails. > >>> > >>> I found a thread[1] that talked about displaying BIG5 encoded website, > >>> any update on that? > >>> > >>> [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920 > >> > >>Did you try to change default codepage in Option manager? > >>press o (Option Manager) -> Document -> Charset -> Default codepage -> > >>enter gb2312 > >>Save > >Thanks, I have just tried what you said, it turns out that elinks could > >recognize GB2312 correctly, but not GBK or GB18030. > > > >I have also tried the flowing to dump the text, also, GB2312 works, but > >not GBK or GB18030. > > > >elinks -dump -eval 'set document.codepage.assume = "gb2312"' > >mutt.html > > > >As GB2312 is support now, it should not be too hard to support GBK or > >GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of > >GB2312, both of them are commonly used as the content type of a HTML > >email message, it is better to support them both, so the following could > >be put into the .mailcap file. > > To add a new codepage is easy. > See how the GB2312 was added in src/intl/codepage.inc. > Remember to increase N_CODEPAGES. > Prepare the patch and send to the list. Just sent an patch to elinks-deve mailing list, please take a look. ___ elinks-users mailing list [email protected] http://linuxfromscratch.org/mailman/listinfo/elinks-users
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote: > W dniu 19.01.2013 o 13:25 Techlive Zheng pisze: > > >On 13-01-19, gg wrote: > >>W dniu 19.01.2013 o 12:22 Techlive Zheng > >> pisze: > >> > >>> Hey, I found that elinks now could display HTML file with Content-Type > >>> header set as "GBK" or "GB2312" properly, but how could I specify that > >>> to display part of a GBK encoded file with no Content-Type header set? > >>> > >>> The problem I have is that I intend to use elinks to dump HTML > >>email to > >>> text in mutt, and most of my emails are encoded in GBK or > >>GB2312 without > >>> the Content-Type header set, I can not find a way to tell elinks about > >>> the encoding of these HTML emails. > >>> > >>> I found a thread[1] that talked about displaying BIG5 encoded website, > >>> any update on that? > >>> > >>> [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920 > >> > >>Did you try to change default codepage in Option manager? > >>press o (Option Manager) -> Document -> Charset -> Default codepage -> > >>enter gb2312 > >>Save > >Thanks, I have just tried what you said, it turns out that elinks could > >recognize GB2312 correctly, but not GBK or GB18030. > > > >I have also tried the flowing to dump the text, also, GB2312 works, but > >not GBK or GB18030. > > > >elinks -dump -eval 'set document.codepage.assume = "gb2312"' > >mutt.html > > > >As GB2312 is support now, it should not be too hard to support GBK or > >GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of > >GB2312, both of them are commonly used as the content type of a HTML > >email message, it is better to support them both, so the following could > >be put into the .mailcap file. > > To add a new codepage is easy. > See how the GB2312 was added in src/intl/codepage.inc. > Remember to increase N_CODEPAGES. > Prepare the patch and send to the list. Thanks a lot, I will give it a try. ___ elinks-users mailing list [email protected] http://linuxfromscratch.org/mailman/listinfo/elinks-users
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
W dniu 19.01.2013 o 13:25 Techlive Zheng pisze: > On 13-01-19, gg wrote: >> W dniu 19.01.2013 o 12:22 Techlive Zheng >> pisze: >> >> > Hey, I found that elinks now could display HTML file with Content-Type >> > header set as "GBK" or "GB2312" properly, but how could I specify that >> > to display part of a GBK encoded file with no Content-Type header set? >> > >> > The problem I have is that I intend to use elinks to dump HTML email >> to >> > text in mutt, and most of my emails are encoded in GBK or GB2312 >> without >> > the Content-Type header set, I can not find a way to tell elinks about >> > the encoding of these HTML emails. >> > >> > I found a thread[1] that talked about displaying BIG5 encoded website, >> > any update on that? >> > >> > [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920 >> >> Did you try to change default codepage in Option manager? >> press o (Option Manager) -> Document -> Charset -> Default codepage -> >> enter gb2312 >> Save > Thanks, I have just tried what you said, it turns out that elinks could > recognize GB2312 correctly, but not GBK or GB18030. > > I have also tried the flowing to dump the text, also, GB2312 works, but > not GBK or GB18030. > > elinks -dump -eval 'set document.codepage.assume = "gb2312"' > mutt.html > > As GB2312 is support now, it should not be too hard to support GBK or > GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of > GB2312, both of them are commonly used as the content type of a HTML > email message, it is better to support them both, so the following could > be put into the .mailcap file. To add a new codepage is easy. See how the GB2312 was added in src/intl/codepage.inc. Remember to increase N_CODEPAGES. Prepare the patch and send to the list. ___ elinks-users mailing list [email protected] http://linuxfromscratch.org/mailman/listinfo/elinks-users
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote:
> W dniu 19.01.2013 o 12:22 Techlive Zheng pisze:
>
> > Hey, I found that elinks now could display HTML file with Content-Type
> > header set as "GBK" or "GB2312" properly, but how could I specify that
> > to display part of a GBK encoded file with no Content-Type header set?
> >
> > The problem I have is that I intend to use elinks to dump HTML email to
> > text in mutt, and most of my emails are encoded in GBK or GB2312 without
> > the Content-Type header set, I can not find a way to tell elinks about
> > the encoding of these HTML emails.
> >
> > I found a thread[1] that talked about displaying BIG5 encoded website,
> > any update on that?
> >
> > [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
>
> Did you try to change default codepage in Option manager?
> press o (Option Manager) -> Document -> Charset -> Default codepage ->
> enter gb2312
> Save
Thanks, I have just tried what you said, it turns out that elinks could
recognize GB2312 correctly, but not GBK or GB18030.
I have also tried the flowing to dump the text, also, GB2312 works, but
not GBK or GB18030.
elinks -dump -eval 'set document.codepage.assume = "gb2312"' mutt.html
As GB2312 is support now, it should not be too hard to support GBK or
GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of
GB2312, both of them are commonly used as the content type of a HTML
email message, it is better to support them both, so the following could
be put into the .mailcap file.
text/html; elinks -dump -eval 'set document.codepage.assume = "%{charset}"'
%s; copiousoutput;
___
elinks-users mailing list
[email protected]
http://linuxfromscratch.org/mailman/listinfo/elinks-users
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
W dniu 19.01.2013 o 12:22 Techlive Zheng pisze: > Hey, I found that elinks now could display HTML file with Content-Type > header set as "GBK" or "GB2312" properly, but how could I specify that > to display part of a GBK encoded file with no Content-Type header set? > > The problem I have is that I intend to use elinks to dump HTML email to > text in mutt, and most of my emails are encoded in GBK or GB2312 without > the Content-Type header set, I can not find a way to tell elinks about > the encoding of these HTML emails. > > I found a thread[1] that talked about displaying BIG5 encoded website, > any update on that? > > [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920 Did you try to change default codepage in Option manager? press o (Option Manager) -> Document -> Charset -> Default codepage -> enter gb2312 Save ___ elinks-users mailing list [email protected] http://linuxfromscratch.org/mailman/listinfo/elinks-users
