Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote:
> Adding a new codepage is easy.
> See how GB2312 was added in src/intl/codepage.inc.
> Remember to increase N_CODEPAGES.
> Prepare the patch and send it to the list.

I just sent a patch to the elinks-dev mailing list, please take a look.
___ elinks-users mailing list elinks-users@linuxfromscratch.org http://linuxfromscratch.org/mailman/listinfo/elinks-users
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote:
> Adding a new codepage is easy.
> See how GB2312 was added in src/intl/codepage.inc.
> Remember to increase N_CODEPAGES.
> Prepare the patch and send it to the list.

Thanks a lot, I will give it a try.
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 19.01.2013 at 13:25, Techlive Zheng wrote:
> As GB2312 is supported now, it should not be too hard to support GBK or
> GB18030.

Adding a new codepage is easy.
See how GB2312 was added in src/intl/codepage.inc.
Remember to increase N_CODEPAGES.
Prepare the patch and send it to the list.
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 13-01-19, gg wrote:
> Did you try to change the default codepage in the Option manager?
> Press o (Option Manager) -> Document -> Charset -> Default codepage ->
> enter gb2312
> Save

Thanks, I have just tried what you suggested. It turns out that elinks recognizes GB2312 correctly, but not GBK or GB18030.

I have also tried the following to dump the text. Again, GB2312 works, but not GBK or GB18030.

    elinks -dump -eval 'set document.codepage.assume = "gb2312"' mutt.html

As GB2312 is supported now, it should not be too hard to support GBK or GB18030. GBK is a superset of GB2312, and GB18030 is in turn a superset of GBK. Both are commonly used as the charset of HTML email messages, so it would be better to support them both. Then the following could be put into the .mailcap file:

    text/html; elinks -dump -eval 'set document.codepage.assume = "%{charset}"' %s; copiousoutput;
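To make the relationship between the three encodings concrete, here is a minimal Python sketch. It relies only on Python's built-in `gb2312`, `gbk` and `gb18030` codecs, not on elinks' own codepage tables, and the helper name `encodable` is made up for this illustration. It checks which encodings can represent a simplified character, a traditional character, and a character from CJK Extension B.

```python
# Illustration only: Python's built-in codecs stand in for the three
# encodings; this says nothing about how elinks implements its codepages.

def encodable(text: str, codec: str) -> bool:
    """Return True if every character of `text` can be encoded with `codec`."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


# One sample character per "tier" of coverage.
SAMPLES = {
    "simplified (U+4E2D)": "\u4e2d",        # 中
    "traditional (U+9F8D)": "\u9f8d",       # 龍
    "Extension B (U+20000)": "\U00020000",  # 𠀀
}

for label, ch in SAMPLES.items():
    row = {codec: encodable(ch, codec) for codec in ("gb2312", "gbk", "gb18030")}
    print(f"{label:24} {row}")
```

Each later encoding accepts everything the earlier one does, which is why a file labelled GB2312 can usually be read as GBK or GB18030, but not the other way round.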
Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
On 19.01.2013 at 12:22, Techlive Zheng wrote:
> The problem I have is that I intend to use elinks to dump HTML email to
> text in mutt, and most of my emails are encoded in GBK or GB2312 without
> the Content-Type header set, I can not find a way to tell elinks about
> the encoding of these HTML emails.

Did you try to change the default codepage in the Option manager?
Press o (Option Manager) -> Document -> Charset -> Default codepage -> enter gb2312
Save
[elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?
Hey,

I found that elinks can now properly display an HTML file whose Content-Type header specifies "GBK" or "GB2312", but how can I tell it to display part of a GBK encoded file that has no Content-Type header set?

The problem is that I intend to use elinks to dump HTML email to text in mutt, and most of my emails are encoded in GBK or GB2312 without the Content-Type header set. I cannot find a way to tell elinks about the encoding of these HTML emails.

I found a thread[1] that talked about displaying BIG5 encoded websites. Is there any update on that?

[1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
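One possible workaround for the situation described here, until the browser itself knows the encoding, is to transcode the HTML part to UTF-8 before handing it over. The following is only a sketch under stated assumptions: the function name `to_utf8` and the command-line handling are hypothetical, and it assumes that decoding as GB18030 is acceptable for input that is really GBK or GB2312 (Python's `gb18030` codec reads standard GB2312 and GBK byte sequences).

```python
import sys


def to_utf8(raw: bytes, charset: str = "gb18030") -> bytes:
    """Decode `raw` using `charset` and re-encode it as UTF-8.

    Undecodable bytes are replaced with U+FFFD instead of raising,
    so a slightly damaged message still produces readable output.
    """
    text = raw.decode(charset, errors="replace")
    return text.encode("utf-8")


def main(argv: list) -> int:
    # Hypothetical usage: script.py INPUT.html OUTPUT.html
    if len(argv) != 3:
        print("usage: to_utf8.py INPUT OUTPUT", file=sys.stderr)
        return 2
    with open(argv[1], "rb") as src:
        data = src.read()
    with open(argv[2], "wb") as dst:
        dst.write(to_utf8(data))
    return 0


if __name__ == "__main__":
    sys.exit(main(sys.argv))
```

The converted file could then be dumped with an elinks command of the same form as the one used elsewhere in this thread, with the assumed codepage set to UTF-8, provided the elinks build in use accepts that codepage name.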