Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread Techlive Zheng
On 13-01-19, gg wrote:
> W dniu 19.01.2013 o 13:25 Techlive Zheng  pisze:
> 
> >On 13-01-19, gg wrote:
> >>W dniu 19.01.2013 o 12:22 Techlive Zheng
> >> pisze:
> >>
> >>> Hey, I found that elinks now could display HTML file with Content-Type
> >>> header set as "GBK" or "GB2312" properly, but how could I specify that
> >>> to display part of a GBK encoded file with no Content-Type header set?
> >>>
> >>> The problem I have is that I intend to use elinks to dump HTML
> >>email to
> >>> text in mutt, and most of my emails are encoded in GBK or
> >>GB2312 without
> >>> the Content-Type header set, I can not find a way to tell elinks about
> >>> the encoding of these HTML emails.
> >>>
> >>> I found a thread[1] that talked about displaying BIG5 encoded website,
> >>> any update on that?
> >>>
> >>> [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
> >>
> >>Did you try to change default codepage in Option manager?
> >>press o (Option Manager) -> Document -> Charset -> Default codepage ->
> >>enter gb2312
> >>Save
> >Thanks, I have just tried what you said, it turns out that elinks could
> >recognize GB2312 correctly, but not GBK or GB18030.
> >
> >I have also tried the flowing to dump the text, also, GB2312 works, but
> >not GBK or GB18030.
> >
> >elinks -dump -eval 'set document.codepage.assume = "gb2312"'
> >mutt.html
> >
> >As GB2312 is support now, it should not be too hard to support GBK or
> >GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of
> >GB2312, both of them are commonly used as the content type of a HTML
> >email message, it is better to support them both, so the following could
> >be put into the .mailcap file.
> 
> To add a new codepage is easy.
> See how the GB2312 was added in src/intl/codepage.inc.
> Remember to increase N_CODEPAGES.
> Prepare the patch and send to the list.

Just sent an patch to elinks-deve mailing list, please take a look.
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users


Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread Techlive Zheng
On 13-01-19, gg wrote:
> W dniu 19.01.2013 o 13:25 Techlive Zheng  pisze:
> 
> >On 13-01-19, gg wrote:
> >>W dniu 19.01.2013 o 12:22 Techlive Zheng
> >> pisze:
> >>
> >>> Hey, I found that elinks now could display HTML file with Content-Type
> >>> header set as "GBK" or "GB2312" properly, but how could I specify that
> >>> to display part of a GBK encoded file with no Content-Type header set?
> >>>
> >>> The problem I have is that I intend to use elinks to dump HTML
> >>email to
> >>> text in mutt, and most of my emails are encoded in GBK or
> >>GB2312 without
> >>> the Content-Type header set, I can not find a way to tell elinks about
> >>> the encoding of these HTML emails.
> >>>
> >>> I found a thread[1] that talked about displaying BIG5 encoded website,
> >>> any update on that?
> >>>
> >>> [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
> >>
> >>Did you try to change default codepage in Option manager?
> >>press o (Option Manager) -> Document -> Charset -> Default codepage ->
> >>enter gb2312
> >>Save
> >Thanks, I have just tried what you said, it turns out that elinks could
> >recognize GB2312 correctly, but not GBK or GB18030.
> >
> >I have also tried the flowing to dump the text, also, GB2312 works, but
> >not GBK or GB18030.
> >
> >elinks -dump -eval 'set document.codepage.assume = "gb2312"'
> >mutt.html
> >
> >As GB2312 is support now, it should not be too hard to support GBK or
> >GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of
> >GB2312, both of them are commonly used as the content type of a HTML
> >email message, it is better to support them both, so the following could
> >be put into the .mailcap file.
> 
> To add a new codepage is easy.
> See how the GB2312 was added in src/intl/codepage.inc.
> Remember to increase N_CODEPAGES.
> Prepare the patch and send to the list.

Thanks a lot, I will give it a try.
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users


Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread gg
W dniu 19.01.2013 o 13:25 Techlive Zheng  pisze:

> On 13-01-19, gg wrote:
>> W dniu 19.01.2013 o 12:22 Techlive Zheng   
>> pisze:
>>
>> > Hey, I found that elinks now could display HTML file with Content-Type
>> > header set as "GBK" or "GB2312" properly, but how could I specify that
>> > to display part of a GBK encoded file with no Content-Type header set?
>> >
>> > The problem I have is that I intend to use elinks to dump HTML email  
>> to
>> > text in mutt, and most of my emails are encoded in GBK or GB2312  
>> without
>> > the Content-Type header set, I can not find a way to tell elinks about
>> > the encoding of these HTML emails.
>> >
>> > I found a thread[1] that talked about displaying BIG5 encoded website,
>> > any update on that?
>> >
>> > [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
>>
>> Did you try to change default codepage in Option manager?
>> press o (Option Manager) -> Document -> Charset -> Default codepage ->
>> enter gb2312
>> Save
> Thanks, I have just tried what you said, it turns out that elinks could
> recognize GB2312 correctly, but not GBK or GB18030.
>
> I have also tried the flowing to dump the text, also, GB2312 works, but
> not GBK or GB18030.
>
> elinks -dump -eval 'set document.codepage.assume = "gb2312"'  
> mutt.html
>
> As GB2312 is support now, it should not be too hard to support GBK or
> GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of
> GB2312, both of them are commonly used as the content type of a HTML
> email message, it is better to support them both, so the following could
> be put into the .mailcap file.

To add a new codepage is easy.
See how the GB2312 was added in src/intl/codepage.inc.
Remember to increase N_CODEPAGES.
Prepare the patch and send to the list.
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users


Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread Techlive Zheng
On 13-01-19, gg wrote:
> W dniu 19.01.2013 o 12:22 Techlive Zheng  pisze:
> 
> > Hey, I found that elinks now could display HTML file with Content-Type
> > header set as "GBK" or "GB2312" properly, but how could I specify that
> > to display part of a GBK encoded file with no Content-Type header set?
> >
> > The problem I have is that I intend to use elinks to dump HTML email to
> > text in mutt, and most of my emails are encoded in GBK or GB2312 without
> > the Content-Type header set, I can not find a way to tell elinks about
> > the encoding of these HTML emails.
> >
> > I found a thread[1] that talked about displaying BIG5 encoded website,
> > any update on that?
> >
> > [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
> 
> Did you try to change default codepage in Option manager?
> press o (Option Manager) -> Document -> Charset -> Default codepage ->  
> enter gb2312
> Save
Thanks, I have just tried what you said, it turns out that elinks could
recognize GB2312 correctly, but not GBK or GB18030.

I have also tried the flowing to dump the text, also, GB2312 works, but
not GBK or GB18030.

elinks -dump -eval 'set document.codepage.assume = "gb2312"' mutt.html

As GB2312 is support now, it should not be too hard to support GBK or
GB18030. GBK is a sub-set of GB2312, and GB18030 is a supper-set of
GB2312, both of them are commonly used as the content type of a HTML
email message, it is better to support them both, so the following could
be put into the .mailcap file.

text/html; elinks -dump -eval 'set document.codepage.assume = "%{charset}"' 
%s; copiousoutput;
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users


Re: [elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread gg
W dniu 19.01.2013 o 12:22 Techlive Zheng  pisze:

> Hey, I found that elinks now could display HTML file with Content-Type
> header set as "GBK" or "GB2312" properly, but how could I specify that
> to display part of a GBK encoded file with no Content-Type header set?
>
> The problem I have is that I intend to use elinks to dump HTML email to
> text in mutt, and most of my emails are encoded in GBK or GB2312 without
> the Content-Type header set, I can not find a way to tell elinks about
> the encoding of these HTML emails.
>
> I found a thread[1] that talked about displaying BIG5 encoded website,
> any update on that?
>
> [1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920

Did you try to change default codepage in Option manager?
press o (Option Manager) -> Document -> Charset -> Default codepage ->  
enter gb2312
Save
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users


[elinks-users] How to specify the codepage for a portion of GBK encoded HTML file?

2013-01-19 Thread Techlive Zheng
Hey, I found that elinks now could display HTML file with Content-Type
header set as "GBK" or "GB2312" properly, but how could I specify that
to display part of a GBK encoded file with no Content-Type header set?

The problem I have is that I intend to use elinks to dump HTML email to
text in mutt, and most of my emails are encoded in GBK or GB2312 without
the Content-Type header set, I can not find a way to tell elinks about
the encoding of these HTML emails.

I found a thread[1] that talked about displaying BIG5 encoded website,
any update on that?

[1]: http://thread.gmane.org/gmane.comp.web.elinks.user/1920
___
elinks-users mailing list
elinks-users@linuxfromscratch.org
http://linuxfromscratch.org/mailman/listinfo/elinks-users