On Jul 1, 8:42 pm, Jim <[EMAIL PROTECTED]> wrote:
> On Jul 1, 8:29 pm, John Machin <[EMAIL PROTECTED]> wrote:
> > Comments on the above grep output:
> > 1. You have SOFT HYPHEN twice, mapping it to u'-' and '-'
>
> Hmph. I'll correct that. Thanks.
Well, maybe not. I forgot that I got the by-hand
Thank you.
That is exactly what I was looking for.
2008/7/2 Jim <[EMAIL PROTECTED]>:
> Peter Bulychev wrote:
> > I want to convert unicode character into ascii one.
> You have to make some arbitrary choices of what to translate. Based
> on some materials on effbot's site, and a recipe, I made
On 2008-07-01 20:31, Peter Bulychev wrote:
Hello.
I want to convert unicode character into ascii one.
The method ".encode('ASCII') " can convert only those unicode characters,
which fit into 0..128 range.
But there are still lots of characters beyond this range, which can be
manually converted
Jim <[EMAIL PROTECTED]> writes:
> I don't like the spacing in [PEP 8], personally.
Nevertheless, your Python code will be much less effort to read by
others (and yourself in future) if it is written in conformance with
PEP 8.
Writing all your Python code to conform with that standard is the
simp
On Jul 1, 8:29 pm, John Machin <[EMAIL PROTECTED]> wrote:
> On Jul 2, 9:55 am, Jim <[EMAIL PROTECTED]> wrote:
>
> Comments on the above grep output:
> 1. You have SOFT HYPHEN twice, mapping it to u'-' and '-'
Hmph. I'll correct that. Thanks.
> 2. The idea of a soft hyphen is as a hint to a hyphena
On Jul 1, 8:29 pm, John Machin <[EMAIL PROTECTED]> wrote:
> On Jul 2, 9:55 am, Jim <[EMAIL PROTECTED]> wrote:
>
> Comments on the above grep output:
> 1. You have SOFT HYPHEN twice, mapping it to u'-' and '-'
Hmph. I'll correct that. Thanks.
> 2. The idea of a soft hyphen is as a hint to a hyphena
On Jul 2, 9:55 am, Jim <[EMAIL PROTECTED]> wrote:
> Peter Bulychev wrote:
> > I want to convert unicode character into ascii one.
>
> You have to make some arbitrary choices of what to translate. Based
> on some materials on effbot's site, and a recipe, I made
> ftp://alan.smcvt.edu/hefferon/uni
Peter Bulychev wrote:
> I want to convert unicode character into ascii one.
You have to make some arbitrary choices of what to translate. Based
on some materials on effbot's site, and a recipe, I made
ftp://alan.smcvt.edu/hefferon/unicode2ascii.py
which has at least some of what you are looking
Peter Bulychev wrote:
> I want to convert unicode character into ascii one.
You have to make some arbitrary choices of what to translate. Based
on some materials on effbot's site, and a recipe, I made
ftp://alan.smcvt.edu/hefferon/unicode2ascii.py
which has at least some of what you are looking
Peter Bulychev wrote:
Hello.
I want to convert unicode character into ascii one.
The method ".encode('ASCII') " can convert only those unicode
characters, which fit into 0..128 range.
But there are still lots of characters beyond this range, which can be
manually converted to some visibly
Peter Bulychev wrote:
Thank you for you answer.
If you only want this to work for a subset, please define that subset.
Actually, I want to convert only punctuations (dots, commas, hyphens
and so on).
Then make your translation table manually and apply this method:
unicode.translate
Fina
Thank you for you answer.
If you only want this to work for a subset, please define that subset.
Actually, I want to convert only punctuations (dots, commas, hyphens and so
on).
--
Best regards,
Peter Bulychev.
--
http://mail.python.org/mailman/listinfo/python-list
Peter Bulychev wrote:
Hello.
I want to convert unicode character into ascii one.
The method ".encode('ASCII') " can convert only those unicode
characters, which fit into 0..128 range.
But there are still lots of characters beyond this range, which can be
manually converted to some visibly si
Hello.
I want to convert unicode character into ascii one.
The method ".encode('ASCII') " can convert only those unicode characters,
which fit into 0..128 range.
But there are still lots of characters beyond this range, which can be
manually converted to some visibly similar ascii characters. For
14 matches
Mail list logo