Re: New tool unidump

2017-03-20 Thread Andrey Lukyanov

apropos unidump:

It would be nice to add the option of printing not only numbers, but also
character names and other info from the NamesList.txt file.

I am using a homemade program at my computer:

$ typecode fc25 
66 63 32 35 

$ typecode -l fc25 
0066LATIN SMALL LETTER F
0063LATIN SMALL LETTER C
0032DIGIT TWO
0035DIGIT FIVE

$ typecode -f fc25 
0066LATIN SMALL LETTER F
0063LATIN SMALL LETTER C
0032DIGIT TWO
~ 0032 FE0E text style
~ 0032 FE0F emoji style
0035DIGIT FIVE
~ 0035 FE0E text style
~ 0035 FE0F emoji style



Re: New tool unidump

2017-03-19 Thread Rebecca T
I maintain a list of various Unicode tools and resources at
unicode.yea.rs and always welcome new additions!

On Sat, Mar 18, 2017 at 1:42 AM, Janusz S. Bien  wrote:

> Quote/Cytat - Manuel Strehl  (Fri 17 Mar 2017
> 09:44:15 PM CET):
>
> Hi,
>>
>> for my work on codepoints.net and Emojipedia I found myself repeatedly
>> in a place, where I needed some tool like hexdump to inspect the content
>> of a string. However, instead of raw bytes I am more interested in the
>> code points that the string is composed of. So I wrote this tool.
>>
>
> Is somebody maintaining a list of such utilities?
>
> There is a page
>
> http://www.unicode.org/resources/online-tools.html
>
> but I remember that earlier a page on the site used to be links to the
> programs mentioned in 2012 "Tool to convert characters to character names",
> in particular to Bill Poser's uniutils (http://billposer.org/Software
> /unidesc.html) and the orphaned unihist by a student of mine (
> https://bitbucket.org/jsbien/unihistext). I'm unable to find them now.
>
> Best regards
>
> Janusz
>
> --
> Prof. dr hab. Janusz S. Bień -  Uniwersytet Warszawski (Katedra
> Lingwistyki Formalnej)
> Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
> jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~
> jsbien/
>
>


Re: New tool unidump

2017-03-17 Thread Janusz S. Bien
Quote/Cytat - Manuel Strehl  (Fri 17 Mar 2017  
09:44:15 PM CET):



Hi,

for my work on codepoints.net and Emojipedia I found myself repeatedly
in a place, where I needed some tool like hexdump to inspect the content
of a string. However, instead of raw bytes I am more interested in the
code points that the string is composed of. So I wrote this tool.


Is somebody maintaining a list of such utilities?

There is a page

http://www.unicode.org/resources/online-tools.html

but I remember that earlier a page on the site used to be links to the  
programs mentioned in 2012 "Tool to convert characters to character  
names", in particular to Bill Poser's uniutils  
(http://billposer.org/Software/unidesc.html) and the orphaned unihist  
by a student of mine (https://bitbucket.org/jsbien/unihistext). I'm  
unable to find them now.


Best regards

Janusz

--
Prof. dr hab. Janusz S. Bień -  Uniwersytet Warszawski (Katedra  
Lingwistyki Formalnej)

Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/



Re: New tool unidump

2017-03-17 Thread Manish Goregaokar
https://r12a.github.io/uniview/

https://r12a.github.io/apps/conversion/

are excellent tools for this, as well, if you're in a situation where
you can copy into a web form.

This looks useful for commandline stuff, though, thanks!
-Manish


On Fri, Mar 17, 2017 at 1:44 PM, Manuel Strehl  wrote:
> Hi,
>
> for my work on codepoints.net and Emojipedia I found myself repeatedly
> in a place, where I needed some tool like hexdump to inspect the content
> of a string. However, instead of raw bytes I am more interested in the
> code points that the string is composed of. So I wrote this tool.
>
> I reasoned, that it might come in handy for other people on this list.
> It is, conveniently, named unidump and can be installed via pip (pip3,
> that is, because it needs Python 3):
>
> pip3 install unidump
>
> The source code is available on Github,
> https://github.com/Codepoints/unidump, and the tool is MIT licensed. The
> README on Github also explains some other use cases, like counting code
> points in a file (as opposed to bytes) or using it as a replacement for
> strings(1).
>
> If you have any comment, feedback, bug report or other questions, I'm
> glad to answer any of those.
>
> Cheers and have a nice weekend,
> Manuel


New tool unidump

2017-03-17 Thread Manuel Strehl
Hi,

for my work on codepoints.net and Emojipedia I found myself repeatedly
in a place, where I needed some tool like hexdump to inspect the content
of a string. However, instead of raw bytes I am more interested in the
code points that the string is composed of. So I wrote this tool.

I reasoned, that it might come in handy for other people on this list.
It is, conveniently, named unidump and can be installed via pip (pip3,
that is, because it needs Python 3):

pip3 install unidump

The source code is available on Github,
https://github.com/Codepoints/unidump, and the tool is MIT licensed. The
README on Github also explains some other use cases, like counting code
points in a file (as opposed to bytes) or using it as a replacement for
strings(1).

If you have any comment, feedback, bug report or other questions, I'm
glad to answer any of those.

Cheers and have a nice weekend,
Manuel