Re: New tool unidump

2017-03-20 Thread Andrey Lukyanov

apropos unidump:

It would be nice to add the option of printing not only numbers, but also
character names and other info from the NamesList.txt file.

I am using a homemade program at my computer:

$ typecode fc25 
66 63 32 35 

$ typecode -l fc25 
0066LATIN SMALL LETTER F
0063LATIN SMALL LETTER C
0032DIGIT TWO
0035DIGIT FIVE

$ typecode -f fc25 
0066LATIN SMALL LETTER F
0063LATIN SMALL LETTER C
0032DIGIT TWO
~ 0032 FE0E text style
~ 0032 FE0F emoji style
0035DIGIT FIVE
~ 0035 FE0E text style
~ 0035 FE0F emoji style



Re: New tool unidump

2017-03-19 Thread Rebecca T
I maintain a list of various Unicode tools and resources at
unicode.yea.rs and always welcome new additions!

On Sat, Mar 18, 2017 at 1:42 AM, Janusz S. Bien  wrote:

> Quote/Cytat - Manuel Strehl  (Fri 17 Mar 2017
> 09:44:15 PM CET):
>
> Hi,
>>
>> for my work on codepoints.net and Emojipedia I found myself repeatedly
>> in a place, where I needed some tool like hexdump to inspect the content
>> of a string. However, instead of raw bytes I am more interested in the
>> code points that the string is composed of. So I wrote this tool.
>>
>
> Is somebody maintaining a list of such utilities?
>
> There is a page
>
> http://www.unicode.org/resources/online-tools.html
>
> but I remember that earlier a page on the site used to be links to the
> programs mentioned in 2012 "Tool to convert characters to character names",
> in particular to Bill Poser's uniutils (http://billposer.org/Software
> /unidesc.html) and the orphaned unihist by a student of mine (
> https://bitbucket.org/jsbien/unihistext). I'm unable to find them now.
>
> Best regards
>
> Janusz
>
> --
> Prof. dr hab. Janusz S. Bień -  Uniwersytet Warszawski (Katedra
> Lingwistyki Formalnej)
> Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
> jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~
> jsbien/
>
>


Re: New tool unidump

2017-03-17 Thread Janusz S. Bien
Quote/Cytat - Manuel Strehl  (Fri 17 Mar 2017  
09:44:15 PM CET):



Hi,

for my work on codepoints.net and Emojipedia I found myself repeatedly
in a place, where I needed some tool like hexdump to inspect the content
of a string. However, instead of raw bytes I am more interested in the
code points that the string is composed of. So I wrote this tool.


Is somebody maintaining a list of such utilities?

There is a page

http://www.unicode.org/resources/online-tools.html

but I remember that earlier a page on the site used to be links to the  
programs mentioned in 2012 "Tool to convert characters to character  
names", in particular to Bill Poser's uniutils  
(http://billposer.org/Software/unidesc.html) and the orphaned unihist  
by a student of mine (https://bitbucket.org/jsbien/unihistext). I'm  
unable to find them now.


Best regards

Janusz

--
Prof. dr hab. Janusz S. Bień -  Uniwersytet Warszawski (Katedra  
Lingwistyki Formalnej)

Prof. Janusz S. Bień - University of Warsaw (Formal Linguistics Department)
jsb...@uw.edu.pl, jsb...@mimuw.edu.pl, http://fleksem.klf.uw.edu.pl/~jsbien/



Re: New tool unidump

2017-03-17 Thread Manish Goregaokar
https://r12a.github.io/uniview/

https://r12a.github.io/apps/conversion/

are excellent tools for this, as well, if you're in a situation where
you can copy into a web form.

This looks useful for commandline stuff, though, thanks!
-Manish


On Fri, Mar 17, 2017 at 1:44 PM, Manuel Strehl  wrote:
> Hi,
>
> for my work on codepoints.net and Emojipedia I found myself repeatedly
> in a place, where I needed some tool like hexdump to inspect the content
> of a string. However, instead of raw bytes I am more interested in the
> code points that the string is composed of. So I wrote this tool.
>
> I reasoned, that it might come in handy for other people on this list.
> It is, conveniently, named unidump and can be installed via pip (pip3,
> that is, because it needs Python 3):
>
> pip3 install unidump
>
> The source code is available on Github,
> https://github.com/Codepoints/unidump, and the tool is MIT licensed. The
> README on Github also explains some other use cases, like counting code
> points in a file (as opposed to bytes) or using it as a replacement for
> strings(1).
>
> If you have any comment, feedback, bug report or other questions, I'm
> glad to answer any of those.
>
> Cheers and have a nice weekend,
> Manuel