[datameet] Parsing Voters List : Glyph to Unicode issue

Siddharth Vijayakrishnan Tue, 01 Sep 2015 07:10:15 -0700

Hi, 

I downloaded a few files containing voter rolls and tried to parse the PDFs 
using pdfminer. Ran straight into a problem[1] where the glyphs are converted 
to unicode using a wrong character map.  Before I try and solve this on my own, 
I wonder if anyone in this community has a readymade solution ?


[1] 
http://stackoverflow.com/questions/31876415/parsing-a-pdfdevanagari-script-using-pdfminer-gives-incorrect-output

-- 
Datameet is a community of Data Science enthusiasts in India. Know more about 
us by visiting http://datameet.org
--- 
You received this message because you are subscribed to the Google Groups 
"datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

[datameet] Parsing Voters List : Glyph to Unicode issue

Reply via email to