Hi, I downloaded a few files containing voter rolls and tried to parse the PDFs using pdfminer. I ran straight into a problem [1]: the glyphs are converted to Unicode using the wrong character map. Before I try to solve this on my own, does anyone in this community have a ready-made solution?
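One possible workaround, given here only as a rough sketch: if the wrong mapping turns out to be consistent across the files, the text that pdfminer returns could be post-processed with a hand-built correction table. Everything in the example is an assumption made for illustration. The name `GLYPH_FIXES`, the helper `remap_text`, and the three mapping pairs are hypothetical placeholders and are not taken from the actual voter-roll fonts.

```python
# Hypothetical correction table: wrongly decoded sequence -> intended Devanagari text.
# These pairs are illustrative placeholders; a real table would have to be
# built by comparing the extracted text with the rendered PDF.
GLYPH_FIXES = {
    "d": "\u0915",   # assumed: "d" should have been KA
    "[k": "\u0916",  # assumed: "[k" should have been KHA
    "x": "\u0917",   # assumed: "x" should have been GA
}


def remap_text(text, fixes=GLYPH_FIXES):
    """Replace wrongly decoded sequences, trying the longest keys first."""
    keys = sorted(fixes, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(fixes[key])
                i += len(key)
                break
        else:
            # No key matches at this position: keep the character unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)
```

Matching the longest keys first matters when one wrong sequence is a prefix of another. This approach only helps if each wrong sequence always stands for the same intended text; if the PDF maps several glyphs to the same code point, the information is already lost at extraction time and a table like this cannot recover it.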
[1] http://stackoverflow.com/questions/31876415/parsing-a-pdfdevanagari-script-using-pdfminer-gives-incorrect-output
-- Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org
