Hello all, I am not sure if this is the right forum, but would love to get any pointers.
I am volunteering with a local Hindi newspaper and want to get their editions online in web searchable format. Here is the link to the site. http://aainanews.blogspot.in/2012/08/14th-issue-3-year.html The biggest hurdle I am facing is to convert the fonts the paper is encoded in (APS-Priyanka) and converting them to unicode (assuming that I can extract the text from the pdfs and keeping the formatting issues on the side for the moment) >From what I gathered from web searches, APS Priyanka is a really old font and >does not follow any specific encoding like ISCII etc. I tried some basic >scripts and character maps but it does not seem like a "trivial" problem. If anyone has experience in this and can help, it would be great. best, Rushabh W: https://erpnext.com T: @rushabh_mehta -- For more details about this list http://datameet.org/discussions/ --- You received this message because you are subscribed to the Google Groups "datameet" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.
