This link could be useful for your data conversion. http://nepalinlp.blogspot.com/2010/09/few-years-back-ttf-to-unicode.html
Santa Basnet

On Wed, Nov 13, 2013 at 9:05 PM, sapradhan <[email protected]> wrote:

> Nice initiative.
>
> I looked into the PDF and it seems that two fonts are in use: Preeti
> and PCS Nepali. It turns out PCS Nepali has a different key mapping than
> Preeti, specifically in the numeric layer (^=ट, &=ठ, *=ड and so forth),
> which is causing quite a few errors. If it is feasible to change the
> mapping based on the font being used, the conversion should be better.
>
> If it would be quicker to do this manually, I can help. I don't have write
> access to the Google Docs; please do the needful.
>
> thanks
> santosh
>
> On Wednesday, November 13, 2013 6:19:09 PM UTC+5:45, prawesh wrote:
>>
>> Hello all,
>>
>> If you look at this constituency,
>> http://election.opennepal.net:8000/#/constituency/39, there are lots of
>> issues with Unicode, which should not have occurred if the data had been
>> available in a proper format. We scraped the data (in the Nepali Preeti
>> font) from
>> http://www.election.gov.np/oldecn/NP/pollinglist/dist_const_list.html.
>> The task was not easy; https://github.com/foss-np/2utf8 was used to
>> convert Preeti to Unicode. There could be problems in the conversion as
>> well as in the scraping. We think that it might be quick to get help from
>> the community to resolve these issues. For ease, we have maintained all
>> the scraped polling centers district-wise in the Google Docs
>> here<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=77>.
>> We plan to release these data on opendatanepal.org as well, but only
>> after resolving those issues, for which we seek your help.
>>
>> The District-file
>> sheet<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=78>
>> lists the districts and the corresponding polling-list PDF file (from
>> which we scraped). The corresponding district page has all the polling
>> centers (with issues; there might be repetitions as well). We have
>> created columns for the corrected center name and the romanized center
>> name, so you can correct the center names and add any missing centers.
>> The
>> Summary<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=77>
>> shows the list of districts with issues and corrected names - the Google
>> script will run and update the numbers there. For example, the Taplejung
>> district
>> page<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=2>
>> seems to have 3-4 issues in names. Thank you so much for your help.
>>
>>
>> --
>> With Regards,
>>
>> Prawesh Shrestha
>>
> --
> --
> FOSS Nepal mailing list: [email protected]
> http://groups.google.com/group/foss-nepal
> To unsubscribe, e-mail: [email protected]
>
> Mailing List Guidelines:
> http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
> Community website: http://www.fossnepal.org/
>
> ---
> You received this message because you are subscribed to the Google Groups
> "FOSS Nepal" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/groups/opt_out.
>

--
Santa B. Basnet
Department of Computer Science & Engineering
Nepal Engineering College
Changunarayan, Bhaktapur, Nepal
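
The per-font mapping suggested in the thread could be sketched roughly as below. This is only an illustration, not how 2utf8 is implemented: the names `FONT_TABLES`, `convert_run` and `convert_runs` are hypothetical, the Preeti values for `^`, `&` and `*` are an assumption that should be checked against the converter's real table, and only the three PCS Nepali entries come from the thread. It also assumes the scraper can report which font each run of text uses.

```python
# Assumed Preeti values for three keys (verify against the real converter table).
PREETI_MAP = {"^": "६", "&": "७", "*": "८"}

# Overrides for PCS Nepali as reported in the thread: ^=ट, &=ठ, *=ड.
PCS_NEPALI_OVERRIDES = {"^": "ट", "&": "ठ", "*": "ड"}

# One lookup table per font; PCS Nepali starts from Preeti and applies its overrides.
FONT_TABLES = {
    "Preeti": PREETI_MAP,
    "PCS Nepali": {**PREETI_MAP, **PCS_NEPALI_OVERRIDES},
}


def convert_run(text, font):
    """Convert one run of legacy-font text using the table for that font.

    Characters without an entry in the table are passed through unchanged.
    """
    table = FONT_TABLES[font]
    return "".join(table.get(ch, ch) for ch in text)


def convert_runs(runs):
    """Convert a sequence of (font, text) pairs and join the results."""
    return "".join(convert_run(text, font) for font, text in runs)
```

With a table per font, the same key can then produce a different character depending on the run it came from, for example `convert_runs([("Preeti", "^"), ("PCS Nepali", "^")])`.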
