nice initiative, 

I looked into the pdf and it seems that there are two fonts in use Preeti 
and PCS Nepali. It turns out PCS Nepali has different keymapping than 
Preeti specifically the numeric layer ie ^‌=ट ,  &=ठ  ,  *=ड and so forth 
which is causing quite a few errors. If it is feasible change the mapping 
based on the font being used, the conversion should be better. 

If it would be quicker to do this manually I can help. I dont have write 
access to google docs, please do the needful 
thanks
santosh

On Wednesday, November 13, 2013 6:19:09 PM UTC+5:45, prawesh wrote:
>
> Hello all,
>
> If you look at this constituency 
> http://election.opennepal.net:8000/#/constituency/39, there are lots of 
> issues with unicode, which should not have been if the data were available 
> in proper format. We scraped the data (in Nepali Preeti font) from 
> http://www.election.gov.np/oldecn/NP/pollinglist/dist_const_list.html. 
> The task was not easy, https://github.com/foss-np/2utf8 was used to 
> convert Preeti to Unicode. There could be problems in either conversion as 
> well as scraping. We think that it might be quick to get help from the 
> community to resolve these issues. For ease, we have maintained all the 
> scraped polling centers district-wise in the google docs 
> here<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=77>.
>  
> We plan to release these data in opendatanepal.org as well but after 
> resolving those issues, for which we seek your help. 
>
> District-file 
> sheet<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=78>
>  lists 
> the districts and appropriate polling-list pdf file (from which we 
> scraped). And the corresponding district page has all the polling centers 
> (with issues, there might be repetitions as well). We have created columns 
> for corrected center name and romanized center name. So you could correct 
> the center names and add in case of missing centers. 
> Summary<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=77>
>  shows 
> the list of districts with issues and corrected name - the google script 
> will run and update the numbers there. For e.g, Taplejung district 
> page<https://docs.google.com/a/yipl.com.np/spreadsheet/ccc?key=0AhWLpToTogBwdDR4WWNuQkZMa0I0S0dXQjlCT3pISlE&usp=drive_web#gid=2>seems
>  to have 3-4 issues in names. Thank you so much for your help.
>
>
> -- 
> With Regards,
>
> Prawesh Shrestha 
>

-- 
-- 
FOSS Nepal mailing list: [email protected]
http://groups.google.com/group/foss-nepal
To unsubscribe, e-mail: [email protected]

Mailing List Guidelines: 
http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
Community website: http://www.fossnepal.org/

--- 
You received this message because you are subscribed to the Google Groups "FOSS 
Nepal" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to