Implementation of Unicode Porting For Marathi using Free Software
in a Maharashtra Govt Department
Background
Over the decade and a half in the past, indian IT industry driven by Central
govt efforts and lead by C-DAC, NCST , IIT kanpur and other instituions, has done a
commendable job of introducing local scripts on DOS and Windows platforms etc. So
long as the computers were confined to typing and printing , and that too largely
within the confines of an office or an enterprise, the ad-hoc i.e. Non-standard
approaches followed were infact an unexpected delight! However now that the demands
for indian language working are for computing ( which includes searching and sorting)
, for communication across networks and across platforms and across applications the
non-standard approaches are a problem. On top of it , due to globalisation , there is
need for multilingual documents i,e a document simultaneously containing many
languages including chinese/japanese and european etc . The need to create webpages
with such multilingual content with interactions leaves very little scope o
f choice other than unicode. Thus the old font-encoded content will simply not do.
However many governement organisations , with progressive bureacrats wanting to
change over to this modern format are blocked in the path by the huge legacy of
documents in older formats. Hence the capability to transform the old content to
unicode is a crucial requrement. Those who want to convert to use of Opensource
software technologies are posed witth this problem even more acutely.There is not even
that little chance of using old font-encoded content as may be available for the
Proporietory software users.
This felt need lead us to the project of porting to unicode in Directorate of IT in
Maharastra Govt. As is well known , the Govt of Maharashtra is not only a leading
state in the field of Finance and industry but also in the field of IT and
e-governance.
What is the Significance of this project ?
This project of converting to unicod is , as far as we know, for the first time that
a government body has undertaken to convert to unicode and that too on Free Software.
1.The most important effect of this will be to open the possibility to shift to
OpenSource software by the governments or any other corporate body who have
accumulated a lot of content in old font-encoded formats. This does not prevent anyone
to work with (more recent versions of) proprietory software like Microsoft Windows
2000 onwards. In fact it is a necessary requirement for many of them.
2.The other significant advantage will be that cost of working in indian languages
will be reduced drastically.
3.Thirdly all the content(information and data) will be available in sortable and
searchable global format which is what unicode enables.
This will fulfill a long-term requirement of many departments of IT in many
governemnts and Publicsector organsiations.
What we have Accomplished
Government of Maharastra , being particular about standards, chose the best option of
standardising on the government organisation like C-DAC and workind with ISM software
and ISFOC fonts. They used the Lotus Word Pro software and Microsoft Word ediors
enabled with marathi by ISM and iPlugin of C-DAC. We have converted this Marathi
content originally created in various ISFOCfonts of C-DAC ( using ISM and I-Plugin in
several proprietory softwares) into unicode based, open type fonts which have been
developed and made freely available by us We had to convert the legacy documents into
new encoding , integrate the same with existing Database of the Govt.(DJMS =Document
Journey management Sysytem) . We worked with one department to start with, i.e.
Directorate of IT . We have also installed opesource software i.e OpenOffice 1.1 in
some machines and demonstrated its use and trained some people so that all fresh work
can begin in unicode now. Task of making the Database
metafiles in unicode has also been completed .
Underestimation:
Initially the department estimated about 1000 files to be converted. However , we
have had to deal with about 6700 files , identified and picked out the files that
required convesions and converted about 2000 files. For actual work see the table
above for work accomplished.
File Types and fonts:
We converted files with tables 9real painful) and without tables, from
Lotus Word Processor, MSWord, Power Point, EXCEL etc and using DV-TT, DVB-TT and
DVBW fonts.
Font Developed :
We have also developed gargi font. Initiated By Dr Nagarjun of Free Software
Foundation(India) , right from the beginning (july 2002 ) our personnel developed and
now we at indictrans maintain the font.During conversion we came across some
deficienciesin the font. See our site for the more complete snapshots of
performance of our fonts in all platforms and applications.
We have also developed a Gujarati font padmaa (Prof Jitendra Shah's mother's name)
We have translated the GUI messages In Marathi and Gujarati , (while
modified IndLinux) Hindi .
Other Localisations achieved:
As linuxers know , we have come out with a live, bootable CD enabled in 4 languages,
Hindi, Marathi and Gujarati with Debian GNU/Linux operating system and OpenOffice ,
Mozilla and other software. This is known s gnubhaaratii and has tutorials embedded in
it. See www.indictrans.org for more details.
We have translated GUI messsages in marathi, Gujarati for GNOME 2.4 and have
modified some translations for Hindi as coordinated by IndLinux.
We have also localsed DrGeo and other small software to show proof of concept of how
to localise programs.
We maintain a website as a collaboration platform www.indictrans.org
jitendra
--
http://mm.ilug-bom.org.in/mailman/listinfo/linuxers