Re: [Ankur-core] Huge contribution to E2B dictionary project
Hi, I'd be glad to help, Bhaiya :) I hope, I can help you, please send me a mail. Thanks Zaher and Salahuddin. I will be sending you a list of words each along with brief instructions shortly. Best, Golam -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Bengalinux-core mailing list Bengalinux-core@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bengalinux-core
[Ankur-core] Huge contribution to E2B dictionary project
Hi all, I am delighted to share the news that Mr Sharfuz Zaman (sharfuz at gmail dot com) has donated more than 16 thousand meanings to the Ankur E2B dictionary project. This is more than the number of edited entries that we currently have. I have started pushing them into our MySQL database. With a bit of checking and minor editing, this could take weeks to complete unless some of you want to help me out. Cheers, Golam
[Ankur-core] Ankur Abhidhan project needs help!
Hi All, The Ankur English to Bengali dictionary project [1] has been serving a steadily growing number of users for quite some time. According to Google Analytics, the Ankur E2B dictionary project served more than sixty thousand requests in the last month alone [2]. This has also led to increased contributions. Unfortunately, the project lacks the manpower to keep up with the increased demand. Also, I have been unable to give enough time to the project lately and I don't see my situation changing anytime soon. Consequently, a large number of contributed entries remain unedited [3]. So I am now seeking opinions from Ankur members on how to sustain the project meaningfully. I would be happy to make a personal request to anyone who might be interested in helping the project by any means. If you know of someone, either from Ankur or outside, who could help in this regard, then please let me know. It may be helpful to forward this request to any other interested groups. [1] http://www.bengalinux.org/english-to-bengali-dictionary/ [2] http://www.bengalinux.org/english-to-bengali-dictionary/VisitorsOverviewReport.pdf [3] http://www.bengalinux.org/cgi-bin/abhidhan/statistics.pl Cheers, Golam
Re: [Ankur-core] Ankur Dictionary
Hi Salahuddin, On Wed, May 13, 2009 at 2:26 AM, salahuddin66 salahuddi...@gmail.com wrote: Yes, you are right. Personally I also think getxml.pl would be a standard name. Now BDictXML imports BDictSQL and uses the existing db function. File attached. Please feel free to let me know if you need any changes. :) First of all, it is coming out very nicely. I have one more suggestion. <dict_entry_unedited id="7"> I would prefer not to have the above element, for the following reason: the client-side parser would need separate code for reading unedited entries. My suggestion would be to have <status>UNEDITED</status> for each entry. Currently, we have four separate statuses: (1) EDITED (2) UNEDITED* (3) OBSOLETE (4) DELETED (keeping this as a status makes it easy to undo a delete by simply changing the status). Also, there is a unique id field for each dictionary entry. If you want to have editing capability in your client app, then you need to send back this id. So something like <bdict_id>23232</bdict_id> would be good to have for each entry. Given that you are almost finished with the spec, I would urge you to write down the final specifications so that we can put them up on the Abhidhan page. This will help those who want to write their own XML-based app using the Ankur dictionary. Here are the current dict_table elements: `id` int(11) NOT NULL auto_increment, `en_word` text, `pos_tag` text, `en_lemma` text, `bn_pronunciation` text, `bn_word` text, `explanation` text, `example` text, `status` text. Finally, please make sure to put your name in the license headers and in the final XML specs. Best, Golam
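The per-entry layout suggested above (a status element and a bdict_id element on every entry, instead of a separate dict_entry_unedited element) could be sketched roughly as follows. This is only an illustration in Python, not the actual getxml.pl output: the helper names entry_to_xml and build_results, the sample row, and the nesting under dictionary/search_results are assumptions. Only the element names, the id 23232, and the four status values are taken from this thread.

```python
import xml.etree.ElementTree as ET

# The four status values listed in the message above.
VALID_STATUSES = {"EDITED", "UNEDITED", "OBSOLETE", "DELETED"}


def entry_to_xml(row):
    """Build one <dict_entry> element from a dict_table-like row (hypothetical helper)."""
    if row["status"] not in VALID_STATUSES:
        raise ValueError("unknown status: %r" % row["status"])
    entry = ET.Element("dict_entry")
    # The unique database id is included so an editing client can send it back.
    ET.SubElement(entry, "bdict_id").text = str(row["id"])
    # Every entry carries its own status, so clients need only one parsing path.
    for field in ("en_word", "pos_tag", "status"):
        ET.SubElement(entry, field).text = row[field]
    return entry


def build_results(rows):
    """Wrap the entries in the (assumed) dictionary/search_results envelope."""
    root = ET.Element("dictionary")
    results = ET.SubElement(root, "search_results")
    for row in rows:
        results.append(entry_to_xml(row))
    return ET.tostring(root, encoding="unicode")


# Sample row for illustration only; it is not real database content.
sample_rows = [
    {"id": 23232, "en_word": "read", "pos_tag": "Noun, singular or mass", "status": "UNEDITED"}
]
xml_text = build_results(sample_rows)
```

With this shape, a client filters on the text of the status element rather than on the element name, which is the point of the suggestion.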
Re: [Ankur-core] XML standard for Ankur's Abhidhan
Hi, On Tue, May 12, 2009 at 5:13 PM, Salahuddin Pasha salahuddi...@gmail.com wrote: Basic work is already done, but we need to define a standard XML (XML DTD or XML Schema). Example: test XML output. <?xml version="1.0" encoding="utf-8"?> <dictionary> <search_results> <dict_entry id="1"> <en_word>read</en_word> <pos_tag>Noun, singular or mass</pos_tag> Thanks a lot for your work. I would suggest that you also try to have an entry for the Penn tag for parts of speech (POS), like NN, VV etc. So something like <penn_tag>NN</penn_tag> This would be needed if the Anubadok Online interface needs to update its database using your XML gateway to the Ankur dictionary database. Cheers, Golam
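To illustrate why a separate penn_tag element would help a consumer such as Anubadok, here is a small client-side sketch in Python. The sample document is an assumption: it completes the truncated example above with closing tags and adds the suggested penn_tag element. The function name read_entries is hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed sample: the truncated example from the message, closed off and
# extended with the suggested <penn_tag> element.
SAMPLE = """<?xml version="1.0" encoding="utf-8"?>
<dictionary>
  <search_results>
    <dict_entry id="1">
      <en_word>read</en_word>
      <pos_tag>Noun, singular or mass</pos_tag>
      <penn_tag>NN</penn_tag>
    </dict_entry>
  </search_results>
</dictionary>"""


def read_entries(xml_text):
    """Return a list of dicts, one per <dict_entry>, including the Penn tag."""
    # Parse from bytes so the encoding declaration in the prolog is honoured.
    root = ET.fromstring(xml_text.encode("utf-8"))
    entries = []
    for entry in root.iter("dict_entry"):
        entries.append({
            "id": entry.get("id"),
            "en_word": entry.findtext("en_word", default=""),
            # Human-readable description, meant for display.
            "pos_tag": entry.findtext("pos_tag", default=""),
            # Compact machine-readable tag, meant for tools.
            "penn_tag": entry.findtext("penn_tag", default=""),
        })
    return entries


entries = read_entries(SAMPLE)
```

A tool can then key on the short tag directly instead of parsing the descriptive pos_tag text.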
Re: [Ankur-core] Firefox Padma Update for Anandabazar rendering bug
On Sun, May 10, 2009 at 12:33 AM, Shabab Mustafa shabab.must...@gmail.com wrote: The rendering of Padma cannot show the 'কার' and sometimes the 'য-ফলা' properly. Is it a drawback of Padma? Or the poor typeset of Anandabazar and Bartaman? It's likely that Padma is at fault. Could you please send me a link to the page where you see those issues? Cheers,
[Ankur-core] Firefox Padma Update for Anandabazar rendering bug
Hi, Some of you might find this update on the Padma rendering bug for ABP useful: http://methopath.wordpress.com/2009/05/09/improved-firefox-padma-for-reading-anandabazar-bartaman/ Cheers, Golam
[Ankur-core] Fwd: [Anubadok-core] english to bangla dictionary
Hello Asif, Given that Anubadok uses the dictionary database from Ankur's English to Bengali dictionary project ( http://www.bengalinux.org/english-to-bengali-dictionary/ ), I am forwarding your email to Ankur's mailing list. Regarding the encoding problem with the dictionary database dump: frankly, I am aware of this problem. It seems to have started after last year's SourceForge data-center migration, although the web interface works as expected. So I am somewhat guilty of not fixing this issue earlier. Does anyone know whether there is an easy way to fix the encoding of the MySQL database dump, or to re-encode it in UTF-8? Thanks, Golam -- Forwarded message -- From: asif mahmud acid_ko...@yahoo.com Date: Wed, May 6, 2009 at 8:20 AM Hello everyone, I am currently trying to develop an English to Bangla dictionary software for the desktop. I was really excited to find this project called Anubadok and the English to Bangla dictionary. The dictionary especially is the thing that will help me a lot. I was trying to fetch the Bangla content from the table dict_table. However, I failed to collect the strings. It was showing all junk. I saw that the table uses utf8 encoding, so I used the set NAMES = 'utf8' function before the select statements in MySQL. I tried with collations like utf8_general/unicode_ci but had no luck. I have been trying this for quite a long time but it's not working. If you could provide me with some clue regarding fetching the Bangla characters from the db, it would be really helpful for me. Thanks in advance -Asif
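One common cause of this kind of junk (an assumption here, not a confirmed diagnosis of the Ankur dump) is that UTF-8 bytes were at some point read as a single-byte Latin-1 style encoding and then stored again, so every Bengali character turns into three unrelated characters. The Python sketch below simulates that damage on a short string and reverses it. The function names are hypothetical, and a real MySQL dump may need a different codec, since MySQL's latin1 is not identical to ISO 8859-1.

```python
def make_mojibake(text):
    """Simulate the damage: UTF-8 bytes wrongly decoded as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair_mojibake(junk):
    """Reverse the damage; return the input unchanged if it does not fit the pattern."""
    try:
        return junk.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # Either the text contains characters outside Latin-1 (so it was
        # probably fine already) or the bytes are not valid UTF-8.
        return junk


original = "\u09aa\u09cd\u09b0"      # Bengali PA + hasanta + RA
damaged = make_mojibake(original)    # three characters become nine
repaired = repair_mojibake(damaged)
```

If a repaired sample of rows looks right, the same transformation could be applied to the whole dump file. On the client side, the statement MySQL accepts is SET NAMES 'utf8' (without an equals sign), issued before the SELECT.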
[Ankur-core] E2B dictionary project [Was: Re: waiting for repley ...]
Hello, Thanks for your interest in the Ankur English to Bengali dictionary project. You are always welcome to contribute to the project. Normally, you can use the dictionary interface to contribute. However, if you want to contribute offline, then please let me know. You can write juktakkhor, ra-phala, ba-phala etc. by using hasanta through the interface. For example, 1) প + ্ + র = প্র (PA + Hasanta + RA) 2) স + ্ + ব + প + ্ + ন = স্বপ্ন You can find more on juktakkhor at this link (courtesy Ekushey): http://prdownloads.sourceforge.net/ekushey/Bangla_Conjunction_Lists.pdf?download Best, On Fri, Mar 13, 2009 at 7:18 PM, Mohammad Onupom Rahman onup...@yahoo.com wrote: Assalamu alaikum, brother Golam Hossain. It is a matter of great regret that, even though the Bengali-speaking population is so large, there is no complete dictionary on the web to support learning English, which is needed for higher education. So you took on this heavy responsibility, but most likely you could not make progress because of a heavy workload and family troubles. I welcome this work of yours and would like to express my modest interest in your effort. If possible, please tell me how I can help take this work forward. I know Bangla Bijoy, and my English typing is also fairly good. But I saw that in your Unicode dictionary there is no way to type the Bengali ra-phala or ya-phala, which can be typed easily on the Bijoy keyboard. As a result, despite wanting to, I could not add new words to the dictionary. Please look into these matters. Finally, I remain waiting for your mail. Mohammad Onupom Rahman
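The hasanta rule described above can be checked in a few lines of Python. This is a minimal sketch: the helper name make_conjunct is hypothetical, and the code points are written as escapes so that the example does not depend on any input method. In Unicode the hasanta is U+09CD, whose official character name is BENGALI SIGN VIRAMA.

```python
import unicodedata

HASANTA = "\u09cd"  # BENGALI SIGN VIRAMA, called hasanta in Bengali


def make_conjunct(*consonants):
    """Join consonants with hasanta so that they form one conjunct (juktakkhor)."""
    return HASANTA.join(consonants)


# Example 1 from the message: PA + hasanta + RA (ra-phala).
pra = make_conjunct("\u09aa", "\u09b0")

# Example 2 from the message: SA + hasanta + BA, then PA + hasanta + NA.
swapna = make_conjunct("\u09b8", "\u09ac") + make_conjunct("\u09aa", "\u09a8")

# Character names, useful for checking what was actually typed.
pra_names = [unicodedata.name(ch) for ch in pra]
```

Whether the sequence is drawn as a single joined glyph depends on the font and the rendering engine. The stored text is always the plain sequence of consonants and hasanta.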
[Ankur-core] New Editor Member of Ankur E2B Dictionary Project
Hi All, On behalf of the Ankur English to Bengali dictionary project, I am delighted to welcome Rezwan as the new editor of the dictionary project. He will be joining Dipankar Das and myself in having editing access to the Ankur dictionary database. Some of you may already know Rezwan. He is a prolific blogger. He blogs on wide-ranging issues, including several important social issues. He is a regular contributor to Global Voices Online [1] both in English and Bengali. You can read more in his blog http://rezwanul.blogspot.com In the last couple of months, I have had several email exchanges with him and he expressed a keen interest in having a free dictionary for Bengali. I asked him whether he could help the Ankur dictionary project with his expertise. I am glad to have received his positive reply. Apart from editing, he is currently also exploring an interesting idea of giving out gift cheques to volunteers of the project who contribute a significant number of new dictionary entries. Personal donations could be one of the funding sources. Finally, the current statistics of the dictionary project are available here: http://www.bengalinux.org/cgi-bin/abhidhan/statistics.pl [1] http://globalvoicesonline.org Cheers, Golam -- http://methopath.wordpress.com/
Re: [Ankur-core] Firefox Padma :: Support for almost all Bengali News Sites (India + Bangladesh)
On Mon, Feb 16, 2009 at 2:43 AM, Omi m...@omiazad.net wrote: In my point of view this addon sux. It doesn't even change the encoding of the page after converting it into Unicode. The text becomes garbage if we use this. Try changing your browser encoding to Unicode ( View - Character Encoding - Unicode ). Which Firefox version, OS, and default encoding have you tried? Please let me know. I will pass your issue upstream :-) Best, Golam
[Ankur-core] Fwd: Firefox Padma :: Support for almost all Bengali News Sites (India + Bangladesh)
-- Forwarded message -- From: সোহম চ্যাটার্জী soham.chatter...@gmail.com Date: Mon, Feb 16, 2009 at 7:03 AM Subject: Re: Firefox Padma :: Support for almost all Bengali News Sites (India + Bangladesh) To: Golam Mortuza Hossain gmhoss...@gmail.com Dear Golam! Let me congratulate you at the outset for a great service to all Bengali speakers. With one stroke a number of significant milestones have been crossed. I have installed the new version in both Windows and Linux, have checked out a number of sites, and have added 10 more sites to the white-list which can also be transformed by the new version. I have observed some glitches which, I am sure, have been obvious to you too. These sites have been added to the white-list - www.jugantor.com www.chandpurkantha.com www.satyeralo.net www.weeklyamod.com www.banglapaper.net www.weeklysonarbangla.com www.bangladesherkhela.info www.canvasonline.biz www.energybangla.com www.computerbarta.com www.suprovatbangladesh.com Some problems that have been observed are - Samokal - র্যাবের নিষ্ক্রিয়তা অার পুলিশের শিথিলতার কারণে রাজধানীর অাইন-শৃঙ্খলা পরিস্্থিতি উদ্্বগেজনক পর্যায়ে পেঁৗছেছে। The text should be - র্যাবের নিষ্ক্রিয়তা আর পুলিশের শিথিলতার কারণে রাজধানীর আইন-শৃঙ্খলা পরিস্থিতি উদ্বেগজনক পর্যায়ে পৌঁছেছে। The Jugantor site has posed some problems for me. Padma has transformed the main page and most of the site text, but when an article is clicked upon, Padma doesn't transform the text, either automatically or through the direct method of selecting text and clicking Transform to Unicode. This, I think, is a site problem rather than Padma's. Also, on installing in Ubuntu Linux, I have messed up the settings and auto-transform is not working any more. I have tried uninstalling and re-installing but the problem persists. Most of the glitches with other fonts that I have noticed through a quick perusal concern া, conjuncts with lower ব, ্য, and ঃ .
This only goes to show that an amazing job has been done and I am honored to enjoy this correspondence with you. The next mile-stone is the BanglapediaII font of the Banglapedia site, and maybe the West Bengal Government's Banglarmukh site which uses some WebSamhita font which I have not managed to install and run in IE. Only a few sites remain such as Parabaas, Vinnomot, Viswayan, Medinipur.com, Guruchandali, Prabasi.org, Abasar and Al-Ihsan. But these sites use separate fonts. I would like to see the new mappings when the source is available regarding which you had been discussing with Harshita Vani. I guess it will be much easier to map the fonts then. Thanks and best wishes, Soham
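The Samokal glitch reported above includes sequences such as the letter অ followed by the vowel sign া where the single letter আ is expected, and a doubled hasanta inside conjuncts. Purely as an illustration, a clean-up pass over already converted text could look like the Python sketch below. This is not how Padma works internally: the rule table, the function name cleanup, and the exact code points are assumptions based on how the two sample sentences appear in this message.

```python
# (bad sequence, replacement) pairs; the code points are assumptions inferred
# from the sample text, not rules taken from Padma.
FIXES = [
    ("\u0985\u09be", "\u0986"),              # letter A + sign AA -> letter AA
    ("\u09cd\u09cd", "\u09cd"),              # doubled hasanta -> single hasanta
    ("\u09c7\u0981\u09d7", "\u09cc\u0981"),  # sign E + candrabindu + AU length mark
                                             # -> sign AU + candrabindu
]


def cleanup(text):
    """Apply each replacement rule in order and return the corrected text."""
    for bad, good in FIXES:
        text = text.replace(bad, good)
    return text


fixed_ar = cleanup("\u0985\u09be\u09b0")            # the word shown as 'অার' above
fixed_conjunct = cleanup("\u09b8\u09cd\u09cd\u09a5")  # SA + doubled hasanta + THA
```

A fix inside the font mapping tables would be preferable to post-processing, since a blanket rule like these can in principle also touch text that was already correct.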
Re: [Ankur-core] Fwd: Firefox Padma :: Support for almost all Bengali News Sites (India + Bangladesh)
On Mon, Feb 16, 2009 at 7:28 AM, Omi m...@omiazad.net wrote: This is the problem I was talking about. Install Padma and go to Anandabazar's site and see how badly it works. It even takes huge resources. I'm happy with Poroshmoni. :-)) Oh, now I see your point of view as well as the actual problem. Your company's product seems to have found a competitor in Padma! It's good to have competition, especially if it is between open source and closed source. Users will benefit :-)) Cheers, Golam
[Ankur-core] Firefox Padma :: Support for almost all Bengali News Sites (India + Bangladesh)
Hi, I am about to submit support for the following Bengali news websites, for viewing in Firefox using the Padma extension ( padma.mozdev.org ), for upstream inclusion. Here is an unofficial Padma XPI file that you can install to check that it works yourself. It would be great to get your feedback, especially if you find any omission or any other issue. http://www.math.unb.ca/~ghossain/padma-0.4.14-new-bengali.xpi
***Note: If you have Padma already installed, you might need to clean up your old preferences for auto transform to work for the new websites. Open about:config in the Firefox location bar, then remove the old line Padma.autoTransformWhiteList.
Here goes the list:
Indian Sites:
===
* Anandabazar (some errors fixed) ( http://www.anandabazar.com )
* Bartaman (some errors fixed) ( http://www.bartamanpatrika.com )
* Aajkaal ( http://www.aajkaal.net )
* Uttarbanga Sambad ( http://www.uttarbangasambad.com )
Bangladeshi Sites:
===
* Prothom Alo ( http://www.prothom-alo.com )
* Amar Desh ( http://www.amardeshbd.com )
* Daily Desh Bangla ( http://www.dailydeshbangla.com )
* Amader Shomoy ( http://www.amadershomoy.com )
* Bhorer Kagoj ( http://www.bhorerkagoj.net, http://www.bangladeshnews24.com )
* Ittefaq ( http://www.ittefaq.com )
* Jai Jai Din ( http://www.jaijaidin.com )
* Naya Diganta ( http://www.dailynayadiganta.com )
* Jugantor ( http://www.jugantor.com )
* Manab Zamin ( http://www.manabzamin.net )
* Daily Dinkal ( http://www.daily-dinkal.com )
* Dainik Destiny ( http://www.dainikdestiny.com )
* Shamokal ( http://www.shamokal.com )
* Daily Sangbad ( http://www.thedailysangbad.com )
* Daily Sangram ( http://www.dailysangram.com )
Cheers, Golam -- methopath.wordpress.com
Re: [Ankur-core] Copyright Violation of Baishakhi Inscript Keyboard
Dear Prof. Basu, First of all, let me congratulate you for taking the huge step of promoting the Free, Open Source Linux OS in Bengali by releasing a localized Bengali, Ubuntu-derived Linux distribution named Baishakhi. This particular email is in response to your claim of copyright violation of the Baishakhi Inscript keyboard layout, which is one of several layouts being used in Ankur's English to Bengali dictionary project [1]. Being the maintainer of the project, I take the responsibility of responding to your claims [your email enclosed below]. I think it is better to have the terminology clarified before getting into the discussion. As far as I see, there are three aspects to it. (1) Issue of trademark (2) Copyright of layout design (3) Method of implementing the layout (1) Issue of trademark: If, as you have claimed, you own the trademark of Baishakhi, then I agree we can't use the term Baishakhi in our project. I will make sure the term is not used anymore. (2) Copyright of layout design: You have claimed that Baishakhi Inscript is "copyrighted to us". I am amazed to see your tall claim. You, in your specification, have clearly said that Baishakhi Inscript is a modified version of the Inscript layout design [2]. So, according to you, Baishakhi Inscript is a derivative of the Inscript layout design. I am sure you are aware that, according to the terms of the GPL, the original authors (in this case the Bureau of Indian Standards, Govt of India) remain the copyright holders of any derivative work. Just because you have modified some part doesn't make you an exclusive copyright holder of the design. (3) Method of implementation: You have claimed that the .klc file that is being used for Baishakhi Inscript in the Bengali Virtual Keyboard of [2] is a minimal modification of "baishakhi_inscrpt.klc prepared by us". Would you please care to show where exactly any Ubuntu version or your Baishakhi Linux uses a .klc file (Microsoft layout format) to specify any input layout?
Or would you please point out any link from which one can get the source of Baishakhi Linux, so that one can check whether the baishakhi_inscrpt.klc file exists inside it at all? The question of verifying your claim of minimal modifications comes later. Otherwise, the question of minimal modification of your .klc file seems just wishful thinking to me. Regarding the wrong mapping of ?, I think that is a bug in my implementation and not a feature. I mainly volunteer for the use of Bengali in Free OSes like Linux and I have no intention of taking the credit that you deserve. It is really painful to see your allegation, especially when you claim to promote the free software movement for Bengali. In any case, I will be happy to hear your response. [1] http://www.bengalinux.org/english-to-bengali-dictionary/ [2] http://www.nltr.org/download/About_Inscript_Keyboard.pdf NB: Given that the issue affects one of Ankur's (Bengalinux.org) projects, I am marking a CC to the core members of the group. Best Regards, Golam On Tue, Jan 20, 2009 at 6:03 AM, Ilya S. Lebedev i...@lebedev.net wrote: Hello Golam, I've received a letter from Anupam Basu, director of the Society of Natural Language Technology Research, complaining about the use of their registered keyboard layout name Baishakhi Inscript. Please rename your layout to fix this issue. Thank you. -- Forwarded -- From: Anupam Basu anupam...@gmail.com To: Ilya S. Lebedev i...@lebedev.net Date: Tue, 20 Jan 2009 11:48:10 +0530 Subject: Copyright Violation of Baishakhi Inscript Keyboard Dear Ilya, The keyboard layout Baishakhi Inscript was developed by the Society of Natural Language Technology Research and Mat3Impex. The same was released by the Hon. Minister of IT in September 2008. I have compared the two .klc files (baishakhi_inscrpt.klc prepared by us and another, bengali-baishakhi-inscript.klc, by Mr. Hossain). There are some minor changes in his .klc file. The most important thing is that in his .klc file there is no '? ' but in our .klc file it is in the normal mode of the 'w' key.
Hence, it will be proper to include our keyboard as the Baishakhi Inscript, copyrighted to us. We will send our .klc file to you. Mr. Hossain can rename his file and publish it with the minimal modification that he has done UNDER A DIFFERENT NAME. Regards, Anupam Basu
[Ankur-core] Virtual Keyboard for Bengali
Hi All, The Ankur E2B dictionary project http://www.bengalinux.org/english-to-bengali-dictionary/ now has a new browser-based Bengali Virtual Keyboard. The demo page is here: http://www.bengalinux.org/english-to-bengali-dictionary/bengali-virtual-keyboard.htm I guess this is currently the most sophisticated browser-based virtual keyboard for Bengali available under an OSI-approved license. Please try it out and let me know if you face any issues. Cheers, Golam
Re: [Ankur-core] Ankur dictionary and Bnwebtools new license
Hi all, As it appears, it's time for a change. Bnwebtools is not available anymore. So I have tentatively decided to drop bnwebtools from the Ankur dictionary project. I am playing with a few other options and expect a replacement soon. Cheers, Golam On Wed, Nov 12, 2008 at 11:17 AM, Golam Mortuza Hossain [EMAIL PROTECTED] wrote: Dear Murshed, I am writing to you regarding your bnwebtools and its new license. As you know, we use your bnwebtools in Ankur's E2B dictionary project. Frankly, your tool has been extremely useful to the project. I was considering an upgrade of bnwebtools to the latest version. However, your new license terms (v 2.1.0) made me extremely concerned about your tool. On the SourceForge project page http://sourceforge.net/projects/bnwebtools the mentioned license is GPL. On the other hand, on your site http://bnwebtools.sourceforge.net/ the mentioned license is GPL only for non-commercial use. In my understanding, these two licenses are incompatible with each other. The GPL doesn't prevent any commercial use. It seems to me that your new license also violates the SourceForge terms of service, as they allow only OSI-approved licenses for code hosting. http://alexandria.wiki.sourceforge.net/Terms+of+Use However, my main concern is that we allow distribution of the dictionary project under the GPL. This includes the database as well as the client program (CGI scripts), which currently includes your bnwebtools inside it. So if someone wants, s/he can use our dictionary project to take out the bnwebtools and use it for ANY purpose, including commercial, as per the GPL. This will bypass your intended license completely. Does it mean we cannot use it in the dictionary project anymore? It will be really helpful if you could clarify the new license. Given that this issue affects Ankur's dictionary project, I am also copying this email to Ankur-core. Thanks in advance for your response.
Regards Golam
Re: [Ankur-core] KDE 4 - Bengali Translation
On Wed, Jul 23, 2008 at 7:15 PM, Deepayan Sarkar [EMAIL PROTECTED] wrote: I don't have much information on the bn_IN part, as that has more or less completely bypassed the Ankur lists. OK! Should I then presume that one should contribute translations for KDE through the Ankur kde-list only for bn until the policy for bn_IN is spelled out? I remember having a long discussion on the bn, bn_IN issue for Gnome. Similarly, it would be good if Sayamindu, Runa, ... could point out the differences in policy for bn_IN for KDE as well. It would be helpful for those who want to contribute to both bn and bn_IN but with translations suitably modified according to their policy. I was planning to take a closer look after 4.1 was out. That's good. Please let us know your comments once 4.1 is out and whether it requires any changes to the current method of translation. Cheers, Golam
[Ankur-core] KDE 4 - Bengali Translation
Hi All, It seems the KDE 4.1 release candidate came out last week and the final release is scheduled for next week. Given that this release is going to be the first production release of the KDE 4 series, I was wondering whether some of our Ankur members have any related plans. Also, I am a bit confused about the BN and BN-IN issue. How and where should one contribute new PO file translations? Does one need to contribute a separate translation (especially if it doesn't have any country-specific issues)? It would be helpful if Deepayan, Runa, and others could shed some light on these issues. Thanks and Cheers, Golam
[Ankur-core] Why Should You Switch to Unicode?
Hi, We are all aware that many local language websites still continue to use non-standard encodings. In a related discussion, Sankarshan made an interesting suggestion of compiling a set of arguments favoring the use of Unicode. The arguments should be aimed at the people who run websites with local language content (mainly in the Indian sub-continent) using dynamic font technology instead of standard Unicode. Following Sankarshan's suggestion, I am putting forward an initial set of arguments (from my own perspective). Please add your arguments and/or modify/change/delete the arguments below. We are planning to put up the final version on the Ankur website. These arguments can be used as a handy reference to persuade people to switch to Unicode.
==
Why Should You Switch to Unicode?
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
By switching to Unicode you can...
(*) Instantly expand your current user base
Unicode is now supported at a native level in all major operating systems, including Windows Vista, XP, Linux, Mac, and BSD. On the other hand, dynamic font technology is supported only in a subset of OSs. Thus, switching to Unicode not only allows you to retain your current users but also allows you to expand your user base immediately.
(*) Make your content searchable in search engines like Google
It is extremely likely that a major fraction of your users reach your website via search engine results. Switching to Unicode makes your local language content searchable in search engines like Google. This will bring additional web traffic to your website. Consequently, it could lead to additional search-engine generated revenue for you. The number of Internet users in the Indian subcontinent is going to increase in the future. A significant portion of new Internet users will come from the interior parts of the countries. Most of these users will be more comfortable searching the Internet using keywords in their own language.
(*) Save future hassles of converting your legacy contents - By continuing the use of non-standard encoding, you will be piling up your archive with non-standard contents which would require a major effort in your part to convert them into standard encoding. You can save future hassles of conversion by switching to Unicode sooner than later. == Cheers Golam -- http://gravity.psu.edu/~hossain/ - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Bengalinux-core mailing list Bengalinux-core@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bengalinux-core
Re: [Ankur-core] Bangla OCR progress
On Wed, Jul 2, 2008 at 9:32 AM, Sayamindu Dasgupta [EMAIL PROTECTED] wrote:
> This guy seems to be doing some interesting progress for a Bangla OCR - or more precisely, enabling Bangla in Tesseract.
> http://debayanin.googlepages.com/hackingtesseract
>
> Yes, it looks definitely interesting. Looks like he needs some more training data - can we provide him with some?

If I remember correctly, there was a sample file for testing the completeness of Bengali fonts. Since it has all the letters and conjuncts typed in, the file might be useful for training Tesseract as well.

Deepayan should be able to give some input here. He has working experience with R and may have some training samples as well.

Cheers,
Golam
--
http://gravity.psu.edu/~hossain/
Re: [Ankur-core] attn: ankur/bengali-linux team, hossain saheb
Dear Dr Agarwala,

> I am grateful to the ankur online dictionary project. it was a great help to me.

It's my pleasure to accept your compliment on behalf of the Ankurians and the dictionary project.

> = interface (I have thought and coined a word 'antarprishto'. is that ok?)

If I am not wrong, Ankur members have in the past mostly used ইন্টারফেস (interface) for this word.

> = cascade (I have thought 'propater moto sajan'. is that ok?)
> = tile horizontally (I have thought 'anubhoomik bhabe sajan'. is that ok?)
> = tile vertically (I have thought 'ullamba bhabe sajan'. is that ok?)

These seem fine to me.

> = check for updates (I have thought 'sampratikikaran karun'. is that ok?)

Here, I would prefer to keep the English word "update".

> Regular Expressions or Wildcards are terms with very specific meanings. So unless there is a Bengali equivalent, it may be best to simply transliterate the terms.
>
> but, bengali being such a rich language, I would be interested in accurately translating these IT terms.

That would be interesting! However, in my experience, people have varied opinions in this regard.

Best Regards,
Golam
Re: [Ankur-core] Ankur E2B dictionary Reloaded: Trial launch
On Nov 11, 2007 2:28 PM, Sajed Chowdhury [EMAIL PROTECTED] wrote:
> Hi Golam,
> It's great to see you continue your good work in machine translation. I was wondering, just how big is the word list? Is it available as sql dump or xml?

The number of entries in the list is around 12k, but the number of distinct words is less than that. Yes, it should be available as an SQL dump, though currently it's in a mess. I expect this transition to settle down within a week. After that I will put up a link for downloading the SQL dump.

> Since AJAX is all the rage now, have you considered implementing a Google Suggest-like functionality?

Not yet! My priority at present is to increase the size of the database.

Cheers,
Golam

> Maybe not with the entire word list, but say, top X words, like Google Suggest shows from top X search terms. Also, a natural progression would be to turn the web page into an iGoogle module. I'm not sure this is valuable feedback, but anyways there it is!
> thanks,
> -Sajed
[Ankur-core] Ankur E2B dictionary Reloaded: Trial launch
Hi All,

I am happy to announce the trial launch of Ankur's new E2B dictionary interface with a MySQL backend. The new user interface uses CSS extensively and is available at

http://www.bengalinux.org/cgi-bin/abhidhan/new/index.pl

This is a trial launch, so please feel free to try anything you wish. Please let me know your suggestions and/or any bugs you find.

As Dipankar-da suggested, it aims to provide a sort of controlled wiki interface. Every user (registered or not) can contribute, but only the decisions of users with the rank of 'EDITOR' determine what makes it into the main dictionary. A full example of the new interface can be seen here:

http://www.bengalinux.org/cgi-bin/abhidhan/new/index.pl?en_word=critical

I have merged the database of GPoSTTL with it (not in the trial run). This means the interface knows almost all English words, so it can tell a user whether s/he has made a possible spelling error.

After this trial run (hopefully within the next few days), the old interface will be replaced by the new one.

BTW, I have also slightly modified its logo :)

Cheers,
Golam
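The spelling hint described in the message above can be pictured roughly as follows. This is only a minimal sketch, not the actual Perl code behind index.pl: the function name, the two word sets and the three result labels are all hypothetical, and the merged GPoSTTL data is stood in for by a plain Python set.

```python
def classify_word(word, translated, known_english):
    """Classify a lookup against two word sets (both hypothetical stand-ins).

    translated    -- English words that already have a Bengali entry
    known_english -- a large English word list (standing in for GPoSTTL)
    """
    key = word.strip().lower()
    if key in translated:
        return "translated"
    if key in known_english:
        # A real English word that simply has no Bengali meaning yet.
        return "untranslated"
    # Not in either list, so hint at a possible typo.
    return "possible spelling error"


# Tiny illustrative data, not taken from the real database.
translated = {"critical", "interface"}
known_english = translated | {"cascade", "tile"}

for w in ("Critical", "cascade", "critcal"):
    print(w, "->", classify_word(w, translated, known_english))
```

The point of keeping the two sets separate is that a missing translation and a misspelt word call for different messages to the user.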
Re: [Ankur-core] West Bengal govt and free software
On 6/13/07, Sankarshan Mukhopadhyay [EMAIL PROTECTED] wrote:
>> http://www.anandabazar.com/9bus2.htm
>
> That URL returns a 404.

Hmm!! I posted this on 9th June and the URL was valid then. Anyway, this is from their archive:

http://www.anandabazar.com/archive/1070609/9bus2.htm

> However are you looking for specific information or a general overview of what's happening related to:

Just a general overview!!

Thanks,
Golam
[Ankur-core] West Bengal govt and free software
Hi all,

According to

http://www.anandabazar.com/9bus2.htm

the West Bengal government is moving to free software for its e-governance project. Any thoughts on this?

Cheers,
Golam
Re: [Ankur-core] Ankur's English to Bengali Dictionary
On 5/14/07, Sankarshan Mukhopadhyay wrote:
>> I have updated the interface of Ankur's E2B dictionary with Murshed's really nice BN Web tool. So now people can contribute new Bengali word without changing his/her keyboard layout and it should just work :-)
>> http://www.bengalinux.org/cgi-bin/abhidhan/bdict.pl
>> So please feel free to test it and post any suggestions that you may have.
>
> I have a single query: what are the things that you think are required to enhance, augment and make your effort more rich ?

In my opinion, there are basically two fronts that require effort.

1) The first and most crucial one is to have more contributors. Say, as a first target, we want a dictionary with 25,000 words (nearly the same number of definitions as the Pocket Oxford dictionary). For that, we need roughly 50,000 entries in our database (each part of speech is counted separately). Currently we have 11,000+ entries. So roughly 100 entries a day would get us to this target in about a year. On the other hand, the current rate of roughly 10 entries a day would take about ten years!! BTW, during Jamil-bhai's localization camp, contributions went up to a few hundred a day :-)

2) The second aspect is technical. As suggested by Dipankar-da, we can have a richer database. For example, for each entry (English word), apart from the current fields of Bengali meaning and part of speech, we can have more fields such as Explanation (ব্যাখ্যা), Usage, Original Contributor, Last Editor, etc. Currently, the database is in plain text. However, I plan to switch to a good database back-end in due course.

These are my current thoughts on it, but I try to keep myself open to new ideas!!

Golam
---
http://www.gravity.psu.edu/~hossain/
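The contribution-rate estimate in point 1 above can be checked with a few lines of arithmetic. A minimal sketch, assuming a constant daily rate and taking "11,000+" as exactly 11,000; the function name is made up for illustration.

```python
import math

TARGET_ENTRIES = 50_000   # target quoted in the message
CURRENT_ENTRIES = 11_000  # "11,000+" in the message, rounded down here


def days_to_target(entries_per_day, target=TARGET_ENTRIES, current=CURRENT_ENTRIES):
    """Days needed to close the gap at a constant contribution rate."""
    if entries_per_day <= 0:
        raise ValueError("entries_per_day must be positive")
    return math.ceil((target - current) / entries_per_day)


for rate in (10, 100):
    days = days_to_target(rate)
    print(f"{rate} entries/day: {days} days (~{days / 365:.1f} years)")
```

With these assumptions the gap is 39,000 entries, which works out to 390 days (about 1.1 years) at 100 entries a day and 3,900 days (about 10.7 years) at 10 a day, in line with the "a year" and "ten years" figures quoted.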
[Ankur-core] Revamped interface for online PO Anubadok
Hi all,

I have updated the interface of the online PO file translator.

http://www.bengalinux.org/cgi-bin/anubadok/po/index.pl

The interface is now capable of handling header information. It can also now manipulate the translator's name and email id inside the PO catalog. The new interface sets a cookie to preserve the translator's information. This, of course, means you won't have to type your name etc. every time you use the interface.

As usual, please feel free to test it and post your suggestions, comments or bug reports.

Cheers,
Golam
--
http://www.gravity.psu.edu/~hossain/
Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat
> [`] ZWJ
> [\] ZWNJ

+1

I agree with the reasoning of Dipankarda and Runa regarding hasanta.

Cheers,
Golam
Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat
On 4/25/07, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> BTW, why is hasanta / and not \ ?

Since usability is the main issue, I also agree with Deepayan on the relocation of 'hasant'. [ \ ] intuitively looks more like the actual hasanta than the current [ / ]. Moreover, it seems that people want both ZWJ and ZWNJ on non-shift characters. So I suggest

[ ` ] = ZWJ (like a roof... both under the same roof... JOINER)
[ / ] = ZWNJ (like a wall... a wall between them... NON-JOINER)
[ \ ] = HASANTA (closely resembles the actual hasanta...) :-)

As this could be a major upgrade for the Probhat layout, it would be nice if people posted any other concerns, comments or usability issues they might have!!

Cheers,
Golam
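For reference, the three characters in the suggestion above correspond to the code points shown in this small table. It is only an illustrative sketch of the proposal as a key-to-character mapping, not the actual Probhat layout file; the dictionary and function names are made up.

```python
import unicodedata

# Proposed unshifted key assignments from the message above (illustrative only).
PROPOSED_KEYS = {
    "`": "\u200d",   # ZWJ, "both under the same roof"
    "/": "\u200c",   # ZWNJ, "a wall between them"
    "\\": "\u09cd",  # hasanta (named BENGALI SIGN VIRAMA in Unicode)
}


def describe(mapping):
    """Return {key: (code point, official Unicode name)} for a key mapping."""
    return {
        key: (f"U+{ord(ch):04X}", unicodedata.name(ch))
        for key, ch in mapping.items()
    }


for key, (codepoint, name) in describe(PROPOSED_KEYS).items():
    print(f"[ {key} ] -> {codepoint} {name}")
```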
Re: [Ankur-core] Ankur's English to Bengali Dictionary
On 5/6/07, das [EMAIL PROTECTED] wrote:
> 3. I usually use Probhat, with my Ubuntu keyboard set like this, 'Ctrl-Shift' changes the keyboard from English to Bangla or Bangla to English. Within the Bangla definition itself, at times, one may have to write some English characters or words, but, I don't know, it maybe my OS behaving that way, but within the text field my 'Ctrl-Shift' is not working. I cannot go to English within the field. Can this be changed?

Please press [ESC]! It's written in the 'help' near the typing area!!

Cheers,
Golam
[Ankur-core] Ankur's English to Bengali Dictionary
Hi all,

I have updated the interface of Ankur's E2B dictionary with Murshed's really nice BN Web tool. So now people can contribute new Bengali words without changing their keyboard layout, and it should just work :-)

http://www.bengalinux.org/cgi-bin/abhidhan/bdict.pl

So please feel free to test it and post any suggestions that you may have.

In April 2007, the Ankur E2B dictionary reached five-digit hits per month (11,927) for the first time. Of late, Google has ranked it very highly for relevant terms, and of course the Google ranking is the main reason for the surge.

Now, I would like to put an issue up for discussion. Would using Google AdSense make any sense? What are the main moral/legal issues for or against it? Please feel free to share your frank opinions!!

Cheers,
Golam
Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat
On 4/25/07, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> ZWJ is no longer of any use (it was only used for khanda ta; Bengali doesn't have the concept of half-consonants, no matter what some people think). It should be removed if it was there.

It seems I can't write র্য (RA-Japhala) without using ZWJ on my feisty box. Can someone shed some light on the latest convention for the Reph/Ra-Japhala typing issue?

Thanks in advance.

Cheers,
Golam
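To make the question above concrete, here are the two code point sequences involved, as I understand the Bengali section of the Unicode Standard: RA + hasanta + YA is meant to render as reph over YA, while RA + ZWJ + hasanta + YA is the sequence given for RA with ya-phala. Treat this as a sketch of the encoding only; whether a given font and rendering engine of that period honoured it is a separate matter, and the variable names are made up.

```python
RA = "\u09b0"       # BENGALI LETTER RA
YA = "\u09af"       # BENGALI LETTER YA
HASANTA = "\u09cd"  # BENGALI SIGN VIRAMA
ZWJ = "\u200d"      # ZERO WIDTH JOINER

# Same letters, different intended rendering:
reph_on_ya = RA + HASANTA + YA              # reph placed over YA
ra_with_yaphala = RA + ZWJ + HASANTA + YA   # RA followed by ya-phala


def codepoints(text):
    """List the code points of a string in U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


print("reph on ya:      ", codepoints(reph_on_ya))
print("ra with ya-phala:", codepoints(ra_with_yaphala))
```

Under this reading, ZWJ still has a job in exactly the case the message describes, even though khanda ta no longer needs it.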
Re: [Ankur-core] Anubadok online: PO file translations
On 3/7/07, mak [EMAIL PROTECTED] wrote:
> So until temporary file storing is implemented, will it take long?

Fortunately not. It is FIXED. So now you can upload a partially translated file and download the complete file, instead of only the untranslated messages of the original file.

Cheers,
Golam
[Ankur-core] Anubadok online: PO file translations
Hi,

With due thanks to all who have made suggestions for an online interface for PO file translations, I am happy to announce the test launch :-) of Anubadok's PO file translation interface. So please feel free to hammer it with PO files and me with bugs :-)

http://www.bengalinux.org/cgi-bin/anubadok/po/index.pl

Please note the extra /po in the URL. The interface is very basic right now, so feel free to throw in your suggestions for future improvements.

Cheers,
Golam

On 3/4/07, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> This reminds me of something I have been meaning to ask: would it be hard to set up a server (similar to the current Anubadok Online interface) that translates uploaded POT files? It could be very helpful, especially for new translators.
>
> Deepayan
Re: [Ankur-core] [Ankur-kde-translation] translation
Hi Subhodip,

On 3/4/07, Subhodip Biswas [EMAIL PROTECTED] wrote:
> .if possible you can visit this site www.dgplug.org.

Hmmm!! I just visited the website. Frankly, it's really heartening to see the activity of the Durgapur Linux users. Keep it up!!

Cheers,
Golam
[Ankur-core] Fwd: [Padma] Release 0.4.10
...so it's official now!! Anandabazar Patrika is now readable in Firefox with the Padma extension.

Please note that an update from within the browser (if you already have the extension) may still point to version 0.4.9 for the next couple of days. Meanwhile, you can get it directly from the Padma website.

Cheers,
Golam

-- Forwarded message --
From: Nagarjuna Venna [EMAIL PROTECTED]
Date: Dec 19, 2006 10:52 AM
Subject: [Padma] Release 0.4.10
To: [EMAIL PROTECTED]

...is now available from http://padma.mozdev.org. The highlights of this release are:

1. Support for fonts used by Ananda Bazar Patrika (Bengali)
2. Support for MLBW-TTRevathi and Kalakaumudi fonts (Malayalam)

ABP support has some known issues but works pretty well. Many thanks to Golam and Anvar for contributing the code.

Regards,
Nagarjuna
___
Padma mailing list
[EMAIL PROTECTED]
http://mozdev.org/mailman/listinfo/padma
[Ankur-core] Ankur's English to Bengali dictionary:: Usage statistics
Hi,

I am happy to share with you some page-access statistics for Ankur's E2B dictionary :-) The traffic has increased reasonably fast in the last few months. For a detailed summary, please visit the (static) page

http://gravity.psu.edu/~hossain/bdict-report.htm

Since its launch, users have contributed more than 1800 entries to the database. More than a thousand have already made it to the main dictionary. I hope to clear the backlog soon.

Interestingly, almost 2/3 of the contributions have come through the Anubadok interface. It seems to me that people prefer to contribute more if their contributions are immediately visible :-) So I have upgraded the dictionary interface. Now contributions made through the dictionary interface are also immediately accessible (optionally).

Jamilbhai, your online PO file translation interface is on its way :-) However, I need (and hope) to find some time for upgrading the Anubadok backend :-(

If you have any suggestions or comments, please feel free to post them.

Cheers,
Golam
Re: [Ankur-core] Anubadok frontend
On Wed, Oct 18, 2006 at 05:05:47PM +0530, Kushal Das wrote:
> Wrote a very small PyGTK based frontend for anubadok.
> http://freeshell.in/~kushal/eng2beng-0.1.tar.gz

Hmm!! Screenshot? :-)

Thanks!
Golam
[Ankur-core] Bengalinux.org
Hi,

It seems something is not right with the domain name www.bengalinux.org. For some reason it is currently not working, although direct access to Ankur's SourceForge pages seems to be working fine. Any clue?

Golam
[Ankur-core] Padma for Anandabazar
Hi!

It's almost ready!! Nagarjuna (Padma's owner) has made the necessary modifications in Padma to accommodate Bengali, and it should be available from its official site soon. Meanwhile, you can get it from

http://www.imsc.res.in/~golam/padma-with-bn.xpi

However, please read this first :-)

I am asking you to test the above for two reasons. Firstly, you can let me know if I have missed any conjuncts or if something else is wrong! Secondly, and crucially, there is a rendering issue, especially for ABP, and I am hoping an expert on Firefox/Unicode/JavaScript can offer me a solution for it.

In my understanding, the problem is the following. It seems ABP sometimes uses three different fonts to construct even a single word. The first font provides all the letters and some conjuncts; the second and third provide mainly conjuncts. As a result, the matra for conjuncts (from the 2nd and 3rd fonts) does not appear properly. It seems to me this happens because of the presence of <font>...</font> tags inside a single word. They essentially separate the matras from the letter and so block proper rendering!!

So... please feel free to post your comments and suggestions!!

Cheers
Golam
--
http://www.gravity.psu.edu/~hossain/
Re: [Ankur-core] Anubadok Online
On 9/25/06, mak [EMAIL PROTECTED] wrote:
> It's good to see the dream taking shape towards being true :)

:-)

> <msg> The amount of text entered is more than allowed at a time </msg>
> what is the allowed limit ?

Right now it is set to just 500 characters, but there is no technical reason for that. It would be good if you could suggest some other number!

Cheers,
Re: [Ankur-core] Anubadok Online
On 9/25/06, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> 500 is too little. Isn't there some constraint imposed by the tagger (or was that the old non-free one)?

Yep!! TreeTagger had a constraint on the maximum amount of data it could process at a time, but that too was quite high. I think it was on the order of megabytes of text data (XML-ed, so the actual text is less than that)!

So what's your suggestion? Should it be roughly equal to a few paragraphs?

Thanks
Golam
Re: [Ankur-core] Anubadok Online
> Well, larger is always better.

OK, it's increased by an order of magnitude, i.e. it's now 5000 characters :-) I may increase it further once it becomes stable.

Golam
[Ankur-core] Anubadok Online
Hi,

Yes! It's online now :-) Thanks to Deepayan, MAK, Jamilbhai and others for sharing their ideas and dreams!

http://www.bengalinux.org/cgi-bin/anubadok/index.pl

As usual, there will probably be some new bugs, crashes, etc., so it would be great to get your feedback. Also, this is a kind of test run of the online interface, so if you have other ideas or dreams, feel free to share those as well!

Right now, the online interface cannot handle translation from something like the new_words.list that is available in the offline client. However, that's on my TODO list and should be available in the near future. Then this interface will also contribute to Ankur's English to Bengali dictionary :-)

Cheers,
Golam
Re: [Ankur-core] khanda-t in ODT Varying with OS
On 9/19/06, dipankar das [EMAIL PROTECTED] wrote:
> The 'Shift-8' keystroke for 'khanda-ta' is not working at all in Ubuntu 6.06 for 'Solaiman-Lipi'.

I can confirm this. I did not check it for other fonts, though. But it works fine with the Ekushey fonts. So maybe the version of SolaimanLipi that we have is not yet updated for Khanda-ta.

Golam
Re: [Ankur-core] Indic Printing in Mozilla
On 9/9/06, Sayamindu Dasgupta [EMAIL PROTECTED] wrote:
> It works!!
> http://sayamindu.randomink.org/ramblings/2006/09/09/indic-printing-it-works-in-mozilla/

:-) That's really cool! Sayam, do you expect it will make it into the mainstream soon? I mean, compiled with those config options that you have mentioned.

Cheers
Golam
Re: [Ankur-core] Bengali support in Padma extention?
On 9/8/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> May I know the list of supported fonts?

:-) Just having support for Anandabazar Patrika suffices for my current need :-)). It seems ABP itself uses three fonts:

Aabpbengali
Aabpbengalix
Aabpbengalixx

So the above three will definitely be supported :-).

The way Padma works, you can add as many mappings (fonts) as you wish. Since I am mostly done with the base files for Bengali, adding support for another font would just mean adding a new file containing the mapping for that font, like

yourfont.vowel_A = \u00xx
yourfont.vowel_AA = \u00xy

and you are done with your font!! You can also add a little script to it for some font-specific preprocessing, if needed.

Cheers,
Golam
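The per-font mapping idea described above can be pictured with a short sketch. Padma itself is a JavaScript browser extension, so this Python snippet is not its code; the font name, the mapping table and the legacy code values below are all invented for illustration (the message itself leaves the real values elided as \u00xx).

```python
# Hypothetical mapping for an imaginary legacy font: the keys are the code
# points the font (ab)uses, the values are the real Bengali characters.
# These pairs are made up and do not describe Aabpbengali or any real font.
HYPOTHETICAL_FONT_MAP = {
    "\u0041": "\u0985",  # "vowel_A"   -> BENGALI LETTER A
    "\u0042": "\u0986",  # "vowel_AA"  -> BENGALI LETTER AA
    "\u004b": "\u0995",  # "consnt_KA" -> BENGALI LETTER KA
}


def to_unicode(text, font_map):
    """Replace each legacy-font character with its Unicode equivalent.

    Characters missing from the map are passed through unchanged.
    """
    return "".join(font_map.get(ch, ch) for ch in text)


print(to_unicode("KAB", HYPOTHETICAL_FONT_MAP))
```

A one-character-at-a-time lookup like this is the simplest case; the font-specific preprocessing mentioned in the message would be needed wherever a legacy font stores letters or signs in a different order than Unicode does.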
Re: [Ankur-core] Bengali support in Padma extention?
Do you have any idea what the character inside the marked region of this screenshot is?

http://www.imsc.res.in/~golam/abp-firefox1.jpg

Padma seems to be working great for reading Anandabazar Patrika in Firefox. I have yet to enter all the conjuncts, but it's already mostly readable.

Cheers,
Golam
Re: [Ankur-core] Bengali support in Padma extention?
On 9/8/06, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> On 9/8/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
>> Do you have any idea what is the character inside the marked region of the screenshot
>> http://www.imsc.res.in/~golam/abp-firefox1.jpg
>
> Could be 00DE or 00FE

Thanks a lot! It was 00FE. I could get rid of it rather easily. Now it looks really clean :-)

Golam
Re: [Ankur-core] Bengali support in Padma extention?
On 9/6/06, Omi Azad [EMAIL PROTECTED] wrote:
> If you want to go, I can give you the database.

Good!! But do you have it in a form like

Aabpbengali.consnt_KA = \u00XX ?

Meanwhile, I could get it to show Bengali script through an ugly hack! I substituted the Devanagari codepoints with Bengali ones and face=Jagran with Aabpbengali :-).

Tanmoy: if you are reading this, can you please tell me an easy way to convert your aabpbengali.pm to something like the above?

Thanks
Golam
[Ankur-core] Bengali support in Padma extention?
Hi,
Is anyone working on adding Bengali support (at least ABP :-)) to the Padma Mozilla plugin? padma.mozdev.org (Essentially, it converts XX -> Unicode on the fly.)
Since we already know the mapping (thanks to Tanmoy's converter script), it shouldn't be very difficult to add bn support, I guess!! Any other thoughts?
Cheers, Golam
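A minimal sketch of the kind of table-driven conversion described above, assuming the mapping is available as a plain lookup table. The table entries here are made-up placeholders, not the real Aabpbengali font encoding, and the function name `convert` is hypothetical; Padma itself is not written this way as far as this thread shows.

```python
# Hypothetical glyph table. The keys are placeholder code points and do NOT
# reflect the real Aabpbengali encoding; only the Unicode values are real.
LEGACY_TO_UNICODE = {
    "\u00a1": "\u0995",                    # placeholder -> BENGALI LETTER KA
    "\u00a2": "\u0996",                    # placeholder -> BENGALI LETTER KHA
    "\u00a1\u00a3": "\u0995\u09cd\u09b7",  # placeholder pair -> KA + VIRAMA + SSA
}


def convert(text, table=LEGACY_TO_UNICODE):
    """Convert legacy-font text to Unicode, trying the longest key first."""
    longest = max(len(key) for key in table)
    out = []
    i = 0
    while i < len(text):
        for size in range(longest, 0, -1):
            chunk = text[i:i + size]
            if chunk in table:
                out.append(table[chunk])
                i += len(chunk)
                break
        else:
            # No table entry: pass the character through unchanged.
            out.append(text[i])
            i += 1
    return "".join(out)
```

Trying the longest key first matters because conjuncts are usually encoded as multi-glyph sequences in legacy fonts, and they must win over the single-glyph entries they start with.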
Re: [Ankur-core] Bengali support in Padma extention?
On 9/6/06, Omi Azad [EMAIL PROTECTED] wrote:
> Did you see that the plug-in makes FireFox freeze for a while if you visit any corresponding site?
I just tried www.jagran.com (Dainik Jagaran) with the latest 0.4.7. It seems fine; I couldn't see any appreciable freeze.
Cheers, Golam
Re: [Ankur-core] Anubadok
On 8/18/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> I am very bad in spelling... :-P
:-)) OK! It's updated now (except for words with Khanda Ta and some spellings that I couldn't confirm myself without a dictionary :-))
Hello SM M M! Can you please check whether your contributions are updated now?
Cheers, Golam
Re: [Ankur-core] Anubadok
On 8/16/06, S M Mahbub Murshed [EMAIL PROTECTED] wrote:
> Hi, How long does it take to update a word? I submitted some words two/three days ago but its still missing.
First of all, thank you for your contributions to the E2B dictionary! Ideally it shouldn't take more than a day or two. However, it hasn't been updated for the last two weeks, because I currently lack access to a machine with proper Bengali support ;-( I am hoping to have one soon. Sorry for the delay! I hope you will continue contributing.
Meanwhile, Jamil-bhai, can you help me out by going through (editing, spell...) the current bdict.contrib.db? You can send me the final file; I can merge it without needing to view the entries myself.
Thanks and cheers, Golam
Re: [Ankur-core] Contributions to online English to Bengali Dictionary at Ankur
On 7/30/06, mak [EMAIL PROTECTED] wrote:
> Done... expecting সমালোচনা (criticism) ;)
Thanks, MAK, for the posting. It's a good one!
Cheers
Re: [Ankur-core] Contributions to online English to Bengali Dictionary at Ankur
So there are three suggestions!! We can go for all of them!
(1) Jamil :: I think we can also announce it to other mailing lists
Jamil bhai, can you please do so? :-)
(2) Deepayan :: You might also consider submitting a story about it at http://bn.shunya.in
Maybe MAK will be able to help us here?
(3) Anirban :: May i suggest adding an entry to internet directory of Google.
I have just submitted it to dmoz.org for inclusion.
Cheers, Golam
Re: [Ankur-core] Ankur's Dictionary project
On 6/16/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> IIRC we had 3 PO files containing the common technical terms. Are those terms included in Anubadok's database? If not, can you add them soon?
All three of them are now added and updated! commonwords.po was already added last year.
> Great! I will take the source and setup before next camp.
I will be off to my home till mid-July with a somewhat erratic net connection :-) However, you can download it from here: http://www.imsc.res.in/~golam/abhidhan-0.1.tar.bz2
It has the latest database, with installation instructions for the English to Bengali dictionary.
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] Ankur's Dictionary project
On 6/18/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> Can you install it at http://www.bengalinux.org/cgi-bin/ ? :-)
Yeah! But I don't have access to the site :-). Anyway, it's not difficult, so please feel free to go ahead and install it there, at least for testing :-)
Cheers, Golam
Re: [Ankur-core] Ankur's Dictionary project
On 6/15/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> IIRC we had 3 PO files containing the common technical terms. Are those terms included in Anubadok's database? If not, can you add them soon?
As far as I remember, Progga had sent me one, Ankur's commonwords.po. In any case, please send me those files. I will surely add them asap if they have new entries in them.
> Another issue. While searching, if any word is not found, can you put an add or suggest button? So people can submit translations. Those will be in a separate db. You can verify and merge them with the original Anubadok db. :)
Hmm.. that's a great idea. I will try to implement it by tonight. Hopefully, I will learn a little bit of HTTP POST as well :-)
Yeah!! Definitely! A localization camp like the one you, MAK, Munir and others organized at IICT, BUET can greatly contribute to Ankur's dictionary project as well, along with PO translations.
Cheers, Golam
-- http://www.imsc.res.in/~golam/
[Ankur-core] [annunce] Anubadok 0.1 beta: First official release
Hi all,
As usual, it's available from http://www.imsc.res.in/~golam/anubadok/ Alternatively, you may try Google(Anubadok) or even Google(অনুবাদক) :-)
Here goes the release note...
It started a little more than a year ago. Within this time-frame, and given my own constraints and limitations, it is heartening to announce its first official release. A few key features of the first official release:
* For the first time, the number of entries in Anubadok's English-to-Bengali dictionary has reached five figures. The number currently stands at 10,128.
* Anubadok now completely supports the free gposttl tagger along with the restricted treetagger. Further, it includes some error-correcting code for some known tagging errors of gposttl. This also means that one can expect slightly higher translation accuracy with gposttl than with treetagger.
* Anubadok now has an improved proper noun handling mechanism. For example, it can now recognise a pattern like Bay of Bengal as a single proper noun and will translate it as bangopsagor instead of banglar upsagor. Although it can recognise such a pattern, for the translation to proceed it needs a corresponding entry (bay.of.bengal) in its E2B dictionary. Otherwise, Anubadok will use a fall-back mechanism and will translate the phrase as usual. Nevertheless, it will report the missing entry through new_words.list.
* Documentation has been slightly improved, though it needs more work.
* English sentence splitter for complex sentences: this has been lacking ever since Anubadok was born, but recently I have started working on it. Version 0.1 itself has some code for it. This will be the main thrust area for anubadok-0.2, which is available as the cvs version.
* Anubadok-0.1 does not have any dedicated program for Wikipedia translation, but one can use the anubadok or english2bangla scripts for generic translations. However, anubadok-0.2-cvs now includes wiki_anubadok for translation of wiki text (a script, wikiget, is included in the package for fetching wiki articles in text format; you just need to give the title of an English article).
Lastly, I thought of drawing up my own balance sheet of how much I could and couldn't do in the last year, mainly using the pre-release versions of Anubadok (it's also sort of beating my own drum :-)). In the last few months, I have translated more than six thousand PO strings in KDE. I am sure I would have come nowhere near that without using it.
So to conclude, though Anubadok still has a long way to go, its current performance, given the amount of code in it, is certainly encouraging for the future of Bengali machine translation.
Cheers, Golam
-- http://www.imsc.res.in/~golam/
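A rough sketch of the proper noun lookup described in the release note: join the words of the phrase with dots, look the key up in the E2B dictionary, and otherwise fall back and record the missing key. The names `E2B`, `translate_proper_noun` and the in-memory `new_words` list are hypothetical stand-ins (Anubadok writes to new_words.list), the Bengali values are romanised placeholders, and the word-by-word fallback is only a crude substitute for Anubadok's real fall-back translation.

```python
# Toy dictionary with romanised placeholder translations (assumption).
E2B = {
    "bay.of.bengal": "bangopsagor",
    "bay": "upsagor",
}


def translate_proper_noun(phrase, dictionary, new_words):
    """Translate a multi-word proper noun as one unit when possible."""
    words = phrase.lower().split()
    key = ".".join(words)  # "Bay of Bengal" -> "bay.of.bengal"
    if key in dictionary:
        return dictionary[key]
    # Missing entry: remember the key so it can be added to the dictionary.
    new_words.append(key)
    # Stand-in fallback: translate word by word, keeping unknown words as-is.
    return " ".join(dictionary.get(word, word) for word in words)
```

With this toy dictionary, "Bay of Bengal" resolves through the bay.of.bengal entry, while a phrase without an entry falls through to the word-by-word path and its key is recorded for later editing.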
Re: [Ankur-core] Fwd: http://bn.shunya.in - call for Bangla contributions and moderators
Hi Baishampayan!
If no one objects, I would like to propose your name as a moderator of bn.shunya.in. You can begin after mid-June. Hopefully, by then your academic workload will have come down to normal :-)
Hey! Congratulations on being selected for Google SOC 2006! Your project is indeed a very nice and cool one. I think it applies not only to PCs that are not connected to the Internet but also to those that ARE connected, but only through dial-up. So I hope you will give us something hot in the near future!
Cheers, Golam
-- Forwarded message --
From: आलोक कुमार [EMAIL PROTECTED]
Date: May 31, 2006 5:18 PM
Subject: http://bn.shunya.in - call for Bangla contributions and moderators
To: [EMAIL PROTECTED]
Hello bengalinux users,
http://bn.shunya.in
It doesn't exist yet, but wants to :) Shunya is a tech news site similar to slashdot, but India-centric. There's already a Hindi version, http://hi.shunya.in, up and running. This would be a good platform to publish free software news, related to bengalinux and other projects, *in Bangla*. The Bangla version would be independent of and parallel to the English version (http://en.shunya.in) and therefore would also help in attracting more contributors to Bangla l10n and i18n projects, as well as increase the Bangla user base.
The Shunya webmaster is looking for moderators who could volunteer to moderate the Bangla stories - it's not much work - just five minutes or so every day at three hour intervals to look up and clear the moderation queue. And the only qualifications for this job are good Bangla and a nose for tech news.
More details are available at the http://en.shunya.in website and the webmaster is open to suggestions - in fact, thrives on them. You may reply to this email or send feedback http://en.shunya.in/account/feedback directly to the webmaster to know more. Your suggestions and comments are welcome. Looking forward to an active and vibrant Bangla users community on শূন্য় .
অলক
-- Can't see Hindi? http://devanaagarii.net
-- Baishampayan Ghose b.ghose at gmail.com 1024D/86361B74 BB2C E244 15AD 05C5 523A 90E7 4249 3494 8636 1B74
Re: [Ankur-core] Article on FOSS in Bangla
Hi Omi,
It's a nice article, and with quite a provocative title! Did you do the typesetting yourself, or did they do it? Did you use OO or something else for your own typing?
Cheers, Golam
On 5/24/06, Omi Azad [EMAIL PROTECTED] wrote:
> Hello All, Recently I wrote about FOSS in Weekly Ekattor Magazine. If you are interested to know about FOSS, you can read the article by downloading the PDF from http://www.ekushey.org/files/ICT_Report-FOSS-9th.pdf . Please note that the article is in Bangla. :) Thanks
> -- Omi Azad, Contributor, Bangla Computing and Localization Projects: Ankur: http://www.ankurbangla.org Ekushey: http://www.ekushey.org
[Ankur-core] GPoSTTL - an open-source alternative for TreeTagger
Hi,
This is an announcement of the 'official' release of GPoSTTL. GPoSTTL has been developed as an open-source alternative to TreeTagger, a Penn Treebank parts-of-speech tagger and a crucial component of Anubadok. With this release, Anubadok's non-GPL dependency is OPTIONAL from now on. I should thank Progga for the FreeBSD patches and Prof. Y. Someya for permitting the use of his lemma list.
GPoSTTL is available from http://www.imsc.res.in/~golam/gposttl/
GPoSTTL is derived from Brill's tagger, which is a rule-based tagger, unlike TreeTagger, which is a statistical tagger. Because it is open source, it is quite easy to add new verbs like "blog" to its database; that is not possible with TreeTagger. Naturally, TreeTagger's performance goes down while translating PO files, as they contain many such new terms. So even though TreeTagger is rated slightly higher than Brill's tagger performance-wise, in my experience (using it with po_anubadok) the difference isn't significant. Moreover, GPoSTTL is less than one tenth the size of TreeTagger. Recently, I have switched to GPoSTTL while using po_anubadok.
Further, while digging through it, I realized that it's possible to improve GPoSTTL's current tagging method. So if any of you are interested in getting your hands dirty with NLP stuff, it could be a nice playground!
Please mind that it is a parts-of-speech tagger for English. A smart open-source tagger would be greatly helpful for any machine translator that aims to translate from English to XX. Further, it could be a crucial component of a future grammar-correcting program for any English word processor.
Please feel free to use, comment...
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] Re: Bangla software for Linux ?
On 5/7/06, Hafis Ali [EMAIL PROTECTED] wrote:
> Hello, Do you know of any freeware/shareware etc. bangla text editors/word processors that work on linux like Redhat Linux ?
If you are using the GNOME desktop, try gedit; for KDE, please try kate. For an office suite, please try out the latest OpenOffice. These are quite good applications for general-purpose Bengali text processing. BTW, they come bundled with any standard Linux distro, so you may not need to install them.
> Any suggestions on how to get me in a position to write bangla documents and bangla websites without using the MS Windows OS ?
It's rather easy. The following documents will be of help to you: http://www.bengalinux.org/new/content/category/5/14/26/
If you have further queries, please feel free to post. Also, you should mention which desktop you are using.
Cheers, Golam
Re: [Ankur-core] How to test a new Bengali font?
On 5/7/06, Shaheed [EMAIL PROTECTED] wrote:
> By Bangla Literary Archive, do you mean www.boipara.com?
Maybe not! Please try out this one: http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/Archive/archive.bn.html
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] Re: shortcut key combinations
On 4/28/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> But we are not sticking only with KDE and GNOME. What will be for Fedora Core, Mandriva, SUSE, Ubuntu, Debian, OOo, Mozilla, Gaim etc? :-)
I feel and believe that we should follow the same standard wherever possible. However, in this case the problems seem to have technical causes. As Runa pointed out, bn shortcuts are non-functional in GNOME. On the other hand, Progga is quite happy with bn shortcuts in KDE. So if it is a GNOME-specific problem (surely, it can be there somewhere else as well), then I would like to suggest that we try to figure out what exactly goes wrong with Bengali shortcuts there. Is it fixable? BTW, I am no GNOME expert, so my opinion can be pretty naive here.
Personally, I think that using EN shortcuts is at best a short-term solution to the problem, and it is certainly NOT going to be a long-term solution. See the discussion in the mail from Omi: http://www.mail-archive.com/bengalinux-core@lists.sourceforge.net/msg01069.html
It seems Bengali keyboards are already in (keeping aside the layout issue). So the original argument by Taneem, following Indranil's proposal for the use of BN shortcuts, is even more relevant today.
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] Re: shortcut key combinations
On 4/28/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> Progga is also using BN shortcuts in GNOME. IIRC he is using GNOME on FreeBSD.
>> So if it is GNOME specific problem (surely, it can be there somewhere else as well) then I would like to suggest that we should try to figure what exactly goes wrong with Bengali shortcuts there? Is it fixable? BTW, I am no GNOME expert. So my opinion can be pretty naive here.
> Currently I have access to a Ubuntu Dapper box. BN shortcuts are working in GNOME.
OK! I just did a test myself. The machine I have access to runs GNOME on Debian Sarge. I was testing with Firefox. My keyboard switching combination is SHIFT+SHIFT
[ setxkbmap -rules xfree86 -symbols en_US(pc104)+ben(probhat)+group(shift_toggle) ]
It works FINE with both EN and BN shortcuts.
Then I re-ran the test with a new keyboard switching combination, CTRL+SHIFT.
[ setxkbmap -rules xfree86 -symbols en_US(pc104)+ben(probhat)+group(ctrl_shift_toggle) ]
It works for some shortcuts but not for all. This happens with both EN and BN shortcuts.
I may be wrong, but it seems to me that CTRL+SHIFT switching is potentially problematic. For example, suppose we need to press CTRL+KHA; then (with probhat) we need to press CTRL+SHIFT+k. Please note that this combination can conflict with the keyboard switching combination.
Please feel free to add your opinion.
Cheers, Golam
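The suspected conflict can be modelled in a few lines. This is only a toy model of the hypothesis in the message above, not of how XKB actually resolves key events: it assumes a shortcut may collide with the layout toggle whenever the shortcut's modifier keys include every key of the toggle. The key names and the helper `may_conflict` are illustrative assumptions.

```python
# Keys that each layout-toggle option needs, under a simplified model
# (assumption: one specific Control key and one specific Shift key).
TOGGLES = {
    "shift_toggle": frozenset({"Shift_L", "Shift_R"}),
    "ctrl_shift_toggle": frozenset({"Control_L", "Shift_L"}),
}

# CTRL+KHA on Probhat is typed as CTRL+SHIFT+k, per the message above.
CTRL_KHA = {"modifiers": frozenset({"Control_L", "Shift_L"}), "key": "k"}


def may_conflict(toggle_name, shortcut):
    """True if typing the shortcut also holds down every toggle key."""
    return TOGGLES[toggle_name] <= shortcut["modifiers"]
```

Under this model, CTRL+KHA is safe with the SHIFT+SHIFT toggle (only one Shift key is held) but overlaps with the CTRL+SHIFT toggle, which is consistent with the test results reported in the message.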
Re: [Ankur-core] Re: shortcut key combinations
On 4/27/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> ??
:-)) I guess it's because of GMail's infamous encoding bug. See the "Message text garbled?" option in GMail for more. In any case, I am enclosing my last mail in the default encoding this time.
--
On 4/26/06, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> In other words, I think it's perfectly acceptable to have different policies for different desktops.
I agree. I think it makes sense. If I am not wrong, shortcuts are used by people whose geekiness is considerably high :-) Also, such users are generally hooked on a particular desktop of their choice and certainly don't switch between desktops every morning. So it is OK, and desirable if needed, to have different policies for KDE and GNOME shortcuts.
Golam
Re: [Ankur-core] Prize money for Ubuntu Translations
On 4/10/06, Ershad Shafi Chowdhury [EMAIL PROTECTED] wrote:
> After discussion with Jamil, we have decided to encourage translators by offering cash incentives.
Is it Ubuntu-specific, or is it also applicable to Kubuntu? In any case, it sounds encouraging. I hope Bengali in Ubuntu will have greater support because of your efforts!
Cheers, Golam
Re: [Ankur-core] announcement: BOCRA - an OCR software for Bangla etc.
Hi,
I would like to ask one question. During OCR operation (assuming I have already trained it), what should one do if there is an illustration (a non-text element) in the scanned image? For example, it is pretty common to have pictures of a King or Queen within the text in History books.
Golam
Re: [Ankur-core] Bangla Sorting
On 3/30/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> It is manually sorted alphabetically according to Bangla dictionary.
According to sort, it's not sorted :-) The following should be the correct ordering:
গায়ানা
গুয়াতেমালা
গ্রিস
গ্রেট বৃটেন
instead of:
গায়ানা
গ্রেট বৃটেন
গ্রিস
গুয়াতেমালা
Cheers, Golam
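The ordering reported by sort can be reproduced with a short check. This sketch assumes that the sort run compared strings by Unicode code point (as the sort command does in the C locale); a locale-aware collation or a printed Bangla dictionary may order some words differently.

```python
# The four names in the manually sorted order quoted in the message.
countries = ["গায়ানা", "গ্রেট বৃটেন", "গ্রিস", "গুয়াতেমালা"]

# sorted() on str compares code point by code point.
by_code_point = sorted(countries)

for name in by_code_point:
    # Show the second code point, which decides most of the ordering here.
    print(name, hex(ord(name[1])))
```

The second character decides most of it: AA-kar (U+09BE) sorts before U-kar (U+09C1), which sorts before the hasanta (U+09CD) that starts the গ্র conjunct; the two গ্র words are then separated by I-kar (U+09BF) versus E-kar (U+09C7).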
[Ankur-core] Re: want to work for bengalinux.org
On 3/16/06, rayhun rashid [EMAIL PROTECTED] wrote:
> hello there i want to work for bengalinux.org,
You are welcome! Given that there are many areas where you can contribute, it's better that you decide where you would prefer to work. Nevertheless, you can always contribute by translating alongside your own interests. Bengali KDE especially needs major help from translators.
Cheers, Golam
Re: [Ankur-core] need help
On 2/18/06, saeed hassan [EMAIL PROTECTED] wrote:
> hi, i am a regular computer user.i use XP on my pc.but i am interested in linux. i heard that linux is safe to use for internet.i have got many troubles with worms and trojans using internet. can u advise which linux distribution is best for me?
In my (biased) opinion, Mandriva (aka Mandrake)!! If you are willing (and won't mind!) to set up multimedia stuff on your own, then I would recommend (K)ubuntu! In either case you can use the Bengali interface along with English.
You haven't mentioned whether you use a dialup or a broadband connection. If you are on dialup, check whether your modem has a Linux driver!!
> what about GNOME and KDE?are they linux distributions?
No! They are desktop environments. They come bundled with almost all Linux distributions.
> can i run SAS SPSS S-PLUS FORTRAN in linux?
I can confirm FORTRAN, but as for the rest, I don't even know what they are!
> ps: i live in dhaka,bangladesh.there are not many linux distributions available in the market.
Anyone from Dhaka? Does anyone know of a place? Maybe a BDLUG member can help!
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] Computing in BENGALI
On 2/18/06, Taskinoor Hasan [EMAIL PROTECTED] wrote:
> i have a dream that our PC is running in Bengali, from the setup of OS to the shut down. can u give me a suggestion? what things should i learn to work?
Welcome, dreamer :-) I think all major Linux distributions have already started to support Bengali from the setup of the OS to the shutdown!! But the support is not yet complete, and for that one obviously needs people like you! If you are willing to contribute, then to get a broad idea of your goal, please visit Ankur's website http://www.bengalinux.org
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [off-list] Re: [Ankur-core] Computing in BENGALI
On 2/20/06, Progga [EMAIL PROTECTED] wrote:
> You have forgotten to CC Mr. Taskinoor and Mr. Saeed Hasan :-(
Oops!! I have just forwarded those mails to them :-) Thanks!
Cheers, Golam
Re: [Ankur-core] My Name in volunteer list
On 1/27/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
> Both of your names are included now. :)
Thanks! :-)
Re: [Ankur-core] My Name in volunteer list
Hmm!! I think it's a nice moment to put in a request for my name as well :-)
Cheers, Golam
-- http://www.imsc.res.in/~golam/
Re: [Ankur-core] The 'r-a' problem
In continuation of my last mail: it seems that for PDF files it is even simpler. One needs to run
pdfbook -2 input.pdf output.pdf
You may need to download pdfbook if you don't have it already.
Cheers,
Re: [Ankur-core] bengali utf-8 font
On 12/8/05, ashok rawal [EMAIL PROTECTED] wrote:
> Dear sir,
No "sir", please; there are Madams too :-). I think it's best to say "Friends".
> Kindly refer or send the bengali utf-8 fonts
Please visit these links:
http://www.nongnu.org/freebangfont/
http://www.omicronlab.com/avrokeyboard/bangla-fonts/ (this site includes the fonts from the previous one)
http://www.ekushey.org/projects/otf_bangla_fonts/
I hope these will serve your purpose. You can also try Google.
Cheers, Golam
Re: [Ankur-core] Anubadok: status report
Thanks indeed!! I had planned to have 0.1 beta out by now, but it seems life had a different plan for me :-) Honestly, I was down badly and am yet to be fully up, but I think I am close to that now :-) I hope I can give some time to it this weekend, after nearly three months.

Another thing: if you are translating some words in the new_words.list file, then please feel free to send me the bdict.new.db file when you are done. I can then merge it with the main E2B dictionary. I think its word count now stands above 6K, courtesy of Progga.

Thanks,

On 9/19/05, Jamil Ahmed [EMAIL PROTECTED] wrote:
> I was trying today to convert debian-installers POT file and the output was simply amazing! Great work! Please keep it up. :-)
> `Jamil
Re: SPAM-LOW: [Ankur-core] Thunderbird -- BanglarBird
On 7/2/05, Omi Azad [EMAIL PROTECTED] wrote:
> It should be BaazPaakhi...

Yep!! It sounds really cool!! :-)
Re: [Ankur-core] Updated Probhat layout for unicode 4.1
On 6/26/05, Jamil Ahmed [EMAIL PROTECTED] wrote:
> Long time ago, I informed Taneem bhai about this.
> http://www.bengalinux.org/images/probhat_layout.png
> Khanda-Ta is added on AE08 replacing *

Hi,

Has it been merged into the XFree86 distribution? On my machine (a few-months-old Sarge), the Probhat layout still doesn't have daandi (key AB09 is still period). That was disappointing :-(

cheers,
Re: [Ankur-core] Updated Probhat layout for unicode 4.1
On 6/26/05, Jamil Ahmed [EMAIL PROTECTED] wrote:
> Why don't u hack urself and update? :-) Or should I provide you the correct one?

Thanks!! I think it's also there somewhere on the Ankur webpage. In fact, I am not root on my machine (and don't like to reboot the machine either :-)). Also, so far I haven't bothered our sysadmin to do that. Sometimes it's more pleasurable to work in a less privileged world :-))

cheers
Re: [Ankur-core] Use of TreeTagger for Anubadok and its licence
On 6/17/05, Deepayan Sarkar [EMAIL PROTECTED] wrote:
> If I send you a PO file with a few messages, would you be willing to run them through Anubadok?

Yep!! Please send me the file. I just need to run

$ po_anubadok your_file.po(t)

It takes a few seconds. For example, I just tried translating gnome-blog.HEAD.pot, which contains 60 msgids. The time output is

real 0m12.975s
user 0m4.330s
sys 0m0.460s

(on my P4, 2.2GHz, 256M machine).

BTW, I am not claiming that all of them were correct translations!!! But that's part of life. I mean trial and error :-)

cheers,
golam
[Ankur-core] Use of TreeTagger for Anubadok and its licence
Hi,

This post is regarding some license issues around the use of TreeTagger with the translator Anubadok.

Prof. Helmut Schmid (creator of TreeTagger) has granted a license allowing me to integrate and distribute TreeTagger as a component of Anubadok. Its use is free for free translations of free stuff :-). So I have made a tarball of the necessary files (6.9M) with a (relatively) simpler installer script, and it is available for download. Please note that it does not include Anubadok itself; it is just for tree-tagger-english. Frankly, he has given me more rights than I had asked for. However, it's a non-GPL license.

A parts-of-speech tagger is a core requirement for any natural language processing (NLP) software. Unfortunately, there aren't many such tools available under the GPL (even for English!!) which can be readily used by other programs. For the last two weeks, I was mainly hacking through LPost, which is based on Brill's rule-based tagger, written in C and available under the GPL. However, it didn't have any built-in tokenizer (something that understands that in the sentence "Mr. Sayamindu is writing his examination.", the dot at the end of "Mr." is not really the end of the sentence but the dot at the end of "examination." really is :-)). It also didn't have a lemmatizer (something that tells you that the base form of "ate" is "eat"). In fact, this was the crucial drawback (especially for a translator into a Sanskrit-based language, where it is essential to know the root verb).

Nevertheless, its measured tagging accuracy (on the Wall Street Journal corpus) is reasonably high. It is around 91%, whereas the accuracy of TreeTagger (which uses a probabilistic algorithm) is around 93%.

So after two weeks of hacking (sometimes I felt that it might be easier to write a new one than to understand someone else's C program :-)), its new avatar is now fitted with a tokenizer and a lemmatizer and can be used as a drop-in substitute for TreeTagger. Although it is slightly less accurate, it has great advantages. For instance, you can easily train it for newer verbs like "download", whereas you can't do that yourself for TreeTagger, as its database comes in binary form. Also, it's around 0.67M compared to 6.9M for TreeTagger. I will release it with both C and Perl APIs, and most likely it will be integrated with Anubadok to make it stand-alone GPL software.

Unfortunately, I really couldn't give any time to Anubadok itself, apart from some bug-fixing. The tagger took almost all of my free time for the last two weeks. I hope I can now switch modes: from now on I will do bug-fixing for the tagger and coding for Anubadok :-)

BTW, if any of you have used Anubadok and have suggestions for improvements, please let me know.

Cheers,
golam
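To make the two missing pieces described in the message above concrete, here is a minimal Python sketch of what a tokenizer and a lemmatizer do. It is not the code from LPost, TreeTagger or Anubadok; the abbreviation list, the lemma table and the function names are all hypothetical and chosen only to reproduce the "Mr." and "ate" examples.

```python
# Hypothetical abbreviation list; a real tokenizer would ship a much longer one.
ABBREVIATIONS = {"mr.", "mrs.", "dr.", "prof."}

# Hypothetical lemma table mapping inflected forms to their base forms.
LEMMAS = {"ate": "eat", "is": "be", "writing": "write"}


def split_sentences(text):
    """Split at tokens ending in '.', '!' or '?' unless the token is a known abbreviation."""
    sentences, current = [], []
    for token in text.split():
        current.append(token)
        if token[-1] in ".!?" and token.lower() not in ABBREVIATIONS:
            sentences.append(" ".join(current))
            current = []
    if current:  # trailing words without final punctuation
        sentences.append(" ".join(current))
    return sentences


def lemmatize(word):
    """Return the base form of a word, or the lowercased word itself if unknown."""
    key = word.lower().strip(".,!?")
    return LEMMAS.get(key, key)
```

A table lookup like this is only the simplest possible lemmatizer; the point is that the translator needs the root verb before it can pick the Bengali form.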
[Ankur-core] Ankur dictionary project and anubadok
Friends,

This post is regarding Ankur's Bengali Dictionary Project. In particular, it is about how it may be possible to put the dictionary project back on track and how Anubadok can become a useful tool in doing so.

Undoubtedly (at least for me), a successful dictionary project is one of the core requirements for meaningful Bengali language support. I think this is true for any language. I understand that it is mainly a data-entry job, so it's kind of boring :-)). In such a situation, it makes sense to make it more spicy, or to make it a by-product of something else that is not so boring :-).

First of all, many (almost all) of the volunteers here actively participate in translating POT files. I would like to suggest that one can use po_anubadok for doing the data-entry job. When one runs it for the first time on a given POT file, it creates new_words.list. Then one needs to translate these words and run the program again. One can subsequently edit the PO file. Let me emphasize that this process involves much less typing compared to translating a POT file by hand. Also, in its current avatar, po_anubadok performs fairly well on POT files. This is mainly due to the fact that the strings that appear in PO files are generally much simpler in their sentence structure. So, to summarize this point: if you use the translator for translating the POT file, then EVERY SINGLE WORD THAT YOU TRANSLATE SERVES A DUAL PURPOSE. You are translating your PO file and, at the same time, contributing to the dictionary project as a by-product.

Secondly, the volunteers who are involved with the Bengali Wikipedia (translating/creating) are also potential users who can contribute significantly to the Bengali dictionary project, again as a by-product. I am right now trying to find out what the technical requirements are for a reasonable wiki_anubadok. So if you have any suggestions in this regard, please feel free to post. It will be quite helpful for me.

I invite all of you, especially the current lead(s) of the dictionary project, to discuss the above suggestions and post comments, criticism and further suggestions.

Cheers,
golam
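The first-run step described in the message above (collecting the words that still need a dictionary entry) can be sketched roughly as follows. This is a Python illustration under stated assumptions, not po_anubadok's actual code: the function names are hypothetical, only single-line msgid entries are handled, and the dictionary is assumed to be a plain mapping from English words to Bengali.

```python
import re


def extract_msgids(pot_text):
    """Collect non-empty single-line msgid strings from PO/POT text.

    Simplification: multi-line msgid entries and escapes are ignored.
    """
    return re.findall(r'^msgid "(.+)"$', pot_text, flags=re.MULTILINE)


def new_words(msgids, dictionary):
    """Return the sorted lowercase words that have no entry in the dictionary."""
    words = set()
    for msgid in msgids:
        words.update(w.lower() for w in re.findall(r"[A-Za-z]+", msgid))
    return sorted(w for w in words if w not in dictionary)


if __name__ == "__main__":
    pot = 'msgid ""\nmsgstr ""\n\nmsgid "Open file"\nmsgstr ""\n\nmsgid "Save file"\nmsgstr ""\n'
    known = {"file": "ফাইল"}  # tiny stand-in for the E2B dictionary
    for word in new_words(extract_msgids(pot), known):
        print(word)  # one untranslated word per line, like a new-words list
```

Every word the translator fills in for such a list is then available both to the current PO file and to the shared dictionary, which is the dual purpose the message argues for.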
Re: [Ankur-core] An idea and the attempt
On 5/16/05, Kaushik Ghose [EMAIL PROTECTED] wrote:
> Hi, no, it is just a list of bangla words (mostly) corrected for spelling

Thanks. So it has to be built from scratch :-(

On the second issue, I need to fix the conventions. For example, what should be the proper translation for "You do":

1) <bn>aapni koren</bn>
OR
2) <bn>aapni korun</bn> ?

I am slightly confused. There are a few more, but I will put up a list of them later.

Thanks in advance.

Cheers,
golam
[Ankur-core] An idea and the attempt
Friends,

For the last few weeks, I have been experimenting with the idea of a machine translator for Bengali. The main motivation was to deal with PO files, such that the machine translator does the data-entry job and the volunteers only need to do the editing. It can speed up the translation job considerably. However, I now feel that it's possible to reach beyond a PO file translator. We can have a much better machine translator, which can be a useful tool in translating the contents of Wikipedia.

I am making the development version of the program available. There are two strong reasons for doing so, even though it has a long way to go to reach its goal.

The first reason is the LOW word count in its Eng2Ben dictionary. I know it's a boring job, so I have tried to spice it up a little bit. You can directly feed an HTML page (say, from news.bbc) to the program. It will generate a translated HTML page and a list of new words for you. For a few sites, you may need to change the charset info to utf-8 manually for proper viewing in a browser (the program makes a preliminary attempt).

Secondly, I need to fix the conventions for translations of basic phrases. For example, what should be the proper translation for "You do":

1) <bn>aapni koren</bn>
OR
2) <bn>aapni korun</bn>

Obviously, you may have some interesting suggestions as well.

You can download it from my page:
http://www.imsc.res.in/~golam/anubadok/

Hope to hear from you.

Cheers,
golam
Re: [Ankur-core] An idea and the attempt
On 5/16/05, Kaushik Ghose [EMAIL PROTECTED] wrote:
> Are you releasing it under GPL (The page does not say so - at least not where I looked)?

Yep. It says so in the first paragraph of its first page.

> The (currently hibernating) bangla dictionary project on ankur has a large word list (from Barda, Abhijit Das) that you may be able to use.

I am not sure. Can you please tell me whether they are in pair form (an English word and the corresponding Bengali word)?

Cheers,
golam