Re: [Ankur-core] XML standard for Ankur's Abhidhan

2009-05-13 Thread Deepayan Sarkar
On 5/12/09, Salahuddin Pasha salahuddi...@gmail.com wrote:
 Dear all,

  I was working on অভিধান - Abhidhan for XML support.  To
  enable various application and tools to utilize our dictionary.

  Basic work is already done, but we need to define a standard XML (XML
  DTD or XML Schema).

  Any suggestion or comments ?

Back in 2003, the bengalinux dictionary list had a discussion on this.
Nothing ever came out of it, and when Golam first started on anubadok,
his emphasis was more specialized. In any case, that discussion may
provide some suggestions.

You can get it from the list archives, and I'm also attaching a
cleaned up and edited version of the thread here:

thread from May 2003



[Ankur-dictionary] dictionary.dtd
From: Kaushik Ghose kgh...@wa... -2003-05-14 04:17



  Hi,
  here is the descriptor file.
  I'm new to XML and DTDs so please go over the semantics as well as the
  syntax an see if this serves our purpose...


  ?xml version=1.0?
  !ELEMENT entry*(word_bn, info_bn*)
  !ELEMENT word_bn (#CDATA)
  !ELEMENT info_bn (english, pronounciation_bn,meaning_bn)
  !ELEMENT english  (#CDATA)
  !ELEMENT pronounciation_bn  (#CDATA)
  !ELEMENT meaning_bn  (#CDATA)

  thanks
  -kg   



From: Kaushik Ghose kgh...@wa... -2003-05-14 05:12


 Ok, small correction, QTs DOM class seems to parse this correctly

  dictionary.dtd

  ?xml version=1.0?
  !ELEMENT dictionary (entry*)
  !ELEMENT entry (word_bn, info_bn*) 
  !ELEMENT word_bn (#CDATA)
  !ELEMENT info_bn (english?, pronounciation_bn?,meaning_bn?)
  !ELEMENT english  (#CDATA)
  !ELEMENT pronounciation_bn  (#CDATA)
  !ELEMENT meaning_bn  (#CDATA)


  test.xml

  ?xml version=1.0?
  !DOCTYPE entry SYSTEM dictionary.dtd
  dictionary
  entry
  word_bn?   ???/word_bn
  info_bn
  englishseedling/english
  pronounciation_bnankur/pronounciation_bn
  meaning_bn???   ???
  ??
  ??/meaning_bn
  /info_bn
  /entry

  entry
  word_bn?   ?/word_bn
  info_bn
  englishbangla/english
  pronounciation_bnbangla/pronounciation_bn
  meaning_bn???   ?
  ,?   ???
  ?   ?/meaning_bn
  /info_bn
  info_bn
  englishbengali/english
  /info_bn
  /entry
  /dictionary

  thanks
  -kg




From: Deepayan Sarkar deepa...@st... -2003-05-14 07:03


  Ha! A friend of mine once corrected me on this, now I can correct
  someone else :) 'pronounciation' should be spelled
  'pronunciation'.

  I'm not an expert on DTDs (though I know someone who knows much
  more, whom I can ask after after we make some progress). I find it
  very difficult to understand DTD's, and much easier to understand
  examples of what the final thing would look like. Let's work that
  way, and we can write out the DTD on ce we decide on the 'look'.

  I don't know if you know this, but there's something called
  attributes which might be useful. For instance, with multiple
  meanings as different parts of speech.  Here's an example (I'm using
  slightly different tags) --- 'pos' is part of speech, 'plural' is
  whether the word has a plural form, etc.:

  entry
wordchhaanaa/word
info pos=noun plural=false origin=deshi
  meaningdudh theke toiri ek dhoroner .../meaning
  synonyms.../synonyms
  antonyms.../antonyms## ???
translation lang=encottage cheese (?)/translation
pronunciationchhaanaa/pronunciation
/info
info pos=noun origin=tatbhabo  #it's probably not, but...
  meaningshishu, bachchaa/meaning
translation lang=enchild, young/translation  # comma 
separated
translation lang=hnbachcha/translation  #hindi is hn ? 
not sure
pronunciationchhaanaa/pronunciation
derivative form=thechhaanaaTaa, chhaanaaTi/derivative
derivative form=of num=singularchhaanaaTir/derivative
derivative form=of num=pluralchhaanaader/derivative
/info
  /entry

  (I've used romanized bengali in place of what should be bengali, but
  you get the idea.)

  I think we should handle derivative words here (and not have
  separate entries for them

Re: [Ankur-core] Bangla OCR progress

2009-05-09 Thread Deepayan Sarkar
On 5/9/09, Debayan Banerjee debaya...@gmail.com wrote:
 2009/5/9 Deepayan Sarkar deepayan.sar...@gmail.com:

  Debayan,
  
   I have been meaning to ask you: is your character segmentation
   algorithm in a form that could be easily separated out?

 The segmentation algorithm can be found here
  (http://tesseractindic.googlecode.com/files/clipmatra_pseudocode.pdf)

But this is your original algorithm which segmented গ etc (at least
for some fonts), isn't it? I thought you had an improved algorithm
which works around some of those problems (or maybe I misunderstood
your mail).

  If it could be
   easily done, I would like to try it out in BOCRA. Unfortunately, I
   don't think I will have enough time in the near future to figure out
   how ocropus/tesseract does things.


 Kindly read the paragraph in this

 (http://hacking-tesseract.blogspot.com/2009/05/bengali-stats.html)

 post regarding reducing number of character classes to be trained. I
  want to know if this is possible using BOCRA.

No it's not. From the beginning, my design for BOCRA was based on the
idea of on-the-fly training, because that's the only approach I
thought was feasible given the combination of non-standard fonts and
so many potential conjuncts. In most realistic examples, the number of
conjuncts is actually quite limited. After accounting for the most
common ones, the frequency of the rest are probably lower than normal
OCR error rate anyway.

-Deepayan

--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Bangla OCR progress

2009-05-08 Thread Deepayan Sarkar
Debayan,

I have been meaning to ask you: is your character segmentation
algorithm in a form that could be easily separated out? If it could be
easily done, I would like to try it out in BOCRA. Unfortunately, I
don't think I will have enough time in the near future to figure out
how ocropus/tesseract does things.

-Deepayan

--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [X-Post]Bengali to English transliteration anyone?

2009-04-19 Thread Deepayan Sarkar
On 4/19/09, Debayan Banerjee debaya...@gmail.com wrote:
 Does anyone know of any libraries that can transliterate bengali to
  english. There are tools to the reverse. I need this to solve the last
  remaining road-block in OCR.
  The thing is Tesseract-OCR uses a data structure called
  directed-acyclic-word-graph to store dictionaries for lookup. After an
  OCR has been performed the OCR system matches the output with entries
  in this d.a.w.g. file. Unfortunately the data structure is not suited
  to complex scripts like ours
  
 http://groups.google.com/group/tesseract-ocr/browse_thread/thread/5495c4e348a4b272/a6dcfe5d92babb35?lnk=gstq=dawg%2Bwieghts#a6dcfe5d92babb35.
  There are 2 solutions. 1) I figure out a suitable data structure that
  handles Indic script and implement. 2) I transliterate the entire
  dictionary and the OCR output to english (26 characters instead of the
  500 odd for bengali) and then match. I think this should work.
  Any suggestions?

Take a look at the uni2rb.py script in

http://bocra.svn.sourceforge.net/viewvc/bocra/bocra/trunk/src/python/

-Deepayan

--
Stay on top of everything new and different, both inside and 
around Java (TM) technology - register by April 22, and save
$200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco.
300 plus technical and hands-on sessions. Register today. 
Use priority code J9JMT32. http://p.sf.net/sfu/p
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] KDE 4 - Bengali Translation

2008-07-23 Thread Deepayan Sarkar
On 7/23/08, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 Hi All,

  It seems KDE 4.1 release candidate was out last week
  and final release is scheduled for next week. Given,
  this release is going to be the first production
  release of KDE 4 series,  I was wondering whether
  there are any related plans by some of our Ankur
  members.

  Also, I am bit confused about the BN and BN-IN issue.
  How and where should one contribute new PO file translation?
  Does one need to contribute separate translation (specially
  if it doesn't have any country-specific issues) ?

  It would be helpful if Deepayan, Runa, and others can
  shed some light on these issues.

I don't have much information on the bn_IN part, as that has more or
less completely bypassed the Ankur lists.

I haven't been brave enough to update to KDE 4 yet (so don't know the
quality of the current translations), and haven't been able to put
much effort into translation either. I was planning to take a closer
look after 4.1 was out.

-Deepayan

-
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK  win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100url=/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Bangla OCR progress

2008-07-02 Thread Deepayan Sarkar
On 7/2/08, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 On Wed, Jul 2, 2008 at 9:32 AM, Sayamindu Dasgupta [EMAIL PROTECTED]

  This guy seems to be doing some interesting progress for a Bangla OCR
   - or more precisely, enabling Bangla in Tesseract.
   http://debayanin.googlepages.com/hackingtesseract

Cool. I had some interaction with the tesseract/ocropus folks, and it
sounded like a good base. It's nice that someone's actually doing
something with it. It takes the old matra removal approach, and he's
facing the same problems I did (notice in his first example that গ is
segmented into 2 parts, and শু is not). On the other hand, having
something that works even partly is a good start.

 Yes, it looks definitely interesting.

   Looks like he needs some more training data - can we provide him with some 
 ?

 If I remember correctly, there was a sample file for testing completeness
  of Bengali fonts. Since it has all letters and conjuncts typed-in, the
  file might
  be useful for training Tesseract as well .

  Deepayan should be able to give some input here. He has working experience
  with R and may have some training sample as well.

Well, we have a bunch of unicode documents. For some of them, I have
print versions too, and can scan them if needed. A simpler approach
would be to render them using different fonts and take screenshots.

Apparently he also needs some box-files, whatever they are, which need
to be produced using tesseract. I haven't installed tesseract yet, and
will try, but let me know if anyone else manages.

-Deepayan
-
Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW!
Studies have shown that voting for your favorite open source project,
along with a healthy diet, reduces your potential for chronic lameness
and boredom. Vote Now at http://www.sourceforge.net/community/cca08
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Ankur's E2B dictionary:: Database enhancement

2007-06-12 Thread Deepayan Sarkar
On 6/9/07, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 Hi all,

 I am planning to replace the current text-file
 database of Ankur's English to Bengali Dictionary
 by a more formal database.

 I am thinking of using SQLite3 as database
 back-end. Perl binding for SQLite
 is available, so it will make the transition easy.

 I am proposing following fields for each
 row of the database table. Please feel free
 to give your suggestions for more fields
 or anything else that you think will
 be useful.


 ===
 id INTEGER PRIMARY KEY,
 en_word TEXT,
 pos_tag TEXT,
 bn_word TEXT,
 pronunciation TEXT,
 explanation TEXT,
 example TEXT,
 original_contributor TEXT,
 last_editor TEXT
 status TEXT
 ===

 Let me give an explicit example:

 
 id=123,
 en_word=critical,
 pos_tag=JJ,  ( Penn TreeBank parts-of-speech tag for adjective)
 bn_word=সঙ্কটজনক,
 pronunciation=saṅkaṭ-janok,
 explanation=অতি প্রতিকূল বা কঠিন অবস্হা,
 example=তার শারীরিক অবস্থা এখন সঙ্কটজনক।,
 original_contributor=Some good fellow,
 last_editor=Another good fellow,
 status=Unedited
 

 For each different Bengali meaning of a given English
 word there will be different row (entry) in the database.

 Having a different field for explanation from bn_word
 will allow the machine translator to use the same database.
 original_contributor field will help to find out
 the contributions by a given user (was suggested
 by Dipankar-da).

 So please feel free to post your comments/suggestions...

This looks good. It might be a good idea to have reasonably flexible
tools, from the beginning, that can convert between a database and a
plain text format. That way, adding new fields later wouldn't be hard.

-Deepayan
-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat

2007-05-09 Thread deepayan . sarkar
On 5/9/07, Runa B [EMAIL PROTECTED] wrote:
 das wrote:
  On Tue, 2007-05-08 at 14:43 -0400, Golam Mortuza Hossain wrote:
 
  Since, usability is the main issue, I also agree with Deepayan on
  relocation
 
  of 'hasant'.   [ \ ] intuitaviely resembles more like actual hasanta
  than
  current [ / ].
 
 
 
  This point has occurred to me too, time and again. If [\] does anything,
  that should be 'hasanta'. But there is a problem. While typing, all the
  more used things should always be within the three main rows. Use of
  'hasanta' is not at all that common in modern Bangla, but the other job
  that is done by [/] now is conjoining, and that is pretty common, almost
  once every word in average, or even more than that. And moving just the
  little finger of the right hand helps in keeping up the speed, that will
  not be true for [\], every time for a conjoined letter right palm will
  get physically moved, when the bases of both the palms are fixed on the
  table, that is the natural position of the palms, and the healthy one
  for typing for long periods of time (ergonomic too). By just stretching
  the right little finger you cannot get [\]. You have to move it. This is
  true for both the keyboards I use, the TVS-Gold (it is a quite reputed
  one) and even the cramped keyboard of the Acer Aspire 3002 laptop. So, I
  think it will be true for any keyboard. This speed thing is so very
  important for the heavy bulk Bangla writers. So, I think, in no way it
  will be prudent at all to give the conjoining function to the [\] key.
 
 I absolutely support Dipankarda's views here about the hasanta. Moving
 it to \  is an inconvenient way of writing conjuncts.


 Secondly, about DS' query regarding (in)frequent usage of ZWJ:

 Sayam's blog entry regarding the matter is pretty much conclusive

 http://sayamindu.randomink.org/ramblings/2007/04/09/ra-japhala-in-bengali-bn-and-unicode-50/

 Unicode rules prescribe the usage of ZWJ (and not ZWNJ) for writing
 ra-japhala.

 quote-from-blog/book
  …Unicode Standard adopts the convention of placing the character
 U+200D ZWJ immediately after the ra to obtain the ra-yaphaala…
 /quote-from-blog/book

Yes, my point was (and still is) that this use of ZWJ is very very
infrequent compared to the use of ZWNJ. At one point you suggested
that we have ZWNJ on a shift key and ZWJ on a non-shift key, and I was
trying to understand your reasons for that.

 Since ZWNJ is also used by Uniscribe's older versions perhaps the

Again, we seem to have a misunderstanding. Your implication seems to
be that ZWNJ no longer has any valid use. Is that what you are trying
to suggest?

 following scheme can be adopted to keep both on non-shift keys:

 [`]  ZWJ
 [\] ZWNJ

I'll be perfectly happy with that.

-Deepayan
-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat

2007-05-09 Thread Deepayan Sarkar
On 5/8/07, das [EMAIL PROTECTED] wrote:
 On Tue, 2007-05-08 at 11:14 -0700, Deepayan Sarkar wrote:
  Hope that helps.

 Yeah, it helped. Thank you for this. Why don't you write a good Bangla
 primer on these things and related matters? Can you suggest me a bit
 more documentation on Unicode and Bangla, not very technical though, I
 am no developer. Not those on the unicode-org site. Is there any? Good
 but simple and short? Why don't you write one in Bangla?

The trick is not in writing a guide, the trick is to make people read
it. Where would you suggest I put it? Sometimes answers-on-demand are
just easier :-)

-Deepayan

-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat

2007-05-08 Thread Deepayan Sarkar
On 5/8/07, Runa B [EMAIL PROTECTED] wrote:
 Jamil Ahmed wrote:
  On 4/30/07, das [EMAIL PROTECTED] wrote:
 
  On Sun, 2007-04-29 at 15:18 +0600, Jamil Ahmed wrote:
 
  [ `] =  ZWNJ
  [~] =  ZWJ
 
 
  +1
 
 
  Anyone else? DS/SDG/RunaB/Golam/Others??
 
 I guess I had proposed this..mostly for the ra-japhala issue as mentioned by 
 sayam.

 [~] =  ZWNJ
 [ `] =  ZWJ

 I'd still go with this. The ZWJ is currently on shift+= (or +) and it'll be 
 easier to write,
 if it doesn't need a shift key.

I'm not sure I understand. Are you proposing that we have the rarely
used (only for ra-japhala) ZWJ bound to a non-shift key and the fairly
common ZWNJ bound to a shift key? Why?

Why are we not trying to use [\]? It seems pointless to have it free.

-Deepayan

-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat

2007-05-08 Thread Deepayan Sarkar
On 5/8/07, mak [EMAIL PROTECTED] wrote:
 On 09/05/07, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 
  As it could be a major upgrade for Probhat layout, it would be nice
  if people post their other concerns/comments/usability issues that they
  might have!!
 
 
 If possible think about the dot / fullstop / period symbol; to get this
 either go to numeric keypad, or change the layout into English again.

I would say | (vertical bar) is the more intuitive daanri symbol,
but then we'll have to find something for double-daanri.

Here's another idea that may or may not be worth pursuing: X has (or
at least had) this concept of groups -- in fact what used to happen
when you switched a layout was you moved to a new group. (I'm speaking
in the past tense because there has been some restructuring with the
Xorg move and I'm not sure if this has changed). The interesting thing
was that even though a single keystroke could only emit one character,
it could also change the group.

I used this feature to fake a finite state machine and created a
modified probhat layout that had the same key bound to different code
points depending on context. So, for example, vowel keys like u would
be bound to উ in state 1 and ু in state 2. Pressing any consonant key
would change the state to 2. Pressing any vowel key would change to
state 1, etc. The advantage of this is that (1) it's more intuitive
and (2) frees up more keys.

I gave up on using this after moving to Xorg because my old layout
changing commands didn't work any more and I couldn't figure out how
to fix it. I'm not sure if it's at all possible now, but if it is and
if others are interested, this could be the basis of an alternative
layout.

-Deepayan
-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Proposal] Changing the assignment of ZWNJ in Probhat

2007-05-08 Thread Deepayan Sarkar
On 5/8/07, Omi Azad [EMAIL PROTECTED] wrote:
 Jamil Ahmed wrote:
  Yes, we can use [`] and [\], both without pressing shift key. :-))
 
 

 If you are thinking for [\] then I suggest \ = ZWNJ :)

Agreed.

-Deepayan

-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Ankur's English to Bengali Dictionary

2007-05-06 Thread deepayan . sarkar
On 5/6/07, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 Hi all,

 I have updated the interface of Ankur's E2B dictionary with Murshed's
 really nice BN Web tool. So now people can contribute new Bengali
 word without changing his/her keyboard layout and it should just work :-)

 http://www.bengalinux.org/cgi-bin/abhidhan/bdict.pl

 So please feel free to test it and post any suggestions that you
 may have.

 In April 2007, for first time Ankur E2B dictionary has reached
 five digit hits  (11,927) per month. Of late, Google ranks it
 very highly for relevant terms and of course Google ranking
 is the main reason for the surge.

 Now, I would like to put a issue for discussion. Does  use of
 Google AdSense  would make any sense? What are
 the main moral/legal issues for/against it?  Please feel
 free to share your frank opinions!!

I have nothing against making money, but the question is who makes it?
who pays taxes? etc. I'm not really sure how other open source
projects deal with this. It's probably not a big issue for projects
essentially run by individuals. Some projects create some sort of
legal entity that handles the money. I don't know if going that far is
worth it for us.

Then there's the question of what's going to be done with the money,
which is a whole different issue.

-Deepayan

-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Rules for Probhat layout

2007-04-24 Thread Deepayan Sarkar
On 4/24/07, S M Mahbub Murshed [EMAIL PROTECTED] wrote:
 I can help you with that. You can keep one Bangla and the English keyboard in 
 the exact same interface. Users must change language using 'esc'. Also you 
 can add few more layouts and provide radio buttons for selection.

 Let me know your thoughts.

Whatever you end up with, it would be very helpful if you try to
separate reusable parts of it in a javascript library of some sort. I
have thought about creating and maintaining something like it on
behalf of Ankur, but never got around to it. It has many possible use
cases; Anubadok is one, another obvious one is a search box in various
contexts (e.g. the archive, or wikipedia).

-Deepayan

-
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] bugs in pango

2007-04-08 Thread Deepayan Sarkar
On 4/8/07, Sayamindu Dasgupta [EMAIL PROTECTED] wrote:
 On 4/6/07, Deepayan Sarkar [EMAIL PROTECTED] wrote:
  Hi,
 
  I have recently noticed a fairly bad bug in Pango w.r.t. the handling
  of ZWNJ; in fact I'm surprised no one has mentioned it before. Anyway,
  I've put up a test page with screenshots at
 
  http://bengalionline.net/rendering-bugs/rendering-bugs.html
 

 Deepayan - could you take a look at the latest Pango from svn and
 confirm if ঝন্ঝন্ is still there. I can't seem to reproduce that.

I'll take your word for it (I don't have time to test). For the
record, I was going to say that replacing ZWNJ by ZWNJ ZWJ seemed to
fix the rendering, so that could have been a quick and dirty solution.

  Could someone confirm the behaviour I see and file an appropriate bug
  report for Pango? I will follow up on the Qt stuff myself. It would be
  nice if someone could also look into openoffice, but it has bigger
  problems than this (which I'm happy to discuss if anyone's
  interested).

 OOo is confusing for me. ...anyway - what are the problems ?

Look at

http://dsarkar.fhcrc.org/bn/sumo.bn.pdf

which is obtained by

1. wget http://bengalionline.net/Archive/Rabindranath/Poetry/sumo.bn.html
2. remove first line because OOo doesn't understand XHTML
3. open with OOo and export to PDF

I get weird placement of some glyphs in a systematic but hard to
describe manner. This is with Jamrul, and I haven't tested with other
fonts (which is why I didn't want to bring this up yet).

PDF exporting isn't the issue, I see the same problem inside OOo, I
just posted the PDF as something invariant. If not for this issue, OOo
would be ideal for creating PDF files for the archive (it can even be
scripted using UNO).

-Deepayan
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


[Ankur-core] bugs in pango

2007-04-05 Thread Deepayan Sarkar
Hi,

I have recently noticed a fairly bad bug in Pango w.r.t. the handling
of ZWNJ; in fact I'm surprised no one has mentioned it before. Anyway,
I've put up a test page with screenshots at

http://bengalionline.net/rendering-bugs/rendering-bugs.html

Could someone confirm the behaviour I see and file an appropriate bug
report for Pango? I will follow up on the Qt stuff myself. It would be
nice if someone could also look into openoffice, but it has bigger
problems than this (which I'm happy to discuss if anyone's
interested).

-Deepayan

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Mukti version 1.99.1

2007-03-22 Thread Deepayan Sarkar
On 3/21/07, das [EMAIL PROTECTED] wrote:
 On Wed, 2007-03-21 at 19:08 -0700, Deepayan Sarkar wrote:
  I don't see
 
  ক্+ট্+র
 
  on the list, and I can't think of a Bengali word that would need it,
  but one could spell use it in ইলেক্ট্রিক. Is that a valid use?

 It is valid. But, this kind of use should be progressively abandoned.
 'ইলেকট্রিক' was suggested by grammar specialists, or even with 'hasanta'
 for the time being, like 'ইলেক্ট্রিক'। Though there is the 'ি  '
 there, which is less problem-some. But if it ended in 'অ', as Runa
 Bhattacharya once pointed out, like 'অক্ট্রয়' (octroi) , it looks
 primarily more awkward, like 'অকট্রয়'. But it works, you see, gradually
 people get used to it. When the English and other alien words started
 becoming received into the language, it was still a Sanskrit hegemony.
 And hence, many grammar rules were directly imported from Sanskrit. Like
 'station', a popular version of the 'বাংলা বর্ণীকরণ'  was 'ষ্টেশন',
 which was very correct according to Sanskrit 'ষত্ব-বিধি', till now you
 will see it in the signboards of the suburban all-purpose shops in the
 form of 'ষ্টেশনারি'.  (The prescribed spelling is 'স্টেশন', or
 'স্টেশনারি')

 This queer spelling with 'ষ' was done due to the Sanskrit 'ণত্ব-বিধি' of
 'ষ' always getting added up with 'ট'. Or, say, Bangla 'ঘন্টা',  was
 spelled as 'ঘণ্টা'. So, later, the highest Bangla grammar specialists,
 including Suniti Chatujje, suggested that for all directly Sanskrit
 words (বাংলায় যাদের আমরা তত্সম বলি, যারা মূল সংস্কৃতের সঙ্গে একই আছে) we
 will use Sanskrit grammar rules, like 'নির্ঘণ্ট' with a 'ণ' but for the
 changed and internalized and folk-root words, like 'ঘন্টা', it would be
 'ন'. And when we are writing alien words (English, European, Arabic,
 Farsi, and so on) we will always avoid:

 1. ণ and ষ

 2. ঈ, ী, ঊ, ূ

 3. ঋ and 'li' (i have forgotten the key in Probhat)

 4. As much as conjuncts should be avoided as possible. This will depend
 on common-sense. And also 'ৎ' and 'ঃ'  should be totally dispensed
 with.

 Sorry for taking so long (maybe you did not exactly want it, or maybe
 you knew it from before, but you can't help it with an old talkative
 teacher, and i am a third generation teacher almost reaching fifty).

My question was rhetorical, and I agree with you with for the most
part (although I didn't know about the recommendation about 'ঘন্টা',
for which I used to prefer ণ), but thanks for sharing, one always
learns something new :-)

My point was that there is no definitive list of 'valid conjuncts'.
For example, even if ক্+ট্+র was somehow 'outlawed', you would still
want to use it in a document that says so, since it used to be valid
at one point. Consequently, SM's goal of waiting till a full list is
compiled is doomed to failure, and he should just put out what's
available.

Deepayan
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Mukti version 1.99.1

2007-03-22 Thread Deepayan Sarkar
On 3/22/07, Shaheed [EMAIL PROTECTED] wrote:
 On Thursday 22 March 2007, Deepayan Sarkar wrote:
  There's at least one mistake:
 
  ন্+ধ+য   ন্দ্য
 

 Could you suggest the correct form here please?

Just remove the + signs: ন্+ধ্+য   -  ন্ধ্য

Deepayan
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Mukti version 1.99.1

2007-03-21 Thread Deepayan Sarkar
On 3/19/07, Shaheed [EMAIL PROTECTED] wrote:
 On Monday 19 March 2007, Sankarshan Mukhopadhyay wrote:
  On 3/18/07, Shaheed [EMAIL PROTECTED] wrote:
   On the latter point, perhaps the attached is of use? This is a
   generalisation of a test document I originally got from someone on this
   list...sorry, I forget who that was.
 
  Is the set of glyphs in the document complete ? ie Can this be put
  forward under appropriate licenses as *the complete (as on date) test
  case* when it comes to Bengali fonts ? If the answer to that is yes,
  then I'd suggest that this be done as soon as one can. Allows other
  organisations to standardise and measure their font completeness
  against this.

 It should be close. I did not add the various punctuation and other symbols.

 Since Bengali is not my first language (though it is my mother tongue!), the
 real question for you native speakers is whether I got the tabluations of
 conjunctions right/complete?

 Once I get that sorted out, I'll go thorough and add any missing symbols or
 odd conjuncts that don't fit the tabluation.

There's at least one mistake:

ন্‌+ধ+য ন্দ্য

As for completeness, I don't think it's realistic to ask for a
complete set that does not actually list all possible 2- and
3-conjuncts. There is no real rule that dictates inclusion in the
list. For example, I don't see

ক্‌+ট্+র

on the list, and I can't think of a Bengali word that would need it,
but one could spell use it in ইলেক্ট্রিক. Is that a valid use?

Also, if the goal is to test fonts (rather than create a list of
conjuncts), then this file is definitely missing many
conjunct+vowel-sign forms; e.g. স্তু, দ্রূ, etc. From a typographical
point of view, any conjunct ending in ট or ঠ followed by ি or ী is
interesting, even more so when followed by a chandrabindu.

-Deepayan
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Anubadok online: PO file translations

2007-03-07 Thread Deepayan Sarkar
On 3/7/07, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:

 On 3/7/07, mak [EMAIL PROTECTED] wrote:
   So until temproary file storing is implemented,
 
  will it take long?

 Fortunately not.  It is FIXED.  So now you can upload
 a partially translated file and download the complete file
 instead of only untranslated messages of original file.

I haven't had a chance to use it yet (but should soon). I have one
question in anticipation: is there a size limit on the number of
strings in a PO file (I remember some issues in the CLI version).

Deepayan

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Congratulations to BGhosh

2006-11-24 Thread Deepayan Sarkar
On 11/24/06, Progga [EMAIL PROTECTED] wrote:

 BG, Congratulations for being redditted -
   http://people.ubuntu-in.org/~ghoseb/esr.html

 :-)

But I thought Progga liked ESR :)

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Congratulations to BGhosh

2006-11-24 Thread Deepayan Sarkar
On 11/24/06, Baishampayan Ghose [EMAIL PROTECTED] wrote:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1

 On Saturday 25 November 2006 01:48 AM, Deepayan Sarkar cobbled together
 some glyphs to say:
  BG, Congratulations for being redditted -
http://people.ubuntu-in.org/~ghoseb/esr.html
 
  :-)
 
  But I thought Progga liked ESR :)

 Only like? Everybody _loves_ Eric Raymond [1] ;)

 Regards,
 BG

 [1] http://geekz.co.uk/lovesraymond/archive/show-them-the-code

Great link, thanks :-)

-Deepayan

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


[Ankur-core] RFC: new experimental interface for bangla archive project

2006-11-02 Thread Deepayan Sarkar
Hi,

I've recently been playing around with AJAX, using the Bangla archive
as an example. Today I finally managed to get something that works
reasonably well in Firefox/Konqueror/IE (only IE 7 tested so far). The
results can be seen here:

http://www.bengalinux.org/projects/archive/dynamic.php

Comments/suggestions welcome.

-Deepayan

-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] RFC: new experimental interface for bangla archive project

2006-11-02 Thread Deepayan Sarkar
On 11/2/06, dipankar das [EMAIL PROTECTED] wrote:
 On Friday 03 Nov 2006 7:02 am, Deepayan Sarkar wrote:

  I've recently been playing around with AJAX, using the Bangla archive
  as an example. Today I finally managed to get something that works
  reasonably well in Firefox/Konqueror/IE (only IE 7 tested so far). The
  results can be seen here:
 
  http://www.bengalinux.org/projects/archive/dynamic.php

 It was quite interesting. There are a few problems though. 'বঙ্গভাষা' entry in
 'মধুসূদন দত্ত' is not working, at least in Konqueror.

Not working in what sense?

It used to be that Konqueror didn't implement the white-space:pre CSS
property, and to work around that I had nbsp's inserted all over the
place in pre-formatted content (i.e. poetry). This seems to have been
fixed in the latest version (I'm running 3.5.5), so I didn't bother to
do that any more (it increases file sizes by a lot). The old versions
are still available though (see below).

If there's some other kind of problem, please give more details.

 And if i click anything
 while the 'loading' signal is there, the whole thing at times goes to a
 freeze. Is it some problem with the Java in my box, or the package?

No java involved, only javascript. But I tried doing that here, and it
seemed OK (i.e. the last click got processed). Maybe I should make
sure the earlier requests are aborted.

One (bad) way of debugging (in Firefox/Konq) is to turn off the
stylesheet (konq View-Use Stylesheet-Basic Page Style), in which
case a message box with some debugging messages is visible.

 But it
 was interesting: it is so novel to see Bangla text archives, not images, or
 PDF-s.

This is actually a fairly old project (started circa 2001, and
languishing since 2004). The link given above actually goes to

http://www.stat.wisc.edu/~deepayan/Bengali/AltWebPage/dynamic.html

The old version is available at

http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/bengali.html

and it's corresponding new version is at

http://www.stat.wisc.edu/~deepayan/Bengali/AltWebPage/bengali.html

The difference in the new version is that most of the HTML files are
now XHTML and as I said, the nbsp;-s are dropped.

 Is the archive searchable?

Through google, which seems good enough to me. I have thought of
adding a search box on the site itself (along with a little phonetic
input tool), but haven't managed to do it yet.

 Did you type in all the things? Is there any news of Bangla OCR on the
 horizon? Any works of this kind are so limited by the absence of a reasonably
 good OCR. And so many millions of good old Bangla works are there outside
 copyright.

That's one of my favorite topics :-), and unfortunately, no, I don't
know of any good OCR software available (there are many that _claim_
to exist, but are not available either commercially or otherwise). I
have, naturally, tried to experiment with it myself over the years,
and my best effort so far is hosted at

http://bocra.sourceforge.net

Test cases and some results are available at

http://bocra.sourceforge.net/doc/gb-bn.html

Needless to say, there are still lots of problems. I'll be happy to
discuss more if anyone's interested.


Since you mention PDF's, there's something I've been meaning to ask on
the list (and you specifically, since you work with openoffice). Last
time I tried, both gnome and kde did a lousy job of creating PDF files
from HTML. On the other hand, Openoffice does really great, in terms
of embedding vector fonts and keeping file sizes down. Do you (or
anyone else) know of a good way to convert HTML to PDF using
openoffice through a command line script? If there is a way, it might
be nice to be able to offer PDF versions as well.

-Deepayan
-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Anubadok Online

2006-09-25 Thread Deepayan Sarkar
On 9/25/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 On 9/25/06, mak [EMAIL PROTECTED] wrote:
  It's good to see the dream taking shape towards being true :)

 :-)

  msg The amount of text entered is more than allowed at a time /msg
  what is the allowed limit ?

 Right now, it is set to just 500 characters but there are no technical
 reasons for it being so.  It would be good, if you could suggest some
 other number!

500 is too little. Isn't there some constraint imposed by the tagger
(or was that the old non-free one)?

Deepayan

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] bengali font

2006-09-09 Thread Deepayan Sarkar
On 9/9/06, T K Bose [EMAIL PROTECTED] wrote:

 I am interested to install a system in my computer by which I can easily
 write script in bengali language. Although I am bengali I can not write but
 can read printed writing. Iam a 56 year person now very much interested to
 write in bengali in my computer specificali for learning the languge. If you
 can help me I shall be very much obliged. Tarun Kumar Bose

I assume you are using Windows. I don't use Windows, so I don't really
know how to write Bengali in it. However, you should find some helpful
information here:

http://bn.wikipedia.org/wiki/%E0%A6%89%E0%A6%87%E0%A6%95%E0%A6%BF%E0%A6%AA%E0%A7%87%E0%A6%A1%E0%A6%BF%E0%A6%AF%E0%A6%BC%E0%A6%BE:Bangla_script_display_help

If the link doesn't work, go to http://bn.wikipedia.org and click on
the link How to read and write in Unicode Bangla. I'm also CC-ing
[EMAIL PROTECTED], in case someone else has any hints.

Deepayan

-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Bengali support in Padma extention?

2006-09-08 Thread Deepayan Sarkar
On 9/8/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 Do you have any idea what is the character inside
 the marked region of the screenshot

 http://www.imsc.res.in/~golam/abp-firefox1.jpg

Could be 00DE or 00FE (see
http://www.unicode.org/charts/PDF/U0080.pdf, Latin-1 punctuation).

Deepayan

-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] [Ankur-dictionary] how to print bengali

2006-08-27 Thread Deepayan Sarkar
On 8/26/06, santanu sutar [EMAIL PROTECTED] wrote:
  Hello,
  Does anybody has any idea how to print the Bengali files (say, the
  dictionary files or the .po files)?

Not sure what all the options are, but last time I tried, openoffice's
printing (as well as the export as PDF option) was pretty good.

Deepayan

-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


[Ankur-core] anubadok and wikipedia

2006-08-08 Thread Deepayan Sarkar
Hi,

as the anubadok website points out, translating wikipedia articles is
a natural use of anubadok. This would be facilitated if there were a
webpage similar to

http://www.bengalinux.org/cgi-bin/abhidhan/bdict.pl

that could work with paragraphs rather than words or phrases (much
like http://babelfish.altavista.com/). Would that be hard to do?

Deepayan

-
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Contributions to online English to Bengali Dictionary at Ankur

2006-08-05 Thread Deepayan Sarkar
On 8/5/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 Hi!

 An about it page is linked to the E2B dictionary main
 page.

 http://www.bengalinux.org/abhidhan/about.htm

 Please post your comments, suggestions...on it.
 I will modify the same accordingly.

 Also, can someone from old boys of Ankur :-)
 post a brief history (few lines) of Ankur's
 dictionary project? I will put that in the about it
 page.

Well, this dictionary project (english to bengali) is the opposite in
intent of the old dictionary project (bengali to bengali/english).
It's history is simple: বলা সহজ, করা শক্ত. No actual work was ever
done, AFAIK. Actually, I might have a bit more free time in the
foreseeable future than I have had for the past few years, and I would
be willing to put in some coding time if there's any interest in
reviving the old dictionary project.

Deepayan
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] http://bn.shunya.in - call for Bangla contributions and moderators

2006-06-02 Thread Deepayan Sarkar
On 6/1/06, Baishampayan Ghose [EMAIL PROTECTED] wrote:
 -BEGIN PGP SIGNED MESSAGE-
 Hash: SHA1

 Golam,
  If no one objects then I would like to propose your name as a
  moderator of bn.shunya.in. You can begin after mid-June. Hopefully,
  by then your academic workload will come down to normal :-)

 It's not just academic work-load. I have too many projects lined up in
 the queue. In the past I have taken up projects but have never managed
 to maintain them or work on them. I would like to rectify this now.
 So I'd prefer that some one among you guys take it up. I have no
 objections if you, Mak or anybody for that matter take it up. A Bengali
 tech-news website is a novel concept and it'd be great if we can
 maintain it.

Is it necessary to have only one moderator? slashdot has several. I'm
happy to volunteer as a moderator, but not if I'm the only one.

Deepayan


___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


[Ankur-core] Re: How to test a new Bengali font?

2006-05-10 Thread Deepayan Sarkar

On 5/9/06, Shaheed [EMAIL PROTECTED] wrote:

Hi,

Thanks for all the replies. I've done some testing, and the basic stuff
seems
to be working reasonably well. There is a bit more cosmetic tidying up to
do,
but I'd like to submit what is there at present:

ftp://[EMAIL PROTECTED]/Raga-1.5.tgz

for your feedback, and consideration as a new contribution to the Free
Bangla
Font project. Deepayan, are you the right contact for that?


Not really. I am probably an admin, but I have never actually done
anything on the website, all the actual work was done by Sayamindu. In
fact my own fonts are outdated there. (Now that sourceforge supports
subversion, I am planning to start a separate project there for my
fonts.)

Deepayan


[Ankur-core] Re: How to test a new Bengali font?

2006-05-08 Thread Deepayan Sarkar

On 5/7/06, Shaheed [EMAIL PROTECTED] wrote:

On Thursday 04 May 2006 01:33, Progga wrote:
 On Wed, May 03, 2006 at 09:10:36PM +0100, Shaheed wrote:
  Is there, for example, a test page and an application to use?

 Not sure about test pages, but bn academy has a list of conjuncts and I am
 sending it as attachment.

 Testing the font on the Bangla Literary Archive should help too.

By Bangla Literary Archive, do you mean www.boipara.com? if so, then I'm
afraid I cannot get Konqueror or Firefox to show anythibng other than
gibberish (even after selecting UTF-8 encoding). Any hints welcome!


He probably meant

http://www.ankurbangla.org/projects/archive

which points to

http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/bengali.html

Deepayan


Re: [Ankur-core] Re: shortcut key combinations

2006-04-30 Thread Deepayan Sarkar

On 4/29/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:

On 4/30/06, Jamil Ahmed [EMAIL PROTECTED] wrote:

 So we are sticking with BN shortcuts then...

..but before that Runa's concern must be taken care off!
She has brought out several points in this regards.

#1 BN shortcuts are not working smoothly everywhere.

Personally, I have found that it can become erratic
in some cases. Ex. using CTRL+SHIFT for switching.
Please feel free to add other potentially problematic
situation. So that we can compile a sort of DO NOT list.
Runa, can you please give your response here.


I'm slightly confused about this. There are two types of shortcuts,
one in menu items etc (indicated by _ or  in the translated strings),
and one like CTRL-Q to quit an application. Which ones are we talking
about? The first type are activated by pressing ALT. Where does the
CTRL key come in? If we are talking about the second,  I didn't even
know that they could be translated. Can they? How?

As far as KDE is concerned, it's not a good idea to use CTRL-SHIFT as
a shortcut. Quoting from
http://www.bengalinux.org/documents/bengali-keyboard-mini-howto.php,

quote...you can use an option like grp:shift_toggle which will
toggle by pressing both shift buttons. In which case your command
becomes:


setxkbmap -symbols us(pc101)+ben -option grp:shift_toggle


This would in fact be preferable if you use KDE, since many standard
KDE keyboard shortcuts involve the Ctrl+Shift combination./quote

An easier alternative in KDE is to use the KDE keyboard switcher (by
default uses CTRL-ALT-K to switch layouts)


#2 Duplication of BN shorcuts

This is a serious problem. Checking manually for
duplication is not an easy task! However, I think
its not difficult to write a script to check them
automatically. In fact, I already have some codes
(as a part of future version of po_anubadok) that
aims to do that. First, it splist po files depending
on their location (msggrep --location=SOURCEFILE)
then it checks for repeatation of shortcuts.


I don't think this will work. First of all, it's fine to have the same
shortcut in different submenus. Also, consider the two strings Enable
Proxy and Disable Proxy. They will probably be close together in
the sources and in the same submenu as well, but will never be
displayed together. So, it's fine if they have the same shortcut.

For KDE, there's a tool for this (although I have never actually used
it) called Dr Klash. See

http://www.kdedevelopers.org/node/520

Deepayan


Re: [Ankur-core] Re: shortcut key combinations

2006-04-30 Thread Deepayan Sarkar

On 4/30/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:

On 4/30/06, Deepayan Sarkar [EMAIL PROTECTED]

 Quoting from
 http://www.bengalinux.org/documents/bengali-keyboard-mini-howto.php,


But isn't the same page advocating the use of CTRL+SHIFT
for Bengali keyboard switching??  :-( I think it would be nice to
update the page.


Well, it's one of those things. Someone (I forget who) wrote the first
draft. I complained about the KDE problem, but no one responded. I
didn't think it would be right to change it completely without
permission, so I added the alternative with a comment about KDE.

It is probably due for an update anyway with xorg 7.0. I had problems
following those instructions, so I'm using KDE's switcher nowadays.


  #2 Duplication of BN shorcuts
 
  This is a serious problem. Checking manually for
  duplication is not an easy task! However, I think
  its not difficult to write a script to check them
  automatically.

 I don't think this will work. First of all, it's fine to have the same
 shortcut in different submenus. Also, consider the two strings
 Enable Proxy and Disable Proxy. They will probably be
 close together in the sources and in the same submenu as
 well, but will never be displayed together. So, it's fine if they
 have the same shortcut.


Maybe I didn't get you properly. You are saying, having same bn
shortcuts MAY still work which is correct. But problem is really
with situations where it DOES NOT work. Don't you agree that
if one ensures all shortcuts are different in a submenu
(location) then that's is SUFFICIENT condition such
that all shortcuts are functional?


Yes, your solution is sufficient (so I shouldn't have said it wouldn't
work) but suboptimal (and possibly impractical). It may force
unnatural choices unnecessarily. There may easily be more shortcuts in
an application than available characters. Also, this may not work for
modular applications, e.g. KDE's control center, where the source code
for each module is in a different file, but the shortcuts should not
conflict the main menu entries.

Deepayan


[Ankur-core] Re: shortcut key combinations

2006-04-28 Thread Deepayan Sarkar

On 4/27/06, Jamil Ahmed [EMAIL PROTECTED] wrote:

On 4/26/06, Deepayan Sarkar [EMAIL PROTECTED] wrote:



[...]




 I have been meaning to respond on this issue, but I couldn't find a
 particular reference I was looking for. It's not any of the above, and
 maybe it was not a public thread (perhaps someone else will be able to
 remember). It was, if I remember correctly, a discussion that ended
 with the _final_ decision that bengali shortcuts would be used (this
 happened around the time KDE translation was started). I was initially
 against this, but Taneem made an argument that convinced me.

See these links. :)

http://sourceforge.net/mailarchive/message.php?msg_id=540
http://sourceforge.net/mailarchive/message.php?msg_id=541
http://sourceforge.net/mailarchive/message.php?msg_id=542
http://sourceforge.net/mailarchive/message.php?msg_id=543
http://sourceforge.net/mailarchive/message.php?msg_id=5416065


Thanks, the last one is the one I was looking for. I wish
sourceforge's mailing list search was a bit more sophisticated.

Deepayan


[Ankur-core] Re: shortcut key combinations

2006-04-27 Thread Deepayan Sarkar
On 4/27/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
 ??

Gmail bug, I think. I couldn't see the mail either, but it's readable
in the online archives.

 On 4/26/06, Golam Mortuza Hossain [EMAIL PROTECTED] wrote:
 


 ---
 Using Tomcat but need to do more? Need to support web services, security?
 Get stuff done quickly with pre-integrated technology to make your job
 easier
 Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
 http://sel.as-us.falkag.net/sel?cmdlnkkid0709bid3057dat1642
 ___
 Bengalinux-core mailing list
 Bengalinux-core@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/bengalinux-core



--
http://www.stat.wisc.edu/~deepayan/


[Ankur-core] Re: What is Unicode

2006-04-25 Thread Deepayan Sarkar
On 4/25/06, Omi Azad [EMAIL PROTECTED] wrote:
 I think you need not to write a big mail for explaining your complains.
 I believe no one shut down your opportunities yet. Just suggest your
 opinion followed by the mistakes. That is what we don't do. People
 complains about many things on may list/places. But it's also very
 painful for us (who are related with Bangla Computing and l10n) to
 believe that some people are very poor who cannot afford 2 minutes for
 submitting a proper feedback.

 Don't take my words wrong. I just want to make you understand that we
 are working without *any* profit and কাজের বিষয়ে কেউ কটু মন্তব্য করলে ভালো
 লাগেনা।

 One more thing. Bangla became very handy in Bangladesh. For example বাড়ী
 became বাড়ি, হাতী became হাতি so no one actually complains much about
 spelling. Work is only matter for us. It's hard to believe that Bangla
 version came on the site after almost 2 years of Hindi. :( Poor Language na?

 I'd like to see your suggestions followed by the mistakes I have done so
 far.

If you mean that you are willing to consider suggested changes, then
that's great, but unless you say that explicitly (which you did not do
in your original post or in the translation), it's presumptuous to
assume that's what you meant. Dipankar did offer to proofread this and
future texts.

For what it's worth, I think the points he raised are very valid, and
you are overreacting (as usual). If you don't want constructive
criticism, you shouldn't post on an open source list.

Deepayan


[Ankur-core] Re: announcement: BOCRA - an OCR software for Bangla etc.

2006-04-06 Thread Deepayan Sarkar
On 4/6/06, Omi Azad [EMAIL PROTECTED] wrote:
 I didn't find ৎ in your documentation. :)

I can't see what you wrote, so I assume it's khanda-ta. If so, then
yes, I haven't updated my scripts/fonts yet. BOCRA itself doesn't know
much about Bengali though (just the bare amount necessary to reorder
i-kaar, e-kaar, o-kaar, etc), so you can always add it with a new name
and it will show up in the romanized output. It's very easy to fix the
code that converts this to unicode.

Deepayan


[Ankur-core] announcement: BOCRA - an OCR software for Bangla etc.

2006-04-05 Thread Deepayan Sarkar
Hi,

this is to announce a new project on sourceforge called BOCRA to
develop an OCR software targeted towards Bengali, but also useable for
other scripts, especially ones similar to Bengali. See the webpage

http://bocra.sf.net

for more details (the page is slightly outdated, but mostly accurate).
There's a mailing list too, so feel free to ask questions there. 
Before you get your hopes up, note that BOCRA does not work out of the
box (it needs to be trained for every new font) and is very unlikely
to work well with noisy input images.


Status and availability:

The software is currently in alpha status (by my definition), i.e. it
mostly works to my satisfaction in most of my machines. It uses Qt 4
and R (both of which are required to compile the sources). The sources
(licensed under the GNU GPL) are hosted as a subversion archive on
sourceforge, so you need a subversion client to get up to date
sources. A snapshot tarball is also available, which you can download
without a subversion client.

OS requirements:

Both Qt 4 and R are available on all major platforms. However, I have
tested BOCRA only on Linux, so I don't know if it will work on other
platforms (e.g. Windows). There will probably be issues in linking Qt
and R together on Windows, so I don't expect it to work out of the
box.

Deepayan
--
http://www.stat.wisc.edu/~deepayan/


[Ankur-core] policies (was: Re: Formation of Bengali(bn) translation team for Ubuntu?)

2006-03-01 Thread Deepayan Sarkar
On 3/1/06, Jamil Ahmed [EMAIL PROTECTED] wrote:
 On 3/1/06, Baishampayan Ghose [EMAIL PROTECTED] wrote:
  -BEGIN PGP SIGNED MESSAGE-
  Hash: SHA1
 
  Jamilbahi,
   Would not it be nice - if you announced it at Ankur's mailing [ie.
   [EMAIL PROTECTED] list first?
  The team is not an exclusive team. Everybody is welcome to join it but
  the issue is that we need to first talk to the Rosetta admins and get
  the official status first. That's what I was waiting for. I'd have then
  announced it on Ankur Core anyway.

 Generally we announce at Ankur Core first. I guess you don't know our
 policy as you are new comer. :)

I wonder why. It's not like anyone bothers to consider any feedback
any more. Take for example this thread:

http://sourceforge.net/mailarchive/forum.php?thread_id=8986799forum_id=12023

where a `redesigned' Ankur website was announced (by none other than
Jamil, in fact). Since I had strong reservations about this new
design, I pointed them out as politely as possible, along with
possible alternatives. As usual, there were no responses. However, a
few days later, without any further announcement whatsoever,
www.bengalinux.org began to be redirected to this new site. This was
more than a month ago, so it's unlikely to have been a temporary
experiment.

I, for one, certainly do not plan to waste any more of my time on
``Discussion about our future direction, projects, goals'' (which,
apparently, is the purpose of the core list).

Deepayan


[Ankur-core] Re: Problem :: Localized Debian Installer

2005-12-06 Thread Deepayan Sarkar
On 12/6/05, Jamil Ahmed [EMAIL PROTECTED] wrote:
 Hi,

 Pls check, http://www.bengalinux.org/screenshots/debian/index.php

 Is it Font problem? or Rendering problem?

Definitely font. I can't be sure, but I think the font being used is
unifont (which doesn't have any opentype tables for bengali). In the
past, I have seen this being used by GTK even when a proper bengali
font is available, to the extent that all the translated menu entries
in gnome applications used it. I assume the priority can be somehow
controlled by fontconfig, but I couldn't figure out how. You should
ask on the appropriate debian list.

Deepayan
N�HY޵隊X���'���u���[���
ަ�k��!���W�~�鮆�zk��C� [EMAIL PROTECTED],a{���,�H��4�m���i�(��ܢo�v'

Re: [Ankur-core] Ankur :: Re-designed

2005-11-17 Thread Deepayan Sarkar
On Wednesday 16 November 2005 12:12, Sankarshan Mukhopadhyay wrote:
 Jamil Ahmed wrote:
  Pls check the url and let us know your comments.
  http://www.bengalinux.org/new/

 With Firefox 1.0.7 in en locale on RHEL4U2 I see this problem:

 When you click on the grey sidebar on the left to navigate to a
 particular page, there is a small dotted extension of the segment that
 is present for the duration of the click.

 Can someone tell me if this is a problem with my OS+application
 environment or is this something that can be reproduced across OS ?

I see the problem on Konqueror too.

  Thanks to Salahuddin [EMAIL PROTECTED].

 Good work.

Yes, great work. However, I have several questions and concerns:

1. Why?

Why do we need this drastic a change? I see a few new features: 

(a) A login interface. Can anyone tell me what exactly it is that we would be 
able to do after logging in? Is this logging into sourceforge? There's a link 
to the SF project page for that, and in any case, I'm sure that can be done 
with the current setup as well.

(b) Poll. Who cares?

(c) How many people are logged in. Again, who cares?


2. Look

Changing the look and feel of the site is a legitimate goal. However, that can 
be done with the current version just as well. If you have a browser that 
supports alternate stylesheets, go to www.ankurbangla.org and choose the 
'Alternate Experimental Stylesheet' (on my firefox it's under 'View-Page 
Style'), or see the screenshot at 

http://www.stat.wisc.edu/~deepayan/ankur-alternate.png

Note that this doesn't change one bit of the HTML/PHP, just the stylesheet. 
This is a crude first attempt, obviously it can be improved a lot by someone 
willing to spend some time on it. For a nice and less garish example, see 

http://www.mozilla.org/about/


3. Underlying HTML content

I really, really, really hate it when people use tables for formatting, or for 
anything except tabular data. This is almost as bad as using GIF images to 
render bengali. This used to be a valid approach in the days of Frontpage 
Express (just as using images to render bengali could have been justified in 
the days before unicode). Nowadays there is something called CSS. If you want 
to do web design, do it well; learn CSS.

To give a relevant example, here is the HTML that sits behind the navigation 
bar in the current page:


div class=navbar

h2a href=/ Home /a/h2
h2a href=/bn/index.phpবাংলা পাতা/a/h2  
h2 General /h2
pa href=/projects/ People  Projects /a
pa href=/documents/ Documents /a
pa href=/downloads/ Downloads /a
pa href=/contact.php Contact /a/p
pa href=/screenshots/ Screenshots /a/p
pa href=/lists.php Mailing Lists /a/p
h2 Miscellaneous /h2
pa href=http://sf.net/projects/bengalinux/; SF.net project page /a/p
pa href=http://www.linux.org/; Linux.org /a/p
pa href=/links.php Related Links /a/p
/div


and here is the (almost) same piece in the new version:


table width=100% border=0 cellpadding=0 cellspacing=0
tr align=lefttda href=index.php?option=com_frontpageamp;Itemid=1 
class=mainlevel id=active_menuHome/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=viewamp;id=5amp;Itemid=6 
class=mainlevel বাংলা পাতা/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=categoryamp;sectionid=4amp;id=22amp;Itemid=29
 
class=mainlevel Projects/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=viewamp;id=31amp;Itemid=33 
class=mainlevel People/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=categoryamp;sectionid=5amp;id=14amp;Itemid=26
 
class=mainlevel Documents/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=viewamp;id=32amp;Itemid=32 
class=mainlevel Downloads/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=viewamp;id=33amp;Itemid=34 
class=mainlevel Screenshots/a/td/tr
tr align=lefttda href=index.php?option=com_contactamp;Itemid=3 
class=mainlevel Contact Us/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=blogcategoryamp;id=15amp;Itemid=28
 
class=mainlevel Mailing Lists/a/td/tr
tr align=lefttda href=index.php?option=com_searchamp;Itemid=5 
class=mainlevel Search/a/td/tr
tr align=lefttda 
href=index.php?option=com_contentamp;task=categoryamp;sectionid=1amp;id=1amp;Itemid=35
 
class=mainlevel News/a/td/tr
/table/div
div class=moduletable

table width=100% border=0 cellpadding=0 cellspacing=0
tr align=lefttda href=http://sourceforge.net/projects/bengalinux/; 
target=_blank class=mainlevel SF.net Project Page/a/td/tr
/table



What does something this horribly complicated and completely unreadable gain 
us? To say nothing of the fact that the whole page is formatted as a table.

I have no idea how much of this is imposed by Joomla, so this is not 
necessarily a complaint against it. However, I have spent a lot of time 
'fixing' the HTML in the current site, and I will be very disappointed to see 
it go back to 

Re: [Ankur-core] Re: [Debian-in-workers] Bengali in new Debian Installer

2005-09-22 Thread Deepayan Sarkar
On Thursday 22 September 2005 13:01, Jamil Ahmed wrote:
 Soumyadip Modak wrote:
 On Thu, 2005-09-22 at 13:40 +0600, Omi Azad wrote:
 Deepayan Sarkar wrote:
 AFAICS, Debian itself is not Unicode 4.1.0 compliant yet (the unstable
  version of console-data has [1]
  /usr/share/unidata/UnicodeData-4.0.1d1b.txt). The change is the
  addition of one (moderately rare) code point, but the font editor we
  use (fontforge) doesn't know about it yet (I haven't checked the CVS
  version recently though).
 
 In any case, this is a minor problem (that will get solved eventually),
  and more importantly, completely orthogonal to the issue at hand.
 
 Deepayan
 
 But Unicode 4.1.0 works fine in my Ubuntu 5.04 box. Windows and it's
 range of software also doesn't have any update yet, but that works fine
 without any problem. It's just a character besides a particular code, so
 there should not be any problem getting the output if we fix the font.
 In fact I'm not getting any problem in both Windows and Linux. I just
 added the codepoint in the glyph with FontLab (Windows) and it's working
 well. You can check it by downloading any of the font from
 http://www.ekushey.org.
 
 What Deepayanda probably meant is that there are significant parts of
 Debian-specific (and by extension Ubuntu-specific) software that is not
 Unicode 4.1.0 compliant. The compliance of the Desktop portion of the
 distribution doesn't mean we can call the distribution Unicode 4.1.0
 compliant. Of course the modularity of a *nix system maybe a trifle too
 difficult to understand for people used to MS Windows :)

 So what is the solution now? :)

I don't see the problem yet.

The only question would be one of policy. If a translated string involves a 
khanda-ta (which, for those who don't know, is the only Bengali 'character' 
whose encoding changed from 4.0.1 to 4.1.0), should it be encoded using the 

(a) 4.0.1 rule or 
(b) 4.1.0 rule ?

I think (b) would be the obvious choice, as long as it doesn't crash anything. 
Upside: this would remove any future conversion headaches. Downside: at most 
cause khanda-ta's not to be displayed correctly. It is quite possible, as Omi 
suggests, that dropping in a replacement unicode 4.1-compatible font would 
solve any display problems.  My guess is that this problem would be mostly 
irrelevant because the translated strings will have few (if any) khanda-ta's 
(which we cannot verify unless enough translations are done). So I suggest 
that we get on with the non-trivial task of translation rather than worry 
about unimportant problems.

Deepayan


---
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server. 
Download it for free - -and be entered to win a 42 plasma tv or your very
own Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Re: Post Fair Re-Union of Internet Fair'05 and some other issues

2005-05-31 Thread Deepayan Sarkar

[Dropping [EMAIL PROTECTED] since it rejects non-member postings]

On Tuesday 31 May 2005 02:57 am, Sajed Chowdhury wrote:
 Deepayan Sarkar wrote:
  Suzan wrote:
   [...]
   3. If user will not use bengali unicode font, then how
   website developer will display their bengali website to
   the user? Does he use embedded font or anything?
 
  They could, but that comes with its own set of disadvantages (with no
  advantages, from my point of view as a Linux user).

 But is it not desirable to provide graceful degradation for
 non-Unicode enabled browsers?  What's the disadvantage with that?
 (other than a little bit extra effort)

I'm not sure what you mean. Are you asking about existing Bengali websites 
that use embedded fonts, or new ones that are trying to decide which 
technology to use? What extra effort are you talking about?

As far as I know, the only browser and Operating system combination that 
supports embedded fonts is IE on Windows (Bitstream, the technology used by 
Anandabazar, is also supported, for instance, with Netscape 4.x on Solaris, 
but that's hardly a combination worth serious consideration). IE on Windows 
is not a 'non-Unicode enabled browser', so I don't see your point. Any other 
(realistic) browser/OS combination is 'non-embedded font enabled', so whether 
or not they are 'non-Unicode enabled' is irrelevant.

Deepayan


---
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Re: Post Fair Re-Union of Internet Fair'05 and some other issues

2005-05-31 Thread Deepayan Sarkar
Tuesday 31 May 2005 1:07 pm , Sajed Chowdhury 
:
 Deepayan Sarkar wrote:
  Are you asking about existing Bengali websites
  that use embedded fonts, or new ones that are trying to decide which
  technology to use? What extra effort are you talking about?

 I'm talking about new ones and also existing ones that would like to
 migrate to unicode.  The extra effort would be having (some backend
 script generate) the same content in 3 forms depending on the
 capability of the browser (in order of preference):
 1. unicode
 2. font embedding using eot/pfr
 3. text as gifs/pngs

For those that currently do 2 (and/or 3), they have to decide whether they 
want to support 1.

 The only problem is that automatic detection of browser capabilities
 can be error prone, so for instance, users would have to click on a
 link like Click here for a graphic version of this page or somesuch
 to get option #3.

  IE on Windows
  is not a 'non-Unicode enabled browser'

 What about IE on Windows98?  Unfortunately a lot of people (e.g.,
 Bangladeshi cybercafes) are still using that combination.

I don't know how to react to that. Supporting obsolete systems is a perennial 
dilemma, since that decreases the incentive to upgrade, which is not a good 
thing. If it were up to me, I would probably ignore these users completely, 
and hope that some of them would ditch these (invariably illegal) 
installations and start using GNU/Linux instead. However, that's obviously 
not a pragmatic/acceptable solution (even though cyber cafe-s are probably 
the ideal test-beds for GNU/Linux deployment).

Deepayan


---
This SF.Net email is sponsored by Yahoo.
Introducing Yahoo! Search Developer Network - Create apps using Yahoo!
Search APIs Find out how you can build Yahoo! directly into your own
Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] OO and other issues

2005-05-25 Thread Deepayan Sarkar
On Wednesday 25 May 2005 06:18 am, Jamil Ahmed wrote:
 - Original Message -
 From: Taneem Ahmed [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]
 Sent: Wednesday, May 25, 2005 12:21 AM
 Subject: [Ankur-core] OO and other issues

  btw, given that this is the core@ list, not meant for the general user, I
  am going to change posting permission for this list, unless I hear any
  objections. It will still be open for everyone to read, but only members
  will be able to post. The idea of the core list was to have a place for
  discussion among group members regarding issues that affect the group.

 That will be great! But already we have spread this core@ to the world for
 communicate with Ankurians. :-P

We can easily modify the list settings so that non-member posters will get an 
email message stating that they should use users@ instead.

I agree with Taneem that the core@ list should be restricted. I'm not sure 
what his reasons are (hopefully he will elaborate soon), but I think it's 
important for us to clearly define which individuals we consider to be core 
members. I'm not sure what Ankur's name is worth in the real world nowadays, 
but clearly there has been some abuse of it. The only reasonable way of 
disassociating Ankur with such abuse is to make clear which individuals have 
the authority to act 'officially' as Ankur representatives. By default only 
core members should have this authority.

Deepayan


---
SF.Net email is sponsored by: GoToMeeting - the easiest way to collaborate
online with coworkers and clients while avoiding the high cost of travel and
communications. There is no equipment to buy and you can meet as often as
you want. Try it free.http://ads.osdn.com/?ad_id=7402alloc_id=16135op=click
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] problem with reading bengali newspaper in linux

2005-04-20 Thread Deepayan Sarkar
On Wednesday 20 April 2005 10:18, Kaushik Ghose wrote:
 Stock firefox has supported unicode since the fist version I
 downloaded (0.9) and shows bangla pages with OTF just fine.

One should not make such global statements about software that have 
different rendering backends on different OS's. I'm sure your statement 
is correct for the particular version of the particular OS you were 
running at the time, but it is uninformative unless you tell us what 
that was (I'm guessing some form of Windows).

 -kg

 Progga wrote:
  On Wed, Apr 20, 2005 at 03:53:17PM +, Shekhar Nath wrote:
 The pictures and other things open
 clearly but the bengali fonts don't come properly.
 
   They won't, because stock Mozilla doesn't support Open Type fonts
  which are needed to view Unicode-encoded webpages.  The newly
  released BanglaFox browser ( a derivative of Mozilla FireFox ) can
  show you such webpages correctly.  You can also use Konqueror.  But
  if the webpages you are trying to browse uses an encoding different
  from Unicode then all the possibility is that it uses an embedded
  font.  The solution is to findout the name of the font from the
  html source of the webpage and then download and install that font.
 
   You can send the URL of the site if you wish.


---
This SF.Net email is sponsored by: New Crystal Reports XI.
Version 11 adds new functionality designed to reduce time involved in
creating, integrating, and deploying reporting solutions. Free runtime info,
new features, or free trial, at: http://www.businessobjects.com/devxi/728
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] problem with reading bengali newspaper in linux

2005-04-20 Thread Deepayan Sarkar
On Wednesday 20 April 2005 11:54, Kaushik Ghose wrote:
 Ah. So this
 They won't, because stock Mozilla doesn't support Open Type fonts
 which are needed to view Unicode-encoded webpages.

 referred to Linux only. Ok.

Yes, although I suspect the validity of the causality implied by that 
statement (in other words, even if Mozilla rendered Opentype correctly, 
it probably wouldn't have helped).

 I got the impression that BanglaFox was a localized version of
 FireFox, not that it added bangla rendering capability (which exists
 in stock).

 So firefox on linux does not display OTF ?

I'm not very conversant with all the issues here (being a Konqueror 
user), but apparently there's a compile-time flag that uses the 
relevant Pango modules to render text, in which case it does work. But 
not all distributions ship such a version (in particular, Debian 
doesn't, which is as far as my knowledge about this goes), which 
suggests that not everything is quite all right with this. The last 
time I tried a custom-compiled version, I noticed some issues with text 
selection (IIRC).

 -kg

 Deepayan Sarkar wrote:
  On Wednesday 20 April 2005 11:24, Kaushik Ghose wrote:
 win xp
 -kg
 
  Right. The original problem was on Linux, so your observation is
  irrelevant :-) (Any newspaper would have ensured that they are
  readable on Windows anyway. Although Shekhar hasn't told us which
  sites he was having problems with, I suspect this is in no way an
  unicode/opentype issue. The right way to approach this would be to
  bug the newspaper to support Linux, but of course they wouldn't
  care one bit.)
 
 Deepayan Sarkar wrote:
 On Wednesday 20 April 2005 10:18, Kaushik Ghose wrote:
 Stock firefox has supported unicode since the fist version I
 downloaded (0.9) and shows bangla pages with OTF just fine.
 
 One should not make such global statements about software that
  have different rendering backends on different OS's. I'm sure
  your statement is correct for the particular version of the
  particular OS you were running at the time, but it is
  uninformative unless you tell us what that was (I'm guessing some
  form of Windows).


---
This SF.Net email is sponsored by: New Crystal Reports XI.
Version 11 adds new functionality designed to reduce time involved in
creating, integrating, and deploying reporting solutions. Free runtime info,
new features, or free trial, at: http://www.businessobjects.com/devxi/728
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] wiki vandalism

2005-03-07 Thread Deepayan Sarkar
On Monday 07 March 2005 06:54, Mathijs Romans wrote:
 Hi,

 it seems your FAQ-page was hijacked by commerce. Also, there is an
 error when I try to use PageHistory, so I cannot revert the
 vandalism.

Thanks. There does seem to be errors in the wiki scripts. I hope someone 
else knows enough about our wiki setup to fix this. There are a couple 
of other pages vandalized as well.

For now, I've created a 


---
SF email is sponsored by - The IT Product Guide
Read honest  candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595alloc_id=14396op=click
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] wiki vandalism

2005-03-07 Thread Deepayan Sarkar

[sorry, hit send by mistake earlier]

On Monday 07 March 2005 06:54, Mathijs Romans wrote:
 Hi,

 it seems your FAQ-page was hijacked by commerce. Also, there is an
 error when I try to use PageHistory, so I cannot revert the
 vandalism.

Thanks. There does seem to be errors in the wiki scripts. I hope someone 
else knows enough about our wiki setup to fix this. There are a couple 
of other pages vandalized as well.

For now, I've created a new page at

http://www.bengalinux.org/wiki/index.php/newFAQ

from a backup taken (apparently) in July 2004, and linked this from the 
nav bar.

Deepayan


---
SF email is sponsored by - The IT Product Guide
Read honest  candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595alloc_id=14396op=click
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Subcription Option

2005-01-31 Thread Deepayan Sarkar
On Monday 31 January 2005 00:12, you wrote:
 Thanks for reply ...

 i want to subscribe in benglinux.org ... so that i can get familier
 with the developers and other technical stuffs ... like other
 websites ...

Hmm. Well, bengalinux is actually hosted on sourceforge, so you can get 
what (I think) you want at:

http://sourceforge.net/projects/bengalinux/

There should have been a link from the front page, so I've added one.

Deepayan


---
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag--drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Subcription Option

2005-01-30 Thread Deepayan Sarkar
On Sunday 30 January 2005 21:57, Sayedul Huque wrote:
 Hi ...

 this is a great step towards open source movement.

 and it would be very nice if there was any subcription option ... if
 there any please let me know ...

Not sure what you mean by 'subscription' here...

Deepayan


---
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag--drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Bangla linux project

2005-01-28 Thread Deepayan Sarkar
On Thursday 27 January 2005 20:16, priyabrata  mukherjee wrote:
 I am a Java/J2EE guy with 4 years of experience and I am good with
 Bangla (had studied in Bangla medium from Shanti Niketan)... Please
 let me know how I can help in the Bangla linux project.

 I can only be an occasional contributor as I am working full time in
 a pretty demanding job...

Welcome to Ankur. I would say you're ideal for translation work; no 
long-term commitments and a good grasp of bengali really helps. Take a 
look at 

http://www.bengalinux.org/projects/

Deepayan


---
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag--drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Re: bn-BD Locale is done for Unicode (adopted by IBM)

2005-01-18 Thread Deepayan Sarkar
On Tuesday 18 January 2005 18:46, Progga wrote:
 On Tue, Jan 18, 2005 at 07:32:33PM +0600, Omi Azad wrote:
  But the correction I made by going through Deepaysn's
  upperstandard-places.po file, then why the correction is needed?

  Either
  DS has misspelled it in the first place
  or
  you have mistyped
  or
  The spellings are different in BD and WB.

  But I do have some confusions on the spellings of the central Asian
 republics' (?) names.

Of Progga's changes, I'm doubtful about

-   territory type=CR 
/territory
+   territory 
type=CR/territory

(what's the justicication for losing the gap?) and 

-   territory type=GR/territory
+   territory type=GR/territory

I'm quite sure  is the spelling used in WB. It's possible that 
it's 
different in BD, but I'd encourage Progga to double check just in case 
(in a history text maybe).

I don't think there are any standard spellings for some, like 

-   territory 
type=AZ/territory
+   territory 
type=AZ/territory

(this is probably one of the Central Asian names), I was just going by 
what I believed to be the pronunciation.


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] AM/PM in Bangla

2005-01-13 Thread Deepayan Sarkar
On Thursday 13 January 2005 12:20, Sajed Chowdhury wrote:
  do the words really get truncated

 No... just hypothetically.  Frankly speaking, the words 
  and
  just triggered a general concern that users who don't 
 have a
 strong bangla vocabulary (say, young'uns) might actually feel a bit
 intimidated when using a bangla desktop for the first time.

Shouldn't they use an English desktop then? I would rather lose a few 
users than lose the oppurtunity of teaching someone a few new words. I 
realise that most people are happy to live their whole lives trying to 
avoid learning anything, but I don't see why we should encourage them.

I understand your concern though, and I'm not sure what the best 
approach would be.  I generally prefer a more colloquial style for the 
same reason, while others (e.g., Progga) prefer a more formal style. 
Ultimately it's a matter of personal taste.

For AM/PM specifically, I would be reluctant to change 
 and 
 
(which are the correct spellings, BTW).  Any computer interface 
invariably contains lots of technical words that cannot be translated. 
If we start not translating even words that have such nice and 
unambiguous translations, what's the point of translating at all?

Deepayan


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Preliminary version of a font developer's guide

2005-01-12 Thread Deepayan Sarkar
On Tuesday 11 January 2005 13:39, Sayamindu Dasgupta wrote:
 http://sayamindu.randomink.org/conv_guide/

 Flames/brickbats solicited.

Looks nice (haven't had time to actually read it). One gripe though, we 
are not the 'Ankur Bangla Group', just 'Ankur Group' at most.


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Help needed-- Can't use Likhan

2005-01-05 Thread Deepayan Sarkar
On Wednesday 05 January 2005 01:19, Kushal Das wrote:
 How to use Likhan or other fonts to see those messages properly

We need more information to give an useful answer. What application are 
you using to check your email? If it is a web-based email (yahoo in 
particular), then the first thing you need to do is to change the 
encoding of the web page -- by default yahoo pages are not utf-8. There 
should be a menu item on your browser somewhere to change the encoding, 
change it to utf-8 (or 'Unicode').

That should at least show you some boxes instead of things like 
/.
 If you see boxes, 
that means IE doesn't know what font to use to show bengali (even if 
the font is installed). To fix this, use the IE menu to choose Tools - 
Internet Options - Fonts. There, for language Bengali, you should 
select an appropriate font.


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Chandra Bindu

2004-12-31 Thread Deepayan Sarkar
On Friday 31 December 2004 09:45, Omi Azad wrote:
 Can anyone please explain me the about the sequence of Chandra Bindu?

 As Unicode says, we have to type like Ka -  Aakar - ChandraBindu, but
 I think that is not right, we should type Ka - ChandraBindu - Aakar.

Could you explain why you think so?

 Please comment in this issue.

I think the recommendation of writing Ka -  Aakar - ChandraBindu is 
correct. I can't quote any grammar book to support me, but it's the 
only thing that makes sense to me. Chandrabindu/anusvara/bisarga are 
all marks that only apply to a vowel. For example, the unicode sequence 
'ka chandrabindu' actually represents 

ka hasanta a chandrabindu

('ka' having the implicit vowel for 'a'). If your opinion is correct, 
that would mean 'ka chandrabindu' would instead be 

ka hasanta chandrabindu a

which makes no sense to me, since 'ka hasanta chandrabindu' is not a 
meaningful entity. In fact I have no idea what you think the unicode 
sequence 'Ka ChandraBindu Aakar' (the one you think is correct) could 
possibly represent in actual grammatical terms.


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] QT and GPOS tables

2004-12-30 Thread Deepayan Sarkar
On Thursday 30 December 2004 13:26, Sayamindu Dasgupta wrote:
 Does QT support GPOS tables ??

I think so. Why?


---
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almosthttp://www.thinkgeek.com/sfshirt
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] OOo improves!

2004-12-23 Thread Deepayan Sarkar
On Thursday 23 December 2004 07:23, Jamil Ahmed wrote:
 Here is one with FC3
 http://www.bengalinux.org/screenshots/others/OOo-FC3-1-1-4.png

This illustrates what I was already going to point out, namely that OOo 
balks at showing split vowels (o-kaar - e-kaar + aa-kaar) and not 
juktakshars. Your Windows screenshot just happens to have none of 
those :-), so I can't say whether that's fixed there.

Deepayan


 Thanks,
 `Jamil


 - Original Message -
 From: Jamil Ahmed [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
 Sent: Thursday, December 23, 2004 12:25 PM
 Subject: [Ankur-core] OOo improves!

  There is no juktakkhor rendering problem in 1.1.4 [windows]
  release. Hope it will work with *nix too! Can anybody check it?
 
  Keep smiling,
  `Jamil


---
SF email is sponsored by - The IT Product Guide
Read honest  candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. 
http://productguide.itmanagersjournal.com/
___
Bengalinux-core mailing list
Bengalinux-core@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] We are /. - ted again

2004-12-08 Thread Deepayan Sarkar
On Wednesday 08 December 2004 06:12, Jamil Ahmed wrote:
 - Original Message -
 From: Deepayan Sarkar [EMAIL PROTECTED]
 To: [EMAIL PROTECTED]
 Sent: Tuesday, December 07, 2004 8:48 PM
 Subject: Re: [Ankur-core] We are /. - ted again

  On Monday 06 December 2004 23:10, Jamil Ahmed wrote:
  Check
  http://linux.slashdot.org/linux/04/12/03/0457205.shtml?tid=110tid
 =10 6
 
  btw, who changed the layout of www.ankurbangla.org site? :-(
 
  Changed how exactly? It's usually me that messes around with the
  layout, but I haven't done anything in a long time.

 It was back again to the usual color. :-P
 btw, before that it had reddish/brownish background!

Yes, that sounds like an alternate stylesheet I was experimenting with 
(you can see it if your browser supports alternate stylesheets). It 
should never have been the default (and I didn't change anything 
recently). What browser were you using?


---
SF email is sponsored by - The IT Product Guide
Read honest  candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. 
http://productguide.itmanagersjournal.com/
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] I am interested in being part of project Ankur.

2004-10-01 Thread Deepayan Sarkar
On Thursday 30 September 2004 05:06, Indraneel Sikdar wrote:
 Hi,

 I got to know of you guys through some extremely covoluted process.
 I am not sure how I can help you all. But I am interesting in being
 part of project Ankur.

 I am what they call a probashi bengali. I have a somewhat stunted
 exposure to bengali literature but can speak the language and read
 the script with fair amount of competence.
 On the technical angle, I graduated from IIT Roorkee in 2002
 (Electrical Engineering) and while my work experience is mostly on
 core Java technologies on Unix platforms, I am ok with C too. I do
 not have any previous exposure to open source, but I do not think
 that should be a hinderance here.

 Do tell me if there is some way I can chip in

There's a FAQ which tells you what you can do if you want to join:

http://www.bengalinux.org/wiki/index.php/FAQ

The easiest way to contribute is by joining one of the translation 
projects, but that may not work for you if your grasp of bengali is not 
good. Some of the other projects involve interesting technical 
challenges, but it's up to you to look into them and decide what you 
find appealing. There's also the possibility of starting new projects.

Deepayan


---
This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
Use IT products in your business? Tell us what you think of them. Give us
Your Opinions, Get Free ThinkGeek Gift Certificates! Click to find out more
http://productguide.itmanagersjournal.com/guidepromo.tmpl
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Re: Bangla Text to Speech Synthesis System

2004-09-08 Thread Deepayan Sarkar
On Wednesday 08 September 2004 03:33, Amir Rahat wrote:
 -- Forwarded message --
 From: Amir Rahat [EMAIL PROTECTED]
 Date: Wed, 8 Sep 2004 14:24:03 +0600
 Subject: Re: Bangla Text to Speech Synthesis System
 To: Deepayan Sarkar [EMAIL PROTECTED]

 Thanks for ur opinion.I found ur discussions while i was searching
 with google n i read all of those.The thing is with Festival 1.4.3 u
 have to develop ur own diphone database (almost 1600 diphones for
 bangla) n u cant use the default diphones or phonesets used for
 english or french.Now as i found out most of the indian researchers
 depending on Festival for building a synthetic voice as its modular n
 u can add ur own modules very easily (like morphological analyzer for
 bangla) which u cant do with MBROLA (the other leading speech
 framework).If u carefully go through the framework documentation
 (Building a synthetic voice,http://www.festvox.org/bsv/) you will
 find out that its possible to build a better voice with festival.I
 guess u ppl read the system documentation first (as i followed the
 link given there).Besides English n french many languages like
 japanese, chinese,turky n other TTS are made with festival.Also the
 Indian IITs are switchd to festival (they were working based on
 MBROLA).S.P. Kishore n other ppl have started working with Alan W
 Black(http://www-2.cs.cmu.edu/~awb/) for building a better hindi TTS
 system.So i guess we should try with Festival first as we found out
 it as the best possible framework.I visited the dhvani site b4 but in
 many discussion i read that the voice quality is not that good.Also
 the Hindi n tamil voices which were built using MBROLA in CDEC n
 IIIT.So lets try with Festival first.You can run Festival on Windows
 also.
 For new features of Festival visit
 http://www.cstr.ed.ac.uk/projects/festival/

I'm sure that's  all true. We were looking for a quick solution and did 
not have the time to investigate long-term possibilities. So best of 
luck to you.

 We are now developing a diphone database but for that first we hav to
 define a bangla phoneset n till now i cldnt find out anything on the
 web.so thinking to visit Bangla Academy for help or otherwise we have
 to take help from any Linguistics.

For that at least, I think you'll fnd the work done by dhvani useful.  
Actually, that's basically equivalent to starting with the Bengali 
alphabet. The Devanagari alphabet is pretty scientific, in the sense 
that the letters of the alphabet correspond quite well to sounds. 
Bengali is a just a simplified version of that --- with essentially no 
difference between dantya-na and murdhanya-na, murdhanya-sha and 
talabya-sha, etc.

 I hope if we need any help you people will help us.

Sure. 

What are your eventual plans on licensing/redistribution terms?

Deepayan


---
This SF.Net email is sponsored by BEA Weblogic Workshop
FREE Java Enterprise J2EE developer tools!
Get your free copy of BEA WebLogic Workshop 8.1 today.
http://ads.osdn.com/?ad_id=5047alloc_id=10808op=click
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Suggestions required

2004-08-25 Thread Deepayan Sarkar
Quoting Runa Bhattacharjee [EMAIL PROTECTED]:

 On Wed, 2004-08-25 at 17:35, Omi Azad wrote:
 Dear Omi,
  My Vote is below...
  
   host:  (The  is the correct one according to
 Bangla grammer. But 
   we are using  from a long time in many places like
  ;
 
 could you pls clarify where exactly have you been using these spellings?
 
  coz most of us have translated the foreign-origin words with a 
 strictly in view of the bnnotto-sotto bidhan/bn. and this can give
 rise to the already present inconsistencies.

Do you really mean that? Shatta-bidhan would prescribe that any 'sa' before a
'Ta' (or any other letter in the Ta-barga) would become a murdhanya Sha. This
doesn't seem to be what you are implying.

In my opinion, this is completely irrelevant for foreign words. They should be
spelled in the way that has the best chance of emulating their proper
pronunciation. While in practice all the 3 'sa'-s in Bengali almost always have
the same pronunciation (the one originally meant for talabya sha in
devanagari), it makes much much more sense to use dantya sa for the 'sa' sound
(as in cycle). A murdhanya sha is never ever pronounced as 'sa', and it seems
extremely stupid to me to spell words like 'host' and 'station' with a
murdhanya sha, which would suggest to a reader of the bengali versions that
they are supposed to be pronounced as 'hoshT' or 'shTeshon', which is a
misconception we should try not to perpetuate.

Deepayan


---
SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
Save 50% off Retail on Ink  Toner - Free Shipping and Free Gift.
http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] (no subject)

2004-08-10 Thread Deepayan Sarkar
On Tuesday 10 August 2004 02:56, anirban adhikary wrote:
 respected sir

myself is an electronicstelecommunication engineer (completed
 8th semester in 30th july). my key interest area is computer
 networking.what type of skill required to join urs organisation? pls
 ans me.

Read 

http://www.bengalinux.org/wiki/index.php/FAQ

Deepayan


---
SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
Save 50% off Retail on Ink  Toner - Free Shipping and Free Gift.
http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] need info bangla linux live cd

2004-08-06 Thread Deepayan Sarkar
On Friday 06 August 2004 06:51, mi su wrote:
 can u tell me  where i  got a bangla linux live CD ARE YOU RELISTED
 THIS CD ON THE MARKET ?

See http://www.bengalinux.org/wiki/index.php/FAQ


---
This SF.Net email is sponsored by OSTG. Have you noticed the changes on
Linux.com, ITManagersJournal and NewsForge in the past few weeks? Now,
one more big change to announce. We are now OSTG- Open Source Technology
Group. Come see the changes on the new OSTG site. www.ostg.com
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] Re: [Freebangfont-devel] announcement/RFC: new fonts

2004-08-06 Thread Deepayan Sarkar
On Thursday 05 August 2004 15:28, Sayamindu Dasgupta wrote:
 On Thu, 2004-08-05 at 02:03 -0500, Deepayan Sarkar wrote:
  I recently (re)discovered some nice features of fontforge (formerly
  pfaedit) which significantly reduces the development time required
  to create new fonts (at least compared to the way I used to do it).
  Basically this consists of using the 'Expand Stroke' feature in
  fontforge, which has a variety of options leading to different
  styles from the same basic outline.

 Details !!
 Details !!

I have put up my working area at 

http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/Font/Devel/

Read the README, then look at the fonts inside the Skeleton-* 
directories to get a better idea. (And ask me for clarification if 
required, of course.)

Deepayan


---
This SF.Net email is sponsored by OSTG. Have you noticed the changes on
Linux.com, ITManagersJournal and NewsForge in the past few weeks? Now,
one more big change to announce. We are now OSTG- Open Source Technology
Group. Come see the changes on the new OSTG site. www.ostg.com
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


[Ankur-core] announcement/RFC: new fonts

2004-08-05 Thread Deepayan Sarkar

I recently (re)discovered some nice features of fontforge (formerly 
pfaedit) which significantly reduces the development time required to 
create new fonts (at least compared to the way I used to do it). 
Basically this consists of using the 'Expand Stroke' feature in 
fontforge, which has a variety of options leading to different styles 
from the same basic outline.

Anyway, using this approach, I have managed to create a couple of new 
fonts from scratch in about 5 days (starting from the template, of 
course, which takes care of the opentype lookups). The main work, i.e. 
creating the basic glyph shapes, took 3 days, and the rest was mainly 
figuring out suitable parameters (and occasionally booting up Windows 
to see if it liked the results).

I have put up the results at 

http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/Font/Jamrul/JamrulNormal.ttf
http://www.stat.wisc.edu/~deepayan/Bengali/WebPage/Font/Hortuki/HortukiNormal.ttf

They are far from perfect, but good enough as a first draft. I would 
appreciate any testing and feedback. 

I'm happy to supply more details if anyone's interested.

Deepayan


---
This SF.Net email is sponsored by OSTG. Have you noticed the changes on
Linux.com, ITManagersJournal and NewsForge in the past few weeks? Now,
one more big change to announce. We are now OSTG- Open Source Technology
Group. Come see the changes on the new OSTG site. www.ostg.com
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core


Re: [Ankur-core] suggestion needed

2004-07-28 Thread Deepayan Sarkar
On Wednesday 28 July 2004 02:16, Jamil Ahmed wrote:
 What will be the translation for the following msgid?

 _:all weeks\n
 All

 _: \n
 

 or

 _:all weeks\n
 


Neither. It should be just 



(Anything of the form _: ...\n is a context comment that's meant to 
help the translator, it's not supposed to be translated itself.

Deepayan


[Ankur-core] Re: [Ankur-users] how to type ref

2004-06-30 Thread Deepayan Sarkar

[CC-ing to [EMAIL PROTECTED], since my knowledge of keyboard layouts is 
very limited]

On Tuesday 29 June 2004 00:49, you wrote:
 thanx

 actually my net had problem so i can't reply u quickly

 i can't setup the probhat keyboard

Why not? What command are you using to set up the bengali keyboard 
layout? What version is your X? Your layout file looks OK, but maybe 
the command you use to set up the layout doesn't work.

 i have tryed (other story)

 ref problem solved


 i wanna use it but i have tryed [] (frist dha then dot ( ] ) then
 it shows me dhaymindra is it oky

I have never managed to get the default ben layout to work (not sure 
why), so I won't be able to help you there.

 how to main taimn CVS
 i have just start translating sodipodi po file from gnome 2.8 extra

 how could i sure that other r not working with same file
 and documentation about the %c and other bangla words (documentation)

Announce to the gnome-translation list that you are working on it, and 
just submit it there when you are done. Translators don't need to use 
CVS directly.

%c is just part of C (see man 3 printf), keep it as it is. There is some 
simple documentation at 

http://www.bengalinux.org/projects/gnome/guide.php

and I'm sure there is more complete documentation at the GNOME i18m 
website. It's probably easiest to just ask on Ankur's gnome-translation 
list whenever you see something you don't understand.


 and where can i found the ben keyboard layout

Well, you can systematically type all the keys (Q W E R ...) to find 
out.

Deepayan


---
This SF.Net email sponsored by Black Hat Briefings  Training.
Attend Black Hat Briefings  Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
___
Bengalinux-core mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/bengalinux-core