Not a Bangla speaker, but they look like typos to me too. Only certain vowel diacritics double up in Indic languages (e.g. anusvaras). I'm not sure how you would even pronounce such sounds. I suppose such combinations of diacritics could be used to represent diphthongs in words from other languages, but some of these diphthongs already exist in the regular script.

I found things like this[1] on Wikisource, which seems to be OCR output from some really garbled text. The text does indeed seem to have additional vowel diacritics, but that could also be a scanning glitch. The same word appears twice in the document, but once in the text. Another sequence I found in [2][3] seems to only happen when the text is really garbled. All of these documents have random Latin stuff interspersed in the OCR, and sometimes Devanagari. [2] even has a Han character at the end.

I think it's just an OCR algorithm handling garbled Bangla text poorly. Such an algorithm might have a tendency to produce certain specific invalid sequences like the ones listed in your email.

Might want to double-check with a native Bangla speaker.

Thanks,
-Manish

[1]: https://bn.wikisource.org/wiki/%E0%A6%AA%E0%A6%BE%E0%A6%A4%E0%A6%BE:%E0%A6%B0%E0%A6%BE%E0%A6%AE%E0%A6%BE%E0%A6%AF%E0%A6%BC%E0%A6%A3%E0%A6%AE%E0%A7%8D%E2%80%8C_-_%E0%A6%AA%E0%A6%9E%E0%A7%8D%E0%A6%9A%E0%A6%BE%E0%A6%A8%E0%A6%A8_%E0%A6%A4%E0%A6%B0%E0%A7%8D%E0%A6%95%E0%A6%B0%E0%A6%A4%E0%A7%8D%E0%A6%A8.pdf/%E0%A7%A7%E0%A7%A9%E0%A7%A7%E0%A7%A7
[2]: https://bn.wikisource.org/wiki/%E0%A6%AA%E0%A6%BE%E0%A6%A4%E0%A6%BE:%E0%A6%AC%E0%A6%BF%E0%A6%B6%E0%A7%8D%E0%A6%AC%E0%A6%95%E0%A7%8B%E0%A6%B7_%E0%A6%A8%E0%A6%AC%E0%A6%AE_%E0%A6%96%E0%A6%A3%E0%A7%8D%E0%A6%A1.djvu/%E0%A7%AD%E0%A7%AD%E0%A7%A6
[3]: https://bn.wikisource.org/wiki/%E0%A6%AA%E0%A6%BE%E0%A6%A4%E0%A6%BE:%E0%A6%B6%E0%A6%BF%E0%A6%95%E0%A7%8D%E0%A6%B7%E0%A6%BE%E0%A6%AC%E0%A6%BF%E0%A6%A7%E0%A6%BE%E0%A6%AF%E0%A6%BC%E0%A6%95_%E0%A6%AA%E0%A7%8D%E0%A6%B0%E0%A6%B8%E0%A7%8D%E0%A6%A4%E0%A6%BE%E0%A6%AC.pdf/%E0%A7%A7%E0%A7%AD%E0%A7%AE

On Tue, Feb 7, 2017 at 10:08 AM, Eric Muller wrote:
> In looking at the wiki{pedia,book.source,tionary} corpus for Bengali, I see a
> relatively large number of syllables with <... 09BF 09BE> or <... 09BF
> 09C0>. I checked a couple of sources, and I did not find them listed
> anywhere as being normally used.
>
> Are they in normal use or are those all typos?
>
> I did not find any occurrence in the Assamese corpus.
>
> Thanks,
> Eric.
>
> The syllables (o is the number of occurrences):
>
> <string s='কিী' o='198'/>
> <string s='ক্তিা' o='262'/>
> <string s='ক্রিা' o='447'/>
> <string s='ক্রিী' o='77'/>
> <string s='ক্লিা' o='245'/>
> <string s='ক্ষিী' o='161'/>
> <string s='ক্সিা' o='138'/>
> <string s='খিা' o='949'/>
> <string s='গিা' o='2671'/>
> <string s='গিী' o='250'/>
> <string s='গ্নিা' o='57'/>
> <string s='গ্নিী' o='110'/>
> <string s='গ্রিা' o='143'/>
> <string s='ঘিা' o='83'/>
> <string s='ঙ্কিা' o='403'/>
> <string s='ঙ্গিা' o='267'/>
> <string s='ঙ্গিী' o='150'/>
> <string s='চিা' o='905'/>
> <string s='চিী' o='135'/>
> <string s='চ্চিা' o='91'/>
> <string s='চ্ছিা' o='323'/>
> <string s='ছিা' o='712'/>
> <string s='ছিী' o='61'/>
> <string s='জিা' o='527'/>
> <string s='জিী' o='140'/>
> <string s='জ্জিা' o='56'/>
> <string s='ঝিা' o='81'/>
> <string s='ঞিা' o='71'/>
> <string s='ঞ্চিা' o='175'/>
> <string s='ঞ্জিা' o='270'/>
> <string s='ঞ্জিী' o='316'/>
> <string s='টিা' o='807'/>
> <string s='টিী' o='586'/>
> <string s='ঠিা' o='549'/>
> <string s='ঠিী' o='89'/>
> <string s='ড়িা' o='1361'/>
> <string s='ড়িী' o='135'/>
> <string s='ডিা' o='257'/>
> <string s='ঢ়িা' o='71'/>
> <string s='ণিা' o='354'/>
> <string s='তিী' o='270'/>
> <string s='তি্যু' o='75'/>
> <string s='ত্তিা' o='143'/>
> <string s='ত্তিী' o='144'/>
> <string s='ত্ত্বিা' o='54'/>
> <string s='ত্বিা' o='72'/>
> <string s='ত্মিা' o='161'/>
> <string s='ত্যিা' o='129'/>
> <string s='ত্রিা' o='217'/>
> <string s='ত্রিী' o='264'/>
> <string s='ত্ৰিা' o='102'/>
> <string s='থিা' o='290'/>
> <string s='থিী' o='127'/>
> <string s='দিী' o='514'/>
> <string s='দ্ধিা' o='228'/>
> <string s='দ্বিা' o='505'/>
> <string s='দ্বিী' o='121'/>
> <string s='দ্যিা' o='53'/>
> <string s='ধিী' o='235'/>
> <string s='নিী' o='551'/>
> <string s='ন্তিা' o='100'/>
> <string s='ন্ত্রিা' o='93'/>
> <string s='ন্ত্রিী' o='171'/>
> <string s='ন্দিা' o='102'/>
> <string s='ন্দ্রিা' o='238'/>
> <string s='ন্দ্রিী' o='79'/>
> <string s='ন্ধিা' o='109'/>
> <string s='ন্মিা' o='98'/>
> <string s='পিা' o='1199'/>
> <string s='প্তিা' o='67'/>
> <string s='প্রিা' o='203'/>
> <string s='ফিা' o='174'/>
> <string s='ফ্রিা' o='60'/>
> <string s='বিী' o='715'/>
> <string s='ব্রিা' o='87'/>
> <string s='ভিা' o='908'/>
> <string s='ভিী' o='80'/>
> <string s='মিী' o='373'/>
> <string s='ম্পিা' o='55'/>
> <string s='ম্বিা' o='117'/>
> <string s='ম্মিা' o='67'/>
> <string s='যিা' o='204'/>
> <string s='রিা' o='4703'/>
> <string s='র্ণিা' o='55'/>
> <string s='র্তিী' o='56'/>
> <string s='র্বিা' o='105'/>
> <string s='র্মিা' o='68'/>
> <string s='র্মিী' o='70'/>
> <string s='র্ষিা' o='65'/>
> <string s='লিী' o='419'/>
> <string s='ল্পিী' o='113'/>
> <string s='শিী' o='216'/>
> <string s='শ্বিা' o='145'/>
> <string s='ষিা' o='376'/>
> <string s='ষ্টিা' o='269'/>
> <string s='ষ্ট্যিা' o='75'/>
> <string s='ষ্ঠিী' o='99'/>
> <string s='সিা' o='760'/>
> <string s='সিী' o='117'/>
> <string s='স্কিা' o='106'/>
> <string s='স্ট্রিী' o='157'/>
> <string s='স্তিা' o='311'/>
> <string s='স্তিী' o='50'/>
> <string s='স্থিা' o='1946'/>
> <string s='স্বিা' o='97'/>
> <string s='স্মিা' o='74'/>
> <string s='হিী' o='424'/>
> <string s='হ্যিা' o='89'/>
> <string s='ৰিী' o='204'/>
> <string s='ৰ্ত্তিা' o='125'/>
> <string s='ৰ্ত্তিী' o='118'/>
> <string s='ৰ্ম্মিা' o='58'/>
> <string s='ৱিা' o='264'/>
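
For anyone who wants to reproduce this kind of count on their own text, here is a minimal Python sketch. It is not the tooling used to produce the list above (that is not shown in this thread); the names `PATTERN` and `double_vowel_sign_clusters` are made up for illustration, and the character ranges are my own approximation of "Bengali consonant" and "dependent vowel sign". It only looks for a consonant cluster followed by two or more vowel signs, so it would catch sequences like <... 09BF 09BE> and <... 09BF 09C0> but not the vowel-sign-then-virama entry in the list. The text is normalized to NFC first so that a decomposed U+09C7 U+09BE (which is just U+09CB) is not reported as a doubled sign.

```python
import re
import unicodedata
from collections import Counter

# Assumption: treat these code points as Bengali consonants. The range
# U+0995..U+09B9 includes a few unassigned code points, which is harmless here.
CONSONANT = "[\u0995-\u09B9\u09DC\u09DD\u09DF\u09F0\u09F1]\u09BC?"  # optional nukta
VIRAMA = "\u09CD"
# Dependent vowel signs: U+09BE..U+09C4, U+09C7, U+09C8, U+09CB, U+09CC.
VOWEL_SIGNS = "[\u09BE-\u09C4\u09C7\u09C8\u09CB\u09CC]"

# A consonant cluster (consonants joined by virama) followed by 2+ vowel signs.
PATTERN = re.compile(f"(?:{CONSONANT}{VIRAMA})*{CONSONANT}{VOWEL_SIGNS}{{2,}}")


def double_vowel_sign_clusters(text):
    """Count clusters that carry two or more consecutive vowel signs."""
    # NFC composes U+09C7 U+09BE into U+09CB, avoiding false positives.
    text = unicodedata.normalize("NFC", text)
    return Counter(match.group(0) for match in PATTERN.finditer(text))


if __name__ == "__main__":
    sample = "\u0997\u09BF\u09BE \u0995\u09BF \u0995\u09CD\u09B0\u09BF\u09BE"
    for cluster, count in double_vowel_sign_clusters(sample).most_common():
        codepoints = " ".join(f"{ord(ch):04X}" for ch in cluster)
        print(f"<string s='{cluster}' o='{count}'/>  ({codepoints})")
```

Running this per source (Wikipedia, Wikisource, and so on) rather than over the whole corpus at once would also show whether the hits are concentrated in the OCR'd Wikisource pages.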

