Hi Tibor,

Tibor Simko <[email protected]> wrote:
> 
> On Mon, 01 Sep 2014, Ferran Jorba wrote:
>> May I know how does it handle Catalan middle dot character, used between
>> two l (that is, «l·l»), for words like «paral·lel» or «cal·ligrafia»?
[...]

> Perhaps the middle dot can be always stripped away before
> indexing/searching/comparing terms?  Are there cases like "foo·bar·baz"
> and "foo·barbaz" and "foobar·baz" meaning three different things,

No, it is just phonetics and orthography, no semantics (writing
«parallel» or «paralel» instead of «paral·lel» is just an ortographic
error, there is no word in the first two forms).  It has been a long
philological entertainment among us to discuss whether it is a single,
double or triple character sign; yes, it seems almost a theological
discussion, and it even has its own web site (http://www.l·l.cat/).  So
far, I don't known of any casualty, though ;-)

> yet having the same dotless ASCII transliteration?  If not, perhaps
> one could always treat 'l·l' as 'll'?

Yes, that would be great, as long as it is treated consistently in the
index and the user input box, of course.  If a special case case to be
done, it would be user friendly to take the second, incorrect form («•»)
as equivalent, for those who are using software that produces it ;-)

Thanks again,

Ferran

Reply via email to