On Dec 12, 2010, at 6:16 PM, Anand Chitipothu wrote:

>> Are you systematicly deleting all unicodes ?
>
> Not at all. Open Library is a multi-lingual catalog.

I don't know how hard it would be to do, but it would be nice to have  
searches that were more accommodating to accented and un-accented  
variations of letters.

For example, searching for "Gabriel Faure" does not  bring back  
"Gabriel Fauré".  In this case, there are two people with this name.   
One reference I found said that the travel author should be written  
without the accent, and the music composer should be written with the  
accent.  Not surprisingly, many or most of the imported records for  
the composer lacked the accent on the e, making a nice little mess.

It is not too hard to unravel after you investigate the two people  
(though I don't know if that one reference was even correct), but the  
casual searcher for the composer is going to be greatly surprised  
when they only find the travel author.

I also found a duplicated author with accented letters, where one  
entry had the forward tick accent and another had the backward tick  
accent (I forget what those are called, grave accent and something  
else?).   If source catalogers got that wrong, casual searchers are  
going to get it confused also.

- Alan

_______________________________________________
Ol-tech mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
To unsubscribe from this mailing list, send email to 
[email protected]

Reply via email to