I've been looking at the id structure of dbpedia and wikipedia and 
finally found an example where case sensitivity issues really bite.

Cases like this with a "redirect" are a little obnoxious,

http://en.wikipedia.org/wiki/New_York_City
http://en.wikipedia.org/wiki/New_york_city

largely because there isn't a redirect...  The same page gets displayed 
at each URL. (Ok,  the "redirect" has a little extra stuff at the top 
saying that's a redirect)

dbpedia has separate resource pages for the above cases,  so at least 
it's explaining the situation clearly -- reasoning systems that work 
with dbpedia need to be able to read this.

Here's a case that's just plain bad...

http://en.wikipedia.org/wiki/Direct_instruction
http://en.wikipedia.org/wiki/Direct_Instruction

Last time I looked there were about 10,000 wikipedia urls that varied 
only by case.  In this particular one,  it's two articles about the same 
topic,  but there could be some cases where the two articles are about 
something different.

_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to