Hi, in DBpedia, the URI encoding scheme tries to be as close as possible to the one used in Wikipedia (see also http://dbpedia.org/UriEncoding). That is why '(' is encoded '%28' etc.
Inconsistencies mainly stem from external data sources that were not encoded properly. In the example http://dbpedia.org/page/Republican_Party_%28United_States%29 http://dbpedia.org/page/Republican_Party_(United_States) the existence of the second is a bug. The URIs used in the YAGO dump were not properly encoded before loading (as you can see this resource only has YAGO properties). This will be fixed in the next release. As it was already pointed out by Tom, decoded '#' can cause problems and should therefore be avoided. http://dbpedia.org/resource/Midfielder#Winger is in fact http://dbpedia.org/resource/Midfielder. The reason why http://dbpedia.org/resource/Midfielder%23Winger is an own resource is because there are infoboxes of players that link to the Winger part of the Midfielder page. Linking to Midfielder would therefore be incorrect or at least imprecise. Best, Max On Thu, May 12, 2011 at 19:31, Rupert Westenthaler <[email protected]> wrote: > Hi Tom > > On Thu, May 12, 2011 at 7:01 PM, Tom Morris <[email protected]> wrote: >> I agree it'd be nice to fix the encoding of parentheses, but with regard to >> #: >> >>> A similar thing is also the case for URLs that include an "#" >>> e.g. >>> http://dbpedia.org/resource/Midfielder%23Winger is an own Resource >>> http://dbpedia.org/resource/Midfielder#Winger returns the contents >>> for Midfielder >> >> The number sign is the URI fragment separator and most clients will >> only send the left-hand portion to the server (certainly that's true >> for browsers). The reason you're seeing the content for Midfielder is >> because that's what you requested. The "#Winger" part never makes it >> to the server. > I agree. I just noted it because for some URLs there actually exists > an own resource when using "%23" instead of "#" (see the above example). > > > -- > | Rupert Westenthaler [email protected] > | Bodenlehenstraße 11 ++43-699-11108907 > | A-5500 Bischofshofen > > ------------------------------------------------------------------------------ > Achieve unprecedented app performance and reliability > What every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools > to help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > _______________________________________________ > Dbpedia-discussion mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > ------------------------------------------------------------------------------ EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
