Hi,

in DBpedia, the URI encoding scheme tries to be as close as possible
to the one used in Wikipedia (see also
http://dbpedia.org/UriEncoding). That is why '(' is encoded '%28' etc.

Inconsistencies mainly stem from external data sources that were not
encoded properly.

In the example
  http://dbpedia.org/page/Republican_Party_%28United_States%29
  http://dbpedia.org/page/Republican_Party_(United_States)
the existence of the second is a bug. The URIs used in the YAGO dump
were not properly encoded before loading (as you can see this resource
only has YAGO properties). This will be fixed in the next release.

As it was already pointed out by Tom, decoded '#' can cause problems
and should therefore be avoided.
http://dbpedia.org/resource/Midfielder#Winger is in fact
http://dbpedia.org/resource/Midfielder.

The reason why http://dbpedia.org/resource/Midfielder%23Winger is an
own resource is because there are infoboxes of players that link to
the Winger part of the Midfielder page. Linking to Midfielder would
therefore be incorrect or at least imprecise.

Best,
Max

On Thu, May 12, 2011 at 19:31, Rupert Westenthaler
<[email protected]> wrote:
> Hi Tom
>
> On Thu, May 12, 2011 at 7:01 PM, Tom Morris <[email protected]> wrote:
>> I agree it'd be nice to fix the encoding of parentheses, but with regard to 
>> #:
>>
>>> A similar thing is also the case for URLs that include an "#"
>>> e.g.
>>>  http://dbpedia.org/resource/Midfielder%23Winger is an own Resource
>>>  http://dbpedia.org/resource/Midfielder#Winger returns the contents
>>> for Midfielder
>>
>> The number sign is the URI fragment separator and most clients will
>> only send the left-hand portion to the server (certainly that's true
>> for browsers).  The reason you're seeing the content for Midfielder is
>> because that's what you requested.  The "#Winger" part never makes it
>> to the server.
> I agree. I just noted it because for some URLs there actually exists
> an own resource when using "%23" instead of "#" (see the above example).
>
>
> --
> | Rupert Westenthaler             [email protected]
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>
> ------------------------------------------------------------------------------
> Achieve unprecedented app performance and reliability
> What every C/C++ and Fortran developer should know.
> Learn how Intel has extended the reach of its next-generation tools
> to help boost performance applications - inlcuding clusters.
> http://p.sf.net/sfu/intel-dev2devmay
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>

------------------------------------------------------------------------------
EditLive Enterprise is the world's most technically advanced content
authoring tool. Experience the power of Track Changes, Inline Image
Editing and ensure content is compliant with Accessibility Checking.
http://p.sf.net/sfu/ephox-dev2dev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to