Hello,

Another problematic character is '&', which is valid in the path component of
a URL. For example:

http://en.wikipedia.org/wiki/Mumford_&_Sons
http://dbpedia.org/page/Mumford_%26_Sons

Thanks,

nick.


On 03/08/2010 17:17, "Robert Isele" <[email protected]> wrote:

> Hi,
> 
> we try to be as close as possible to the Wikipedia title encoding scheme.
> Unfortunately, we mistakenly encoded the "," character, thanks for
> the hint. This behavior will be fixed in the next release.
> 
> The current behavior is as follows:
> - The alphanumeric characters "a" through "z", "A" through "Z" and "0"
> through "9" remain the same.
> - The special characters ".", "-", "*", "/", ":" and "_" remain the same.
> - The space character " " is converted into an underscore "_".
> - All other characters are unsafe and are first converted into one or
> more bytes using UTF-8 encoding. Then each byte is represented by the
> 3-character string "%xy", where xy is the two-digit hexadecimal
> representation of the byte.
> - Furthermore, multiple underscores are collapsed into one.
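The rules listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code DBpedia actually runs: the function name encode_title is hypothetical, the order of the steps (underscores collapsed before percent-encoding) is a guess, and uppercase hex digits are assumed because the examples in this thread use %2C and %26.

```python
import re

# Characters left unescaped per the list above:
# ASCII letters and digits, plus . - * / : _
SAFE = set(
    "abcdefghijklmnopqrstuvwxyz"
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "0123456789"
    ".-*/:_"
)


def encode_title(title: str) -> str:
    """Hypothetical helper: encode a Wikipedia page title per the rules above."""
    # Spaces become underscores, then runs of underscores collapse to one.
    # Assumption: this happens before percent-encoding.
    title = re.sub(r"_+", "_", title.replace(" ", "_"))
    out = []
    for ch in title:
        if ch in SAFE:
            out.append(ch)
        else:
            # Any other character: UTF-8 bytes, each written as %XY
            # (uppercase hex assumed).
            out.extend(f"%{b:02X}" for b in ch.encode("utf-8"))
    return "".join(out)


print(encode_title("Mumford & Sons"))
```

Because "," and "&" are not in the safe set, this sketch reproduces the current behavior described above, including the comma escaping that is due to be fixed.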
> 
> Cheers,
> Robert
> 
> On Thu, Jul 29, 2010 at 3:37 PM, Nicholas Humfrey
> <[email protected]> wrote:
>> 
>> Hello,
>> 
>> I am trying to construct dbpedia URIs from Wikipedia page titles, but
>> dbpedia seems to be escaping more characters than are required by RFC3986 -
>> for example ',' is encoded to %2C.
>> 
>> What is doing the escaping? Is there a list of characters that are escaped
>> in titles?
>> 
>> Thanks,
>> 
>> nick.
>> 
>> 
>> http://www.bbc.co.uk/
>> This e-mail (and any attachments) is confidential and may contain personal
>> views which are not the views of the BBC unless specifically stated.
>> If you have received it in error, please delete it from your system.
>> Do not use, copy or disclose the information in any way nor act in reliance
>> on it and notify the sender immediately.
>> Please note that the BBC monitors e-mails sent or received.
>> Further communication will signify your consent to this.
>> 
>> 
>> 
>> ------------------------------------------------------------------------------
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion


