It would really help if Wikipedia (and dbpedia) redirect to the canonical
URL for a page, rather than serving up a 200.

In Wikipedia links, they never percent-encode brackets.

This is the ruby code that I use to convert page titles to URIs:

  def escape_title(title)
    URI::escape(title.gsub(' ','_'), ' ?#%"+=')
  end


nick.


On 05/09/2011 15:42, "Yves Raimond" <[email protected]> wrote:

> It looks like it is an encoding-related issue -
> http://dbpedia.org/page/John_Paul_Jones_(musician) gives a 404, and
> http://dbpedia.org/page/John_Paul_Jones_%28musician%29 gives a 200 -
> so it's probably my fault!
> 
> y
> 
> On Mon, Sep 5, 2011 at 3:38 PM, Yves Raimond <[email protected]> wrote:
>> Ok - I am *really* confused now - I promise this page didn't exist 10
>> minutes ago! (I have logs to prove it, in case it's needed)
>> 
>> y
>> 
>> On Mon, Sep 5, 2011 at 3:35 PM, Pablo Mendes <[email protected]> wrote:
>>> I cannot reproduce.
>>> This works for me:
>>> http://dbpedia.org/page/John_Paul_Jones_%28musician%29
>>> Cheers,
>>> Pablo
>>> 
>>> On Mon, Sep 5, 2011 at 3:55 PM, Yves Raimond <[email protected]> wrote:
>>>> 
>>>> Hello!
>>>> 
>>>> I spotted a few missing DBpedia URIs, both on the currently live
>>>> dataset and on DBpedia live, for example:
>>>> http://en.wikipedia.org/wiki/John_Paul_Jones_%28musician%29 should
>>>> exist at http://dbpedia.org/page/John_Paul_Jones_%28musician%29, but
>>>> there is nothing there.
>>>> 
>>>> What could explain that some Wikipedia pages are left off the
>>>> extraction process? This page, in particular, is a bit odd: it has
>>>> been in existence since 2010, and has a very detailed infobox.
>>>> 
>>>> On that note, I was wondering if the DBpedia team ever considered
>>>> using persistent URIs for DBpedia terms - we are using DBpedia to tag
>>>> programmes at the BBC, and the fluctuation in DBpedia URIs is very
>>>> hard to deal with. There are persistent identifiers accessible through
>>>> the Wikipedia API that would be much more useful for keying DBpedia
>>>> URIs than ever-changing URL slugs. DBpedia Lite [0] uses those and as
>>>> a result has very stable URIs.
>>>> 
>>>> Best,
>>>> y
>>>> 
>>>> [0] http://dbpedialite.org/
>> 


nick.


http://www.bbc.co.uk/
This e-mail (and any attachments) is confidential and may contain personal 
views which are not the views of the BBC unless specifically stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in reliance on 
it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.
                                        

------------------------------------------------------------------------------
Special Offer -- Download ArcSight Logger for FREE!
Finally, a world-class log management solution at an even better 
price-free! And you'll get a free "Love Thy Logs" t-shirt when you
download Logger. Secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsisghtdev2dev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to