Thanks Bina,

Here is the summary of the chat in case anyone else on this list is curious:

Normally Wikipedia numeric IDs don't change. However, if a page is
deleted, and then recreated fresh (so it has a brand new history),
then it gets a new ID. However, if it gets deleted and then undeleted
(so the page history is intact), then it keeps the same ID.
Pages also keep the same IDs if they get renamed, so if “Barack_Obama”
tomorrow gets renamed “President_Obama” it will keep the same numeric
ID.

Thanks
/Omid


On Fri, Aug 27, 2010 at 2:11 PM, Sabine Cretella <[email protected]> wrote:
> Hi Omid,
>
> they should not change - I will ask the developers for confirmation and send
> you the discussion on IRC in some mins by private mail.
>
> Cheers, Bina
>
>
> On Fri, Aug 27, 2010 at 10:48 PM, Omid Rouhani <[email protected]>
> wrote:
>>
>> I'm looking at the Wikipedia data dumps and I wonder if anyone familiar
>> with the format knows if the "id" field is a constant never-ever changing
>> field?
>>
>> The data dump I use is:
>>
>> http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
>>
>> And the XML contains something like this:
>> ---------
>>   <page>
>>     <title>AccessibleComputing</title>
>>     <id>10</id>
>>     <redirect />
>>     <revision> ... </revision>
>>     ....
>>   </page>
>> ---------
>>
>> My question is:
>> Is the "<id>" value constant?
>> Will the same Wikipedia page always have the same numeric "id" value?
>> Especially since pages can be deleted, created etc, can one trust that the
>> mapping from a Wikipedia page name (such as "AccessibleComputing" above)
>> will always remain the same ( "10" in the example above)?
>>
>> Thanks
>> /Omid
>>
>>
>>
>>
>>
>> ------------------------------------------------------------------------------
>> Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
>> Be part of this innovative community and reach millions of netbook users
>> worldwide. Take advantage of special opportunities to increase revenue and
>> speed time-to-market. Join now, and jumpstart your future.
>> http://p.sf.net/sfu/intel-atom-d2d
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>
>
>
> ------------------------------------------------------------------------------
> Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
> Be part of this innovative community and reach millions of netbook users
> worldwide. Take advantage of special opportunities to increase revenue and
> speed time-to-market. Join now, and jumpstart your future.
> http://p.sf.net/sfu/intel-atom-d2d
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>

------------------------------------------------------------------------------
Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
Be part of this innovative community and reach millions of netbook users 
worldwide. Take advantage of special opportunities to increase revenue and 
speed time-to-market. Join now, and jumpstart your future.
http://p.sf.net/sfu/intel-atom-d2d
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to