I'm looking at the Wikipedia data dumps and wonder whether anyone familiar
with the format knows if the "id" field is stable, i.e. never changes for a
given page?

The data dump I use is:
http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2

And the XML contains something like this:
---------
  <page>
    <title>AccessibleComputing</title>
    <id>10</id>
    <redirect />
    <revision> ... </revision>
    ....
  </page>
---------
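
For context, here is a minimal sketch of how the title/id pair could be read
from XML shaped like the sample above, using only Python's standard library.
The function names, the inline SAMPLE data and the revision id in it are my
own illustrative assumptions, not something taken from the dump itself. Note
that it only reads the <id> that is a direct child of <page>, so an <id>
nested inside <revision> is not mistaken for the page id.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop an XML namespace prefix such as '{http://...}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def iter_title_ids(stream):
    """Yield (title, page_id) for every <page> element in the stream."""
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title = page_id = None
        # Only direct children of <page> are inspected here.
        for child in elem:
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "id":
                page_id = int(child.text)
        yield title, page_id
        elem.clear()  # release the subtree once the caller resumes iteration


# Hypothetical sample modelled on the snippet above; the revision id is made up.
SAMPLE = b"""<mediawiki>
  <page>
    <title>AccessibleComputing</title>
    <id>10</id>
    <redirect />
    <revision><id>12345</id></revision>
  </page>
</mediawiki>"""

pairs = list(iter_title_ids(io.BytesIO(SAMPLE)))
print(pairs)
```

For the real dump, the stream could presumably be opened with
bz2.open(path, "rb") instead of io.BytesIO, so the file never has to be
decompressed on disk.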

My question is:
Is the "<id>" value constant?
Will the same Wikipedia page always have the same numeric "id" value?
Since pages can be deleted, re-created, etc., can one trust that the mapping
from a Wikipedia page title (such as "AccessibleComputing" above) to its id
("10" in the example above) will always remain the same?

Thanks
/Omid



                                          