I'm looking at the Wikipedia data dumps and I wonder if anyone familiar with
the format knows if the "id" field is a constant never-ever changing field?
The data dump I use is:
http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
And the XML contains something like this:
---------
<page>
<title>AccessibleComputing</title>
<id>10</id>
<redirect />
<revision> ... </revision>
....
</page>
---------
My question is:
Is the "<id>" value constant?
Will the same Wikipedia page always have the same numeric "id" value?
Especially since pages can be deleted, created etc, can one trust that the
mapping from a Wikipedia page name (such as "AccessibleComputing" above) will
always remain the same ( "10" in the example above)?
Thanks
/Omid
------------------------------------------------------------------------------
Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
Be part of this innovative community and reach millions of netbook users
worldwide. Take advantage of special opportunities to increase revenue and
speed time-to-market. Join now, and jumpstart your future.
http://p.sf.net/sfu/intel-atom-d2d
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion