Hi Katie,
you can also use http://wikidata.dbpedia.org/resource/Q544 for linking.
You will get the Wikidata data plus the proper links to DBpedia data,
which is extracted from the article and enriched with taxonomies and links.
We also do a lot of quality control. see this
paperhttp://www.semantic-web-journal.net/content/wikidata-through-eyes-dbpedia
In the future, we are considering propagating the Wikidata IDs as the
main IDs.
From my experience:
If you have a more static set of Ids, e.g. for planets of the Solar
system, chemical elements, countires of the european union, descriptive
IDs are fine.
There might be some change, but it is minimal.
For highly dynamic entities, e.g. names and events and opinions
extracted from text you should definitely use UUIDs
https://en.wikipedia.org/wiki/Universally_unique_identifier
Anything in between can be a mixture, e.g. like an md5 hash over the
most relevant properties.
All the best,
Sebastian
On 31.05.2016 16:47, Katie Frey wrote:
Dear Markus,
Thank you for the insight. We might also try to assign both numeric
and descriptive IDs to a concept. It seems as though best practices
don't really exist in this area, other than the general imperative to
keep the URIs simple and as stable as possible.
Best,
Katie
--
Katie E. Frey
John G. Wolbach Library, Harvard-Smithsonian Center for Astrophysics
60 Garden Street, MS-56, Cambridge, MA 02138
email: kf...@cfa.harvard.edu <mailto:kf...@cfa.harvard.edu> | phone:
617-496-7579
http://astrothesaurus.org | http://library.cfa.harvard.edu/
"Surprising what you can dig out of books if you read long enough,
isn’t it?"
- Rand al'Thor (in Robert Jordan's The Shadow Rising, Book Four of the
Wheel of Time)
"This is insanity!" "No, this is scholarship!"
- Yalb and Shallan (in Brandon Sanderson's Words of Radiance, Book Two
of the Stormlight Archive)
On Thu, May 26, 2016 at 4:06 PM, Markus Kroetzsch
<markus.kroetz...@tu-dresden.de
<mailto:markus.kroetz...@tu-dresden.de>> wrote:
Dear Katie,
DBpedia mostly uses descriptive URIs that are based on the titles
of Wikipedia articles in a specific language. These URIs change if
pages are renamed, but for many concepts, this does not occur so
often. You would probably only notice it if you are using the URIs
for several years.
If you instead want to use numeric IDs based on Wikipedia pages
(or DBpedia URIs), you can take them from Wikidata. These IDs are
stable, but not descriptive. They are kept unique in that they can
only be deleted but not reused. For example,
http://dbpedia.org/page/Solar_System is currently the same as
https://www.wikidata.org/entity/Q544.
The Wikidata URI uses content negotiation to redirect you to the
HTML page if you open it in a browser, and to RDF if you open it
with an RDF crawler. See
https://www.wikidata.org/wiki/Wikidata:Data_access for direct
links to the RDF content.
To manually find out what the Wikidata ID is for a Wikipedia page,
you can go to the Wikipedia page and use the link to "Wikidata
item" on the left. To do this in an automated fashion, you can use
the SPARQL endpoint, e.g., with the query
SELECT *
WHERE {
<https://en.wikipedia.org/wiki/Solar%20System> schema:about ?item .
}
(try it in the Wikidata SPARQL UI: http://tinyurl.com/jlk4fz2)
The Wikidata Web API can also map page titles to IDs for you
prefer JSON over SPARQL:
https://www.wikidata.org/w/api.php?action=wbgetentities&format=json&sites=enwiki&titles=Solar+System&props=
Each of these methods can also be used to fetch many IDs at once.
So basically it is fairly straightforward to translate from
DBpedia URIs to Wikidata URIs. The mapping between the two changes
over time only when DBPedia URIs change their meaning (e.g., if
"Solar System" is renamed to "Solar System (astronomy)" or something).
Best regards,
Markus
On 26.05.2016 20 <tel:26.05.2016%2020>:43, Katie Frey wrote:
Hello,
How are concept IDs handled for DBpedia? It looks like the
concept URIs
are descriptive (i.e. for the concept
http://dbpedia.org/page/Solar_System, the concept ID is
"Solar_System"). Are the descriptive IDs used throughout all
of dbpedia
(back and front end) or are terms ultimately kept unique by using
numeric identifiers?
I've been developing a controlled vocabulary and I would also
like to
use URIs so that my terms can be used with other linked data
schemes.
My group and I have had a lot of discussions regarding the
concept IDs;
some want them to be descriptive, based on the preferred term
for each
concept so that they are human readable but this could cause
problems if
the terms used to describe each concept change over time,
others want
them to be randomly generated so that if the description of a term
drifts over time the URI for the concept will always remain
static.
We are trying to figure out if there are any standards or best
practices
we should be looking towards when it comes to concept IDs. Any
thoughts/comments/justifications would be appreciated.
Best,
Katie
--
Katie E. Frey
John G. Wolbach Library, Harvard-Smithsonian Center for
Astrophysics
60 Garden Street, MS-56, Cambridge, MA 02138
email: kf...@cfa.harvard.edu <mailto:kf...@cfa.harvard.edu>
<mailto:kf...@cfa.harvard.edu <mailto:kf...@cfa.harvard.edu>>
| phone:
617-496-7579 <tel:617-496-7579> <tel:617-496-7579
<tel:617-496-7579>>
http://astrothesaurus.org | http://library.cfa.harvard.edu/
"Surprising what you can dig out of books if you read long
enough, isn’t
it?"
- Rand al'Thor (in Robert Jordan's The Shadow Rising, Book
Four of the
Wheel of Time)
"This is insanity!" "No, this is scholarship!"
- Yalb and Shallan (in Brandon Sanderson's Words of Radiance,
Book Two
of the Stormlight Archive)
------------------------------------------------------------------------------
Mobile security can be enabling, not merely restricting.
Employees who
bring their own devices (BYOD) to work are irked by the
imposition of MDM
restrictions. Mobile Device Manager Plus allows you to control
only the
apps on BYO-devices by containerizing them, leaving personal
data untouched!
https://ad.doubleclick.net/ddm/clk/304595813;131938128;j
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
<mailto:Dbpedia-discussion@lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
--
Markus Kroetzsch
Faculty of Computer Science
Technische Universität Dresden
+49 351 463 38486 <tel:%2B49%20351%20463%2038486>
http://korrekt.org/
------------------------------------------------------------------------------
What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic
patterns at an interface-level. Reveals which users, apps, and protocols are
consuming the most bandwidth. Provides multi-vendor support for NetFlow,
J-Flow, sFlow and other flows. Make informed decisions using capacity
planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
--
All the best,
Sebastian Hellmann
AKSW/KILT research group at Leipzig University
Insitute for Applied Informatics (InfAI) at Leipzig University
DBpedia Association
Events:
* *April 24, 2016* Submission Deadline, SEMANTiCS 2016: Workshop &
Tutorial Proposals
<https://docs.google.com/document/d/1VOWyU9Oo2KOuRh09R9CJeC9aWcSNokDW4yHeuGHICPI/pub>
* *May 10th, 2016* Submission Deadline, SEMANTiCS 2016: Research &
Innovation Papers
<https://docs.google.com/document/d/1bYXVeO-vWwIctJgDv4rS3H5oFv0Yz_tyTo0xQcAEtPU/pub>
* *Sep 12th-15th, 2016* SEMANTiCS 2016, Leipzig <http://semantics.cc/>
Venha para a Alemanha como PhD: http://bis.informatik.uni-leipzig.de/csf
Projects: http://dbpedia.org, http://nlp2rdf.org,
http://linguistics.okfn.org, https://www.w3.org/community/ld4lt
<http://www.w3.org/community/ld4lt>
Homepage: http://aksw.org/SebastianHellmann
Research Group: http://aksw.org
Thesis:
http://tinyurl.com/sh-thesis-summary
http://tinyurl.com/sh-thesis
------------------------------------------------------------------------------
What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic
patterns at an interface-level. Reveals which users, apps, and protocols are
consuming the most bandwidth. Provides multi-vendor support for NetFlow,
J-Flow, sFlow and other flows. Make informed decisions using capacity
planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion