Hello Sebastian,
Thanks for the help. Please see my comments/questions below.
On Fri, Jun 14, 2013 at 8:09 AM, Sebastian Hellmann <
[email protected]> wrote:
> Am 13.06.2013 23:21, schrieb Fabien Snauwaert:
>
> Hi,
>
> I'm quite new to the world of DBpedia and SPARQL endpoints, so please
> feel free to point me in the right direction if I could find the answer to
> those questions elsewhere (Google, StackOverflow and SemanticWeb didn't
> turn any result.)
>
> I've got two simple questions on Wiktionary.DBpedia.
>
> 1) ENGLISH VS. OTHER SUPPORTED LANGUAGES
>
> The Wiktionary DBpedia project page states the currently available
> languages as "English, German, French, Russian". However, the lexical
> entries consistently only point to dbpedia-en-* resources, here are a few
> examples:
>
> http://wiktionary.dbpedia.org/page/%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%B0
> http://wiktionary.dbpedia.org/page/manger
> http://wiktionary.dbpedia.org/page/test
>
> ie. : no dbpedia-ru-*, dbpedia-fr-* anywhere in there. Which makes it look
> like only the English Wiktionary pages are actually connected to the
> project. What am I missing here and how can I check the extent of support
> for each of the currently available languages? My goal is to extract data
> from ru.wiktionary pages, through DBpedia.
>
>
> I am not sure, what you mean.
> http://wiktionary.dbpedia.org/page/manger-French-Noun-1ru for example is
> from the Russian manger page:
> http://ru.wiktionary.org/wiki/manger
>
My point is that this query returns results from en.wiktionary data
(judging from the "@en" tag in the results) meanwhile I would expect them
to come from ru.wiktionary instead (because of the "@ru" tag in the query):
# EXTRACT THE PRONUNCIATION OF A RUSSIAN WORD - Mind the @ru
PREFIX dc: <http://purl.org/dc/elements/1.1/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX wt:<http://wiktionary.dbpedia.org/terms/>
SELECT DISTINCT ?spell ?pronounce
WHERE {
?spell rdfs:label "manger"@ru;
wt:hasLangUsage ?use .
?use dc:language wt:French;
wt:hasPronunciation ?pronounce .
}
# Results: Mind the @en
# spell pronounce
# http://wiktionary.dbpedia.org/resource/manger "/mɑ̃ʒe/,"@en
# http://wiktionary.dbpedia.org/resource/manger "[mɑ̃ː.ˈʒe]"@en
# http://wiktionary.dbpedia.org/resource/manger "mangeai"@en
It may seem trivial on such an example, but on the below query, it is quite
significant. Trying to extract the pronunciation (IPA transcription) of a
word from ru.wikpedia rather than en.wikipedia (because en.wikpedia misses
results that ru.wikipedia does have):
# EXTRACT THE PRONUNCIATION OF A RUSSIAN WORD
PREFIX dc: <http://purl.org/dc/elements/1.1/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX wt:<http://wiktionary.dbpedia.org/terms/>
SELECT DISTINCT ?spell ?pronounce
WHERE {
?spell rdfs:label "бы"@ru;
wt:hasLangUsage ?use .
?use dc:language wt:Russian;
wt:hasPronunciation ?pronounce .
}
# Returns an empty result
# Meanwhile en.wikipedia does not have the pronunciation (IPA) for that
word http://en.wiktionary.org/wiki/%D0%B1%D1%8B
# While, ru.wikipedia does have it
http://ru.wiktionary.org/wiki/%D0%B1%D1%8B
Could you please clarify if en.wiktionary data has a special role inside of
DBpedia as opposed to other languages? (or explain the above results to me)
> 2) SPECIFICATIONS ON AVAILABLE WIKTIONARY DATA
>
>
> Is all of the data from a Wiktionary page supposed to be available through
> Wiktionary.DBpedia?
>
> For example, I can see the definition for the word машина in Russian here
> http://en.wiktionary.org/wiki/%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%B0 ("1.
> machine 2. engine 3. mechanism", etc.) but not there
> http://wiktionary.dbpedia.org/page/%D0%BC%D0%B0%D1%88%D0%B8%D0%BD%D0%B0-Russian
>
> Where can I find a list of Wiktionary sections that are officially
> supported by the Wiktionary DBpedia project? Also, where can I find the age
> of the data available on Wiktionary.DBpedia?
>
>
> Of course there are holes. The dump is a little bit outdated. You could
> rerun them here: https://github.com/dbpedia/dbpedia-wiktionary
> Actually, I should update some of the documentation and do a release
> today.
> Any data or improvements you are gladly accepted.
> All the best,
> Sebastian
>
Thanks!
>
>
> Thank you for the help. I look forward to using and contributing to the
> project.
>
> Sincerely,
> Fabien
>
>
>
> ------------------------------------------------------------------------------
> This SF.net email is sponsored by Windows:
>
> Build for Windows Store.
> http://p.sf.net/sfu/windows-dev2dev
>
>
>
> _______________________________________________
> Dbpedia-discussion mailing
> [email protected]https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
> --
> Dipl. Inf. Sebastian Hellmann
> Department of Computer Science, University of Leipzig
> Events: NLP & DBpedia 2013 (http://nlp-dbpedia2013.blogs.aksw.org,
> Deadline: *July 8th*)
> Venha para a Alemanha como PhD: http://bis.informatik.uni-leipzig.de/csf
> Projects: http://nlp2rdf.org , http://linguistics.okfn.org ,
> http://dbpedia.org/Wiktionary , http://dbpedia.org
> Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
> Research Group: http://aksw.org
>
------------------------------------------------------------------------------
This SF.net email is sponsored by Windows:
Build for Windows Store.
http://p.sf.net/sfu/windows-dev2dev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion