Some thoughts...
On Fri, Jun 1, 2012 at 5:29 PM, Jonas Brekle <[email protected]> wrote:
> regarding the forms: currently that is not part of the dataset yet. And
> unfortunatley its not very easy to add it. I think it even would require
> some enhancement to the extractor (not just the config). But its on my
> todo list...
> However such "boxes" of word forms are probably easier to extract with
> the default DPpedia infobox extractor. Maybe the DBpedia community could
> help with that. The biggest problem there would be to determine the
> right "context" (i.e. the subject URI)...
I think you don't really need to enhance your extractor. Just run the
DBpedia MappingExtractor in addition. You could do the following:
- set up a mappings wiki for DBpedia Wiktionary (1)
- add a mapping for {{Deutsch Substantiv Übersicht}} to the mappings wiki
- during the DBpedia Wiktionary extraction, also run a
MappingExtractor instance that uses the mappings from your mappings
wiki
(1) or add namespaces to the existing mappings wiki - although it's
getting a bit crowded as far as namespaces are concerned :-)
As far as I can tell, DBpedia Wiktionary currently only has subject
URIs for words from en.wiktionary.org, right? So you'd probably have
to add URIs like http://de.wiktionary.dbpedia.org/resource/Haus.
I don't know if properties like "Nominativ Singular=das Haus" should
be extracted as URIs or as literals.
Just my 0.02€... I don't know much about DBpedia Wiktionary. Of
course, we could somehow work this into the main DBpedia extraction,
but I think the solution outlined above makes more sense.
Cheers,
JC
> i crossposted this to DBpedia, so they can reply
>
> Regards,
> Jonas
>
> Am Freitag, den 01.06.2012, 12:08 +0200 schrieb Lars Aronsson:
>> On 2012-05-31 12:42, Gerd Zechmeister wrote:
>> > I'd like to extract German noun forms (Kasus and Numerus) but didn't find
>> > this data in the provided dumps.
>> >
>> > Example: http://de.wiktionary.org/wiki/Haus
>> >
>> > I need the data from the box:
>> > Kasus Singular Plural
>> > Nominativ das Haus die Häuser
>>
>> This is provided in the wiki template call
>>
>> {{Deutsch Substantiv Übersicht
>> |...
>> |Nominativ Singular=das Haus
>> |Nominativ Plural=die Häuser
>> ...
>>
>> That you find in this XML dump (only 50 MB compressed),
>> http://dumps.wikimedia.org/dewiktionary/20120526/dewiktionary-20120526-pages-articles.xml.bz2
>>
>> An old Perl script for parsing the XML dumps is found here,
>> http://meta.wikimedia.org/wiki/User:LA2/Extraktor
>>
>>
>
>
>
> ------------------------------------------------------------------------------
> Live Security Virtual Conference
> Exclusive live event will cover all the ways today's security and
> threat landscape has changed and how IT managers can respond. Discussions
> will include endpoint security, mobile security and the latest in malware
> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion