Hi Finn,
you discovered a limitation of my tool. It currently does not support
lexemes, since Wikidata Toolkit has not implemented the RDF export
support for them. I am not even sure if there is a JSON-representation
for lexemes? For now, I simply ignored lexemes. I should mention that
somewhere in the interface.
Also, the "Filter entities" only works for predicates which are wikidata
properties (it uses the WD search API), which is why dct:language and
ontolex:sense do not appear there (even if lexemes were supported).
Regards,
Benno
On 11.12.19 16:43, [email protected] wrote:
Hi Benno,
Thanks for the contribution.
Does your tool work for lexemes and other lexicographic data. When I
view "Filter entities" then I do not see the ability to set properties
such as dct:language and ontolex:sense.
best regards
Finn Årup Nielsen
https://people.compute.dtu.dk/faan/
On 11/12/2019 15:08, Benno Fünfstück wrote:
Hi everyone,
I am happy to announce a new tool I've been working on for the last
few months, WDumper.
The tool is available at https://tools.wmflabs.org/wdumps/.
The idea is to provide a user interface to easily generate RDF dumps
for subsets of the data contained in Wikidata.
As an example, the tool can generate dumps with only english labels
or for a subset of the properties.
The tool is based on Wikidata Toolkit and processes the original JSON
dumps provided by Wikidata.
When you submit a request to create a dump, it will be added to a queue.
The queue is processed in regular intervals (the maximum wait time in
queue is 1h).
You can view a list of created dumps on
https://tools.wmflabs.org/wdumps/dumps.
The generated dump can either be downloaded directly or uploaded to
Zenodo for archival, which also generates a DOI for easy referencing
in scientific publications.
I want to thank Prof. Dr. Markus Krötzsch for the original idea for
this tool and support during the development of the tool.
If you have any questions, feel free to ask them by mail or create an
issue on the GitHub page: https://github.com/bennofs/wdumper. The
current version does not have a lot of features yet, so ideas for
extending the tool with additional filters or options that you'd like
to use are valuable feedback as well.
Also a small word of caution: while I did of course test the tool,
the Wikidata data model is quite complex. Since the tool is new, bugs
are more likely, so always apply a sanity check to the results.
If you find bugs, please tell me or create an issue on GitHub.
Regards,
Benno Fünfstück
_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata
_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata
_______________________________________________
Wikidata mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata