Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-26 Thread Luigi Assom
on list for the Wikidata project." < > wikidata@lists.wikimedia.org> > Subject: Re: [Wikidata] Kickstartet: Adding 2.2 million German > organisations to Wikidata > > Laura, > > Talk to OpenCorporates and ask those questions yourself. > Get involved ! :) > > &g

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-26 Thread Laura Morales
quot; all over again, this time with data instead of software?     Sent: Wednesday, October 25, 2017 at 5:06 PM From: "Thad Guidry" <thadgui...@gmail.com> To: "Discussion list for the Wikidata project." <wikidata@lists.wikimedia.org> Subject: Re: [Wikidata] Kickstartet:

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-25 Thread Thad Guidry
Laura, Talk to OpenCorporates and ask those questions yourself. Get involved ! :) -Thad +ThadGuidry On Wed, Oct 25, 2017 at 3:22 AM Laura Morales wrote: > Is there any RDF dump available of OpenCorporates data? Or even any dump > at

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-25 Thread Jakob Voß
Hi Luigi, I favour cooperation with OpenCorporates instead of independently adding lots of company record to Wikidata. Sure there are parallel strategies but any effort should also include OpenCorporates to some degree. OpenCorporates is licensed under ODbL (just added this referenced

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-19 Thread Thad Guidry
No connections to Opencorporates, sorry. The good news is that the data sources in Opencorporates (the Registers) are accessible to you...sometimes in dump format. https://opencorporates.com/registers Hope that helps you further in your research and needs. I am not saying its easy :) -Thad

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-19 Thread Luigi Assom
Hi Thad, It is a really great project, I quote some of the points of Sebastian: >* # regarding Opencorporates *>* I have a critical opinion with > Opencorporates. It appears to be *>* open, but you actually can not get > the data. If somebody has a *>* data dump, please forward to me. Thanks. *

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-19 Thread Thad Guidry
Hi Luigi, Have you looked at https://opencorporates.com ? Thad +ThadGuidry ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-19 Thread Luigi Assom
Hi, I would like to join thread I found in the archive: https://lists.wikimedia.org/pipermail/wikidata//2017-October/011259.html I worked in contextual research to facilitate knowledge transfer. One of the domain I would like to treat is visualisation of economics networks. I seek for an

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
Ok, I put some effort into https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot/Handelsregister to move the discussion there. All the best, Sebastian On 16.10.2017 18:06, Yaroslav Blanter wrote: Dear All, it is great that we are having this discussion, but may I please

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
Hi Yaroslav, in addition to this list, I added it here: https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot/Handelsregister and here: https://www.wikidata.org/wiki/Wikidata:Project_chat#Handelsregister but I received more and longer answers on this list. All the best,

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Yaroslav Blanter
Dear All, it is great that we are having this discussion, but may I please suggest to have it on the RfP page on Wikidata? People already asked similar questions there, and, in my experience, on-wiki discussion will likely lead to refined request which will accomodate all suggestions. Cheers

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
ah, ok, sorry, I was assuming that Blazegraph would transitively resolve this automatically. Ok, so let's divide the problem: # Task 1: Connect all existing organisations with the data from the handelsregister. (No new identifiers added, we can start right now) Add a constraint that all

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Ettore RIZZA
While I'm on the subject, I would like to draw attention to the Neckar project , which aims precisely to classify Wikidata entities in people, places and organizations. Frequently updated Json dumps are available. 2017-10-16 16:08 GMT+02:00 Ettore

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Ettore RIZZA
@Antonin : Thanks for this counting method, it seems very effective (I already knew that there were 3.6 M of humans (Q5) in Wikidata).

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Antonin Delpeuch (lists)
And… my own count was wrong too, because I forgot to add DISTINCT in my query (if there are multiple paths from the class to "organization (Q43229)", items will appear multiple times). So, I get 1 168 084 now. http://tinyurl.com/yaeqlsnl It's easy to get these things wrong! Antonin On

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Antonin Delpeuch (lists)
Thanks Ettore for spotting that! Wikidata types (P31) only make sense when you consider the "subclass of" (P279) property that we use to build the ontology (except in a few cases where the community has decided not to use any subclass for a particular type). So, to retrieve all items of a

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Ettore RIZZA
> > - Wikidata has 40k organisations: https://query.wikidata.org/#SELECT %3Fitem %3FitemLabel %0AWHERE %0A{%0A > %3Fitem wdt%3AP31 wd%3AQ43229.%0A SERVICE wikibase%3Alabel { > bd%3AserviceParam wikibase%3Alanguage "[AUTO_LANGUAGE]%2Cen". }%0A} Hi, I think Wikidata contains many more

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread hellmann
The best way then to not create duplicates is to look at all existing organizations in Wikidata and add the court and court number manually, if they are German and then exclude these from the import. Guarantees that there will be no duplicates. So the technical side is feasible. Barriers are

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
Ah yes, forgot to mention: there is no URI or unique identifier given by the Handelsregister system. However, the courts take care that the registrations are unique, so it is implicit. Handelsregister could easily create stable URIs out of the court+type+number like /Leipzig_HRB_32853 For

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
Hi all, the technical challenges are not so difficult. - 2.2 million are the exact number of German organisations, i.e. associations and companies. They are also unique. - Wikidata has 40k organisations: https://query.wikidata.org/#SELECT %3Fitem %3FitemLabel %0AWHERE %0A{%0A %3Fitem

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Federico Morando
Dear All, although in Italy these data are normally not available (not even the basic data) from the chambers of commerce, there are some open data from which we could extract several identifiers - of course these are biased toward the suppliers of Public Administrations, because contracting with

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Andra Waagmeester
There is an equal size of data on Belgian enterprises available. with the same objective to enrich wikidata with enterprise data I recently proposed the following property: https://www.wikidata.org/wiki/Wikidata:Property_proposal/NACE_code However, after some talks with others in the Wikidata

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Neubert, Joachim
Hi Sebastian, This is huge! It will cover almost all currently existing German companies. Many of these will have similar names, so preparing for disambiguation is a concern. A good way for such an approach would be proposing a property for an external identifier, loading the data into

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-16 Thread Sebastian Hellmann
Thanks, done. https://www.wikidata.org/wiki/Wikidata:Project_chat#Handelsregister On 15.10.2017 22:10, Yaroslav Blanter wrote: Hi Sebastian, I would say the best way is to file a request for the permissions for the bot https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-15 Thread Federico Leva (Nemo)
This is an area where I would very much like to see some important properties created and populated, to the benefit e.g. of various infoboxes on Wikipedias which contain data in need of frequent updates (especially income, revenue, market capitalization, number of employees, links to most

Re: [Wikidata] Kickstartet: Adding 2.2 million German organisations to Wikidata

2017-10-15 Thread Yaroslav Blanter
Hi Sebastian, I would say the best way is to file a request for the permissions for the bot https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot and possibly leave a message on the Project Chat https://www.wikidata.org/wiki/Wikidata:Project_chat Cheers Yaroslav On Sun, Oct 15,