You can get the data from here:
http://dumps.wikimedia.org/wikidatawiki/20130417/

All items with all properties and their values are inside the dump. The
questions would be, based on this data, could we make suggestions for:

* when I create a new statement, suggest a property. then suggest a value
* suggest qualifier properties, then suggest qualifier values (there is no
data yet on qualifiers, but this would change soon)
* suggest properties for references, and values

Does this help?

Cheers,
Denny





2013/4/19 Nilesh Chakraborty <[email protected]>

> Hi,
>
> I am a 3rd year undergraduate student of computer science, pursuing my
> B.Tech degree at RCC Institute of Information Technology. I am proficient
> in Java, PHP and C#.
>
> Among the project ideas on the GSoC 2013 ideas page, the one particular
> idea that seemed really interesting to me is developing an Entity
> Suggester for Wikidata. I want to work on it.
>
> I am passionate about data mining, big data and recommendation engines,
> therefore this idea naturally appeals to me a lot. I have experience with
> building music and people recommendation systems, and have worked with
> Myrrix and Apache Mahout. I recently designed and implemented such a
> recommendation system and deployed it on a live production site, where I'm
> interning at, to recommend Facebook users to each other depending upon
> their interests.
>
> The problem is, the documentation for Wikidata and the Wikibase extension
> seems pretty daunting to me since I have not ever configured a mediawiki
> instance or actually used it. (I am on my way to try it out following the
> instructions at
> http://www.mediawiki.org/wiki/Summer_of_Code_2013#Where_to_start.) I can
> easily build a recommendation system and create a web-service or REST based
> API through which the engine can be trained with existing data, and queried
> and all. This seems to be a collaborative filtering problem (people who
> bought x also bought y). It'll be easier if I could get some help about the
> part where/how I need to integrate it with Wikidata. Also, some sample
> datasets (csv files?) or schemas (just the column names and data types?)
> would help a lot, for me to figure this out.
>
> I have added this email as a comment on the bug report at
> https://bugzilla.wikimedia.org/show_bug.cgi?id=46555#c1.
>
> Please ask me if you have any questions. :-)
>
> Thanks,
> Nilesh
>
> --
> A quest eternal, a life so small! So don't just play the guitar, build one.
> You can also email me at [email protected] or visit my
> website<http://www.nileshc.com/>
> _______________________________________________
> Wikitech-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




-- 
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 | http://wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg unter
der Nummer 23855 B. Als gemeinnützig anerkannt durch das Finanzamt für
Körperschaften I Berlin, Steuernummer 27/681/51985.
_______________________________________________
Wikitech-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Reply via email to