Dear all,

we have recently released a dataset which might be of interest to some of you. 
Wikipedia Citations contains the nearly 30M citations to be found in English 
Wikipedia (as of May 1st 2020), of which approx. 4M are to scientific 
publications. Most of these 4M have also been equipped with identifiers (ISBN, 
DOI, etc.). All code is there for replication and updates.
 
Pre-print: https://arxiv.org/abs/2007.07022 <https://arxiv.org/abs/2007.07022>
Data and code: https://zenodo.org/record/3940692#.XyQjaPj7SL8 
<https://zenodo.org/record/3940692#.XyQjaPj7SL8>
 
We welcome feedback, ideas for collaboration and any question you might have in 
order to use the dataset for your research and work.
 
Best regards,
Giovanni Colavizza (with Harshdeep Singh and Bob West)
_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to