Research talk:Scholarly article citations in Wikipedia

From Meta, a Wikimedia project coordination wiki

Welcome![edit]

Please post questions and ideas here. We accept pull requests and address bug reports at https://github.com/halfak/Extract-scholarly-article-citations-from-Wikipedia. --Halfak (WMF) (talk) 22:41, 9 February 2015 (UTC)[reply]

Proposal[edit]

Is it possible to determine, how many DOIs from national publishers each language version of Wikipedia cites? Is it possible to fetch the language of a paper by it's DOI? --Kopiersperre (talk) 14:14, 23 May 2015 (UTC)[reply]

@Kopiersperre: this is a great question but it's hard to answer without explicitly storing additional metadata such as the location of the publisher or the language of the paper. Making bibliographic data on every source cited across all Wikipedias available in Wikidata is precisely meant to help answer this type of questions (see this slidedeck, p.52). I hope we can make some good progress soon.--Dario (WMF) (talk) 15:05, 2 June 2017 (UTC)[reply]

Figshare item deleted[edit]

sorry, this page is no longer available

This content has been intentionally removed or had its access disabled.

Reason: The content did not adhere to figshare's terms and conditions

What happened? Please repost it on Zenodo. Nemo 08:03, 26 August 2019 (UTC)[reply]

I am not sure of why the figshare item has been taken down, but in case others are looking for the dataset, it also is hosted here: https://analytics.wikimedia.org/datasets/archive/public-datasets/all/mwrefs/ --Isaac (WMF) (talk) 18:28, 28 August 2019 (UTC)[reply]

Count of domain names used in references[edit]

Hello friends. Great job on your work analyzing PMID's, DOI's, etc. I was wondering if you happen to have similar data, but instead of grouping by PMID, DOI, etc. you group by domain name? I would be interested in a list of the 5000 most referenced domain names on the English Wikipedia, with the goal of making sure my citation highlighter script includes the most used websites. Thanks a lot. Looking forward to your feedback. –Novem Linguae (talk) 06:51, 1 March 2021 (UTC)[reply]