Grants:IEG/Open Access Reader
Open Access Reader is a project to systematically ensure that all significant open access research is cited in Wikipedia.
Improving Quality, Increasing Participation
2014 round 1
What is the problem you're trying to solve?
The 2007 research study "What open access research can do for Wikipedia" identified a gap between available open publications and article content, and proposed a model to close it.
The Wikimedia movement has made considerable inroads in this broad area in the following ways:
- Increasing the coverage and use of academic publications on the projects
- Raising awareness of open access publications more broadly (WikiProject Open Access and WikiProject Open)
- Defining the standard information collected about publications and the instance that relates to topic metadata (Wikidata Periodicals Task Force)
Despite this, the gap between the wealth of available resources and the citations that substantiate and improve Wikipedia article content persists; it was most recently discussed in a session at Wikimania 2013.
What is your solution?
The Open Access Reader project ultimately aims to deliver a complete workflow from online open resource to editor, giving editors access to highly relevant open research sources on the topics they are interested in developing in article space. For details of the entire project, see the Open Access Reader Project Page.
This grant proposal covers scoping out and beginning to lay the infrastructure for such a project, and for other possible projects in the OA space:
- Evaluating existing open access research aggregators
- Evaluating existing open metrics for papers (e.g. citations)
- Evaluating open research topic metadata
- Evaluating existing content suggestion mechanisms for editors
By funding time for product management and a couple of software development sprints, this grant enables an exploration of the space where open access scholarship and Wikimedia meet. We intend to:
- Engage with members of the community who have worked, historically or currently, across Wikimedia projects in this area, including:
- the Wikidata Periodicals Task Force and the engineering team, about past attempts at delivering customizable content based on topic (e.g. WikiFeed, SuggestBot)
- Wikimedia or academic community members who have worked with programmatic manipulation of OA papers.
- Identify and evaluate as many open access paper aggregators with appropriate APIs as we can find. In general, try to lay groundwork and create useful infrastructure for any future projects that would like to work with OA resources, including:
- Developing and delivering a specification for filtering logic and a programme that could screen available data and define the notability of content. One suggested option is to use how often a given paper is cited in other papers as an indicator of significance. Other filters may also be relevant, and consultation in both the academic and Wikimedia communities would help feed into developing this specification.
- Paper metadata is of variable quality. We'll attempt to create a filter to categorise research papers by topic, which can then be supported by an on-wiki system to reference metadata labels. These can then be matched to appropriate WikiProjects. It is possible that the Wikidata Periodicals Task Force instance labels are already beginning to reproduce this function.
- If there is budget remaining, try prototyping various tools using this infrastructure in concert with editors to aid their workflow, and if they prove useful, promote these tools both within and outside Wikimedia.
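The citation-count option mentioned above could be sketched roughly as follows. This is only an illustration, not the project's specification: the `Paper` type, the `filter_significant` function, the sample records and the threshold of 10 citations are all hypothetical, and a real filter would be shaped by the community consultation described above.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    """Hypothetical record for one open access paper."""
    title: str
    citation_count: int  # how many other papers cite this one


def filter_significant(papers, min_citations=10):
    """Keep papers cited at least `min_citations` times, most cited first.

    The default threshold is an arbitrary placeholder, not a proposed value.
    """
    kept = [p for p in papers if p.citation_count >= min_citations]
    return sorted(kept, key=lambda p: p.citation_count, reverse=True)


# Made-up sample data, for illustration only.
sample = [
    Paper("Paper A", 3),
    Paper("Paper B", 42),
    Paper("Paper C", 10),
]

for paper in filter_significant(sample):
    print(paper.title, paper.citation_count)
```

Citation count is only one candidate signal; additional filters could be added as further predicates inside the same function without changing how it is called.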
- Research, including identifying relevant internal (community) and external (academic and publishing) groups to engage in the four phases of the project
- Incorporating feedback generated into specifications for filter, metadata lookup labels and template design and implementation
- Identify and contract with software developer(s) who will have relevant expertise to work on this project now and in future phases
- Schedule and collaborate on two sprint sessions (see below)
- Develop and update specifications for each project phase based on feedback and sprint activities
- Design database for aggregation of open access sources
- OA Repository API integration.
- Web front-end, if specification calls for it.
- Integration with the MediaWiki API, if specification calls for it.
- Develop template and integration of metadata labels to project (NB this could be something developed by volunteers if interest is generated)
Total amount requested
Hourly rates are based on WMF HR benchmarks, as supplied by Siko in an email of 25/05/14:
- 40 hours of developer time spread across 2 sprints budgeted for equivalent of $50/hr: 2,000 USD
- 182 hours product management for 7 hours a week over six months for equivalent of $25/hr: 4,550 USD
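As a quick check of the two line items above, the arithmetic can be reproduced as below. The only assumption is that six months is counted as 26 weeks, which is what makes 7 hours a week come to 182 hours; the sum covers just these two items.

```python
# Assumption: six months is counted as 26 weeks.
WEEKS = 26

dev_hours = 40          # developer time across 2 sprints
dev_rate = 50           # USD per hour
pm_hours = 7 * WEEKS    # product management at 7 hours a week
pm_rate = 25            # USD per hour

dev_cost = dev_hours * dev_rate
pm_cost = pm_hours * pm_rate

print(pm_hours)             # 182
print(dev_cost)             # 2000
print(pm_cost)              # 4550
print(dev_cost + pm_cost)   # 6550
```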
- English Wikipedia contributors in WikiProjects where there is a pressing need to develop a density of citations in order to improve article quality.
- English Wikipedia contributors in topics where there has been a persistent lack of success in increasing the number of vital articles as opposed to featured or good.
- Open drafting of sprints schedule and specifications
- Linking these to appropriate Wikimedia project communities as identified in scoping phase for feedback
- Promoting the development of the tool to internal and external audiences throughout Wikimania 2014 & pre-event hackathons, particularly the Open Scholarship Hack Weekend.
- Work with chapter groups and thematic orgs (e.g. Wikimedia UK Technology Committee and Wiki Education Foundation)
Fit with strategy
- Increase Participation: This project serves the movement's strategic goal to 'Increase Participation' because of the potential to leverage WikiProjects as a channel for editor engagement by providing them with fresh content, reactivating lapsed editors and supporting existing editors. We also believe that the OAR workflow may be a significant tool for recruiting new editors, e.g. in an undergraduate teaching context.
- Improve Quality: It is also significantly supportive of the strategic goal to 'Improve Quality' by virtue of providing better sourced content, with sources that can be routinely checked against newer research in a given topic area as a result of being open access.
- The project goal is to provide a prototype workflow that is delivering as envisioned. This is most manageable if after the first phase of development (Aggregator and API) the specifications for a filter, matching metadata to instance labels and WikiProject templates are initially focused on a small number of publications and corresponding WikiProjects. Delivering this successfully would act as proof of concept of the specifications developed, allow feedback to be incorporated and then a new specification developed based on new challenges faced in different topic areas.
- Currently this workflow mechanism is exclusively focused on delivering content to English language Wikipedia projects. However, once developed there would be scope to advise and collaborate with the international community if they wished to replicate the process for their in-country languages, wikis, and wiki projects.
- The further development of proactive notification or delivery of content to editors beyond availability on project pages or portals - perhaps by developing bots to cross reference user box interest and new content in a given metadata label, or subscription to a regular talk page delivery announcing new content.
Measures of success
- The engagement of at least 20 individuals from within related Open Access and Academic communities in notability filter specification design and feedback
- The engagement of at least 50 individuals from within the editing communities in metadata keyword matching specification design and feedback
- At least 50 articles show improvement with relevant content supported by the new citations
- Project Lead: EdSaperia (talk)
- Grantwriter: Katherine Bavage
- Open Access consultants
- Identified Developers
- Tool Labs/Mediawiki familiarity: User:A930913
- Web frontend: Kimi Lawrie
Do you think this project should be selected for an Individual Engagement Grant? Please add your name and rationale for endorsing this project in the list below. Other feedback, questions or concerns from community members are also highly valued, but please post them on the talk page of this proposal.
- See Open Access Reader Project Page for more endorsements.
- Jodi.a.schneider (talk) 08:52, 1 April 2014 (UTC) Making open access content easier to find and cite would be beneficial, as the resulting references could be read and accessed by *all* readers.
- This would help us a great deal. ----Rsrikanth05 (talk) 19:35, 22 April 2014 (UTC)
- By allowing Wikipedians to find and cite Open Access content easier, this project would strengthen the Open Access ecosystem. --Frank Schulenburg (talk) 21:21, 24 April 2014 (UTC)
- Endorse Jane023 (talk) 08:34, 29 April 2014 (UTC)
- Endorse --Aubrey (talk) 08:38, 16 October 2014 (UTC)
1 = weak alignment, 10 = strong alignment
- (A) Impact potential
- (B) Innovation and learning
- (C) Ability to execute
- (D) Community engagement
Comments from the committee:
Thank you for submitting this proposal. The committee is now deliberating based on these scoring results, and WMF is proceeding with its due diligence. You are welcome to continue making updates to your proposal pages during this period. Funding decisions will be announced by the end of May. — ΛΧΣ21 00:01, 13 May 2014 (UTC)