The Wikipedia Library/Building a Digital Library

From Meta, a Wikimedia project coordination wiki

The Wikipedia Library

Wikipedia Builds a Digital Library System

We've always thought that the world's largest encyclopedia should have a world-class library. Through the Wikipedia Library program, the encyclopedia's editors have free access to a collection of over 80,000 unique periodicals, like journals, magazines, newspapers, newsletters, pamphlets, and series, in addition to an untallyable number of books. This access has been facilitated by over 60 partners, including many of the world's leading publishers and aggregators.

The library of resources available to Wikipedians continues to grow, allowing these editors to use the best sources available to improve Wikipedia.

Why do we do this? Because facts matter.

That said, while an enormous amount of content is available, our current distribution and access processes leave a lot of room for improvement. Signing up for partners is currently done individually and on a per-partner basis, resulting in a slow turnaround on approval and distribution of access, taking on average three weeks from application to access, which is far too long. We've been limited in the number of accounts we can give out for most publishers and accounts are generally granted for exactly one year at a time, whether editors need longer perpetual access, or worse if they only want to grab a couple of references. Lastly, there hasn't been a way to search through the vast content across all our separate partners.

This year, we’re working hard to solve these problems. We're excited to share our plans with you, from the full rollout of the Wikipedia Library Card Platform to adding proxy access using Wikipedia logins to piloting a "bundle" of resources that could be accessed by thousands of qualifying editors at any time, as well as making Wikipedia's citations far more open and accessible for readers.

You build a thousand castles, a thousand sanctuaries, you are nothing;
you build a library, you are everything! -Mehmet Murat ildan


Wikipedia Library Card[edit]

Homepage of the Wikipedia Library Card platform
Homepage of the Wikipedia Library Card platform


At the centre of our plans for increasing and improving access is the Wikipedia Library Card platform. We rolled out phase one last quarter, addressing the slow signup and approval challenges, beta testing and improving it with our latest new partners. Already delivering access, it should improve signup speed from three weeks on average to closer to one week, won’t require editors to provide all their details every time they sign up for a new resource, and will be translated into as many languages as possible. By the end of March, we plan to move all partner signups over to the platform.


Proxy Access[edit]

Our next step, phase two, is to integrate a proxy authentication method, allowing users to use their single Wikipedia login for direct access to partners who can accept authentication through the Library Card. This will greatly improve the ease of access for editors, should reduce the workload for us and our partners, and will hopefully translate to increased usage and citation of available resources. We are aiming to have this ready in approximately 6 months.

Proxy integration alone isn’t a major departure from the current setup; the same individually approved users would have access to one partner's content per application, they will just be accessing it directly through a single authenticated login proxy rather than a username and password distributed for each website.

Wikipedia Library Bundle[edit]

A very exciting addition to our signup model, currently dependent on per-user approvals by volunteer coordinators, will be the Wikipedia Library Bundle. It would give any editor who meets account age, edit count, and recent activity criteria automatic access through the platform to a certain set of TWL partners, effectively replacing the account coordinator approval step, and covering approximately 25,000 editors across all language Wikipedias.

The Library Bundle will provide immediate access to participating partner resources for eligible Wikipedians, without having to sign up and with no need to worry about only using their access for a handful of sources at a time. We’re really excited about the opportunities and accessibility this access method will provide.

To automate the account coordinator check for recent activity and good standing in the community we will be implementing requirements beyond the current 500 edits and account age of 6 months. These automated checks would include recent activity (e.g. 10 edits in the past month) and not currently being blocked. The requirements aren’t yet finalised and there may be other restrictions such as a limit on total concurrent users, but we don’t aim to make the requirements more restrictive than the current checks carried out by account coordinators.

This will run on an opt-in model that some partners will choose to be a part of, and we have had encouraging responses from a number of publishers who are already excited to use this distribution method.


Integrated Search[edit]

Phase three of the Library Card Platform will seek to solve the issue of editors needing to browse partner-by-partner for needed resources. We will be implementing an integrated search tool which will index partner resources and provide search via a single interface. Not only will editors not need to log in separately to each of TWL’s partners’ websites, they will be able to search all their content from one place too.

Integrated search should pair powerfully with proxy authentication. If an editor finds a search result from a partner they've individually signed up for, they'll be able to click directly through to it from the Wikipedia Library Card platform. And, if that partner is a member of the Wikipedia Library Card Bundle, then they'll be able to access it automatically even if they've never signed up for it—just because they meet the basic criteria for account age, edits, and recent activity.


Open Citations[edit]

Our publishing partners have further empowered Wikipedia by sharing access to their scholarly and news sources. We are always experimenting with and evolving our publisher relationships to improve the ability of Wikipedia editors to do rigorous research more easily and more impactfully.  We also care about our readers and their ability to access full text.

OABot is the next step in bringing that openness to readers. OABot, technically approved but still pending community consensus, scans closed Wikipedia citations and finds free-to-read links available in open web repositories; it then adds a link to the open version into the existing Citation. Ideally, this added link will be tagged with an icon indicating that it's free-to-read.

Open Access Button is another useful tool, that pings paper authors when readers hit a paywall to their work. OAButton is working on a batch request infrastructure to contact thousands of authors simultaneously.

Since OABot scans Wikipedia to determine where a free-to-read link is missing but available elsewhere, by inverse, it can determine when a free-to-read link is missing and not available elsewhere. Thus, we can generate a batch of hundreds of thousands Wikipedia citations that aren't free to read, and send them to OAButton to contact authors with the specific use cases of becoming readable from Wikipedia. Completing the feat, we could have OAButton simultaneously instruct authors how to deposit a free version of their paper in a way and place that OABot will definitely find it, and add that newly open link directly into Wikipedia.

A virtuous circle from empowered editors to informed readers. That is what we're building.