FindingGLAMs/Information for GLAM Partners

Home

GLAM statistics

Datasets

Partners

Get Involved

Information for local teams

Information for GLAM Partners

White Paper

Documentation

Timeplan

Discussion

Information for GLAM Partners

About the Project

FindingGLAMs is a project aimed and collecting data about the world's cultural heritage institutions (galleries, libraries, archives and museums) and their collections, and making it available on Wikimedia projects, such as Wikidata, Wikimedia Commons and Wikipedia. By making this information readily available, more people will have the chance to learn about our cultural heritage institutions and the invaluable work they do. We believe that access and awareness are key factors in building an interest in preserving cultural heritage and understanding its importance.

Sharing your content

Media

As part of the project, we will help cultural heritage institutions share their media, such as images and audio files, on Wikimedia Commons. This will make them available for inclusion in Wikipedia and other projects, bringing them to a worldwide audience and increasing the visibility of the institution's collections and work.

We want to learn how to better support cultural heritage institutions by focusing on diverse content (such as maps, audio, 3D models and scanned documents) and subjects (e.g folk costumes, coins, originates, cultural and natural environments).

Data

The information we collect will be made freely available through Wikidata – a free and open database of structured data and one of Wikipedia's sister projects. Wikidata will enable users to integrate the information into Wikipedia in any of its 280+ languages. This can then act as a starting point for the world-spanning community of volunteers who are active on Wikipedia and Wikidata to further enrich the material.

By making the information more widely available we also believe that local communities, as well as the general public, can make use of the information in ways which allows them to both learn about their cultural heritage institutions and help with documenting them. Since the enriched information will be freely available to everyone, the resulting data can in turn be re-used, be it for tourism, as a complement to the official data or any other imaginable use.

The information will be clearly referenced so that it is obvious that it comes from official data, while at the same time clearly labeling any additional information which is added so as to make it clear that such info is not official nor is the agency in question responsible for it.

What sort of data are we looking for?

We are looking for datasets of cultural heritage institutions (museums, galleries, libraries and archives). For example, lists of museums in a certain country, state or other administrative unit.

How detailed is the data supposed to be?

The more detailed, of course, the better. But any data is better than nothing! Once it's uploaded to Wikidata, the data will become accessible to volunteers all around the world, who will then be able to edit and enrich it. There is a lot of interest in cultural heritage institutions on Wikimedia projects, meaning a high chance the data will be noticed, edited and re-used.

At the very least, we need this information to create a Wikidata item for a cultural institution:

The name of the institution (in one or more languages).
The type of the institution – is it a museum, a library, an archive?
The location of the institution. It can be general, such as a state or province, but more detailed data (city, street address, coordinates…) is better.

Once we're moving beyond the basics, there's a lot of additional information that can be converted to Wikidata properties:

Year of establishment
Number of visitors in a particular year
Collection size
Who is the director of the institution
Whom the institution is named after
Official website
Social media accounts

…and many more. For real examples, see Library of Congress (Q131454) or Hermitage Museum (Q132783).

What format is the data supposed to be in?

Since we will be working with datasets of hundreds or thousands of items, the data should be in a machine-readable format. As a rule, structured data is better than unstructured text, and open formats are better than proprietary ones. The data may be accessible either as downloadable files or via an API. Examples of good formats are csv/tsv and json.

On Wikidata:Open data publishing you can see on overview of different data formats and how they rate in terms of data openness.

Ideally, the dataset would also contain appropriate metadata, such as when and by whom it was created.

What about the copyright?

The data on Wikidata is available under the Creative Commons CC0 License. This license allows people to use the data without restrictions; no attribution is required. This is different from Wikipedia, which applies the Creative Commons Attribution license. CC0 is equivalent to Public Domain.

This means that in order for your data to be uploaded to Wikidata, it has to have been released under an open license. You can read more about copyright licensing on Wikidata on Wikidata:Licensing and about the benefits of open data publishing on Wikidata:Open data publishing.

About us

Wikimedia Sverige (Wikimedia Sweden) is a charitable non-profit organisation working out of Sweden dedicated to promoting free access to knowledge for everyone. We are a chapter of the Wikimedia Foundation which has the goal to "empower and engage people around the world to collect and develop educational content" and most famously does this through Wikipedia, the free-of-charge and free-of-advertisement multilingual encyclopedia which monthly serves more than half a billion unique visitors.

Wikimedia Foundation: The non-profit Wikimedia Foundation provides the essential infrastructure for free knowledge. They host Wikipedia, the free online encyclopedia, created and edited by volunteers around the world, as well as many other vital community projects. They welcome anyone who shares our vision to join us in collecting and sharing knowledge that fully represents human diversity.

The Swedish Postcode Foundation: provides the project funding for Finding GLAMs, they believe the world is getting better with the help of strong nonprofit organizations.