Static content group

From Meta, a Wikimedia project coordination wiki
Jump to: navigation, search
STATIC CONTENT

Static content group (talk)
CD/DVD on meta
WP 1.0 on meta
German CD on meta
Polska DVD on meta
Mandriva on meta
Offline readers
Offline task force
on strategy wiki

Software tools
Alt parsers (on MediaWiki)
WikiMiner(pl,en)
Kiwix
wiki2cd
GERMAN WP 1.0 (t)
de info in English

POLISH WP 1.0

ITALIAN WP 1.0 (it)

Malayalam WP 1.0 (ml)

ENGLISH WP 1.0 (t)
Bot (t) Criteria.
SOS Children DVD online browsable (t)
Version 0.5 (t) (bot)
(Nominations) (t)
Core topicsTorrent
Work via WikiProjects

Wikipédia Junior (active)
FRENCH CD (very old)

This is a group and set of projects dedicated to gathering and sharing static content from Wikimedia projects, including CDs and DVDs, single-file databases for use with specific browsers and readers, and pdf and html exports.

Background[edit]

In 2004, the first Wikipedia CD was released, and the concept of WikiReaders for offline printing and reading was refined. In 2006, the Special Projects Committee authorized the creation of a subcommittee dedicated to static snapshots of Wikimedia content, to identify the groups working on such projects, and to help them work effectively together and share their results. This page is derived from those efforts and the work of all who have pursued similar goals.

Related objectives

  • content quality/vetting for "Wikipedia 1.0"
  • content on paper
  • topical subsets
  • selections for different audiences

Goals[edit]

  1. Maintain, offer and assist interested parties in getting
    • static content of Wikimedia projects in a variety of formats
    • metadata about its completeness, complexity/age level, and quality.
  2. Coordinate efforts to produce and distribute snapshots, and software for viewing snapshots.
  3. Stimulate research about content and making content accessible in an offline or semi-online environment.

Status[edit]

Older status[edit]

As of mid-2006, the Wikimedia Foundation served to the world outdated version of static content of Wikipedias (November 2005 snapshot - [1]) and offered the same content for download ([2]). There seeem to be persistent small problems with static content installation (categories,search, perhaps other).

There were plans to set up an up-to-date server with current static content of Wikipedias.

MediaWiki 1.5 included routines to dump a wiki to HTML, rendering the HTML with the same parser used on a live wiki.

There have been several separate attempts at producing software to convert SQL dumps into data formats that are suitable. Directmedia Publishing GmbH in Germany had by then a successful history of distributing Wikipedia content on DVD for Windows and MacOSX (Linux is beta).

Recent status[edit]

Needs updating since 2007!

See List of Offline Projects for updated list of offline projects in the community.

Food for thought[edit]

German content[edit]

Other languages[edit]

Other project languages : Chinese? Dutch? Italian? Russian?

Other projects[edit]

Since before 2004:


Formats and readers[edit]

Distribution formats: plain (x)HTML, PDF, TomeRaider, Plucker, Webaroo pack, proprietary formats

e-Reader projects:

  • Directmedia (free, if not open source, on Linux and Mac platforms - Linux version (digibux) is GPL)
  • KDE
  • Browser-based (generic reader platform; javascript optional)
  • Other (See below)

Kiwix[edit]

Screenshot of the version 0.9 (screencast)
KIWIX Flyer - Your Wikipedia Offline
KIWIX Brochure - Your Wikipedia Offline

KIWIX - Wikipedia Offline in a Nutshell[edit]

Kiwix is an offline reader for web content. It's especially thought to make Wikipedia available offline. This is done by reading the content of the project stored in a file format ZIM, a high compressed open format with additional meta-data. KIWIX also gives you the freedom to copy, modify and distribute the data.
To sum up: KIWIX allows you to store the whole Wikipedia offline on your device, USB flash drive or DVD and access content incredibly fast.

Why offline matters[edit]

We're featuring a quote here from the UN Broadband Commission from their September 2013 report, because it's the easiest, most pragmatic and straight-forward way to show you the importance of disseminating knowledge - and information - offline, complementary to all activities that we do online: "“While more and more people are coming online, over 90% of people in the world’s 49 Least Developed Countries remain totally unconnected.”[1]

Projects that involve Wikipedia Offline[edit]

KIWIX is mostly installed in schools that cannot afford broadband internet access. In these cases, it's so much faster to use Wikipedia offline

Wikipedia offline in Jails[edit]

Since March 2013, prisoners who request can have an access to Wikipedia offline. The idea is to stimulate or to support the interest for education of prisoners who were, for a large majority, condemned to long-time sentences. After three months of pilot phasis, the project is successful: Among the 36 prisoners of the Bellevue’s prison in Gorgier, 18 possess or rent a computer. All of them requested the upload of Wikipedia offline on their PC. For security reasons, swiss prisoners have a very restricted access to internet. The feed-backs are unanimously positive: they reveal that Wikipedia is seen as an improvement of the education and/or information activities in jail. The follow-up of the project aims to use Wikipedia in the training program of the prisoners: use of Wikipedia in the classes, organization of general culture contests, even train new Wikipedia editors. The partnership between Wikimedia CH and the direction of the prison aims to be durable: Wikimedia CH installed the Kiwix files and trained the IT team of the prison, who can now upload the software for every new prisoner who requests.
WMCH is now collaborating with the Swiss Insitute for Education in Detention Centers to expand the coverage of Wikipedia offline in Swiss Prisons. Detention Centers for minors are excluded from this program in Switzerland as they get access to the internet and don't have the need to access Wikipedia offline. Canada, France and Belgium also have have similar projects in prisons that involve KIWIX.

Afripédia[edit]

To get information on the project Afripédia of Wikimedia France, you can go to the page of Afripedia here on Meta.

Enciclopedia de Venecuela[edit]

A selection of articles about Venecuela are made accessible for pupils and students, among others on OLP devices.

Wikipedia for Schools[edit]

"At SOS Children, we wanted to bring this fantastic resource to children without internet access around the globe. So we began work on an ambitious project to get the very best content from Wikipedia into a self-contained selection which could be distributed on a CD. We checked every article for child friendliness and structured the content around the national curriculum. Today, Wikipedia for Schools is in its fourth incarnation, and the new version is ready to go - this time on USB. At EduWiki 2013, we will show you how the project has benefited students and teachers here in the UK, and in countries across the developing world. With the help of others, we have distributed copies globally, and we have had an amazing response from the people who count. In the UK, Wikipedia for Schools has been a great classroom companion for students and teachers alike.” [2]

User Feedback[edit]

  • "Very important and helpful source of information" (User from Bahrain)
  • "Thank you for your help! Now my school can use Wikipedia offline."' (User from Mexico)
  • "I like to browse my favourite encyclopedia even when there is no network" (User from Yemen)
  • "I have no internet in my house. KIWIX is such a help, because I need Wikipedia for my study."' (User from Cuba)

Features[edit]

KIWIX provides a range of opportunities and here you go with a shortlist of the most important ones:

  • Portable: Kiwix is a portable application you don't need to install. KIWIX supports a wide range of systems and architectures.
  • User-friendly: KIWIX works like your web browser and is translated into your native language.
  • Library: KIWIX own library allows you to gather content at first sight.
  • Search Engine: KIWIX has got a title suggestion system. This helps you to quickly get the information you need.
  • Web Server: Kiwix allows you to share content on your LAN with kiwix-serve, the KIWIX HTTP server.
  • Open: KIWIX uses open formats and protocols. KIWIX produces open-source software.

Technical Specifications[edit]

  • Pure ZIM reader
  • Content and download manager
  • Case and diacritics insensitive full text search engine
  • Bookmarks & Notes
  • kiwix-serve: ZIM HTTP server
  • PDF/HTML export
  • Multilingual (UI in more than 110 languages)
  • Search suggestions
  • ZIM indexing capacity
  • Support for Android / MacOSX / Linux / Windows / Sugar
  • DVD/USB launcher for Windows (autorun)
  • Tabs

Do you want to get involved?[edit]

There are many ways to participate and to work with us in order to develop the KIWIX - Wikipedia offline project. The following list features many topics where help would really be appreciated:

  • Translations: The KIWIX user interface is translated into more than 100 languages. We still have some more work to do here.
  • Support: KIWIX has a broad community - we need to care for it! It's essential to maintain good communication internally and with our users; both should be able to quickly get the information and the help they need.
  • Projects: We have a lot of ideas and we try to implement the best ones. Supported by the Wikimedia Foundation, Wikimedia national chapters and a few other organizations, KIWIX is able to set up ambitious projects.
  • Development: KIWIX software development is assured by a really small team of developers. To continue the development of KIWIX, new talented developers are welcome. Mentored by an experienced team, they may work on new features or help to maintain the existing solution.

Get in touch[edit]

  • kiwix.org
  • twitter.com/kiwixoffline
  • facebook.com/kiwixoffline
  • contact@kiwix.org

See also[edit]

References[edit]

  1. http://www.broadbandcommission.org/Documents/bb-annualreport2013.pdf Annual UN Broadband Commission Report 2013
  2. https://wiki.wikimedia.org.uk/wiki/EduWiki_Conference_2013/Abstracts#Workshops by Jamie Goodland, who works with the international children’s charity SOS Children

Useful (external) links[edit]

Online content

Software

Database dumps

Participants[edit]

Initial members of the subcommittee:

Other interested people:

  • Eric Astor (working on OEPC)
  • Erik Garrison (working on related statistics)
  • Eyu100 (asked the get involved)

Guidelines and coordination[edit]

Style guidelines for each project should be written down, for the benefit of projects to come after them. Coordination across projects of aspects such as script writing can also be quite helpful -- in catching mistakes and corner cases, and in avoiding repeated effort. Some specific ideas follow.

Registering a new snapshot[edit]

Needed : a process for announcing and registering a snapshot for others to see in progress, contribute to, or download and use. Start with finished projects to date in German, English, and Polish.

Current snapshots:

  • En:wp: SOS CD project; Andrew wrote some scripts for this. 2006 articles on a CD.
    • Initially distributed to benefit a children's charity in early 2006
    • Wikiwizzy's version of the above for distribution in S.Africa
  • De:wp: Directmedia CD, then DVD.
  • Pl:wp: ?? DVD, planned for completion in October (slightly new deadline).

Questions[edit]

  • How should snapshots recognize authors? What's the best way to attribute WP as a project as well as individuals, not simply to satisfy the GFDL?
  • How can snapshots share algorithms? Part of snapshot design is choosing content.
  • How can snapshots get updated? Scripting the creation so they don't get stale; minimizing editorial time needed.
  • Style : different ways to handle
    • templates (navigational, other)
    • foul language
    • images (size, content)
    • text (when too long, for balance)
    • citations
    • redlinks
    • interlanguage, interwiki links
  • Enhancing content : how to handle
    • delicate subjects (warning templates)
    • conflicted subjects (pov templates)
    • fast-changing subjects (news / current event templates)
    • (note: all this can update the dynamic database directly)

Listing related scripts[edit]

'Main article: :
  • Interactivity and interfaces : Front-ends to read and interact with different snapshot formats.
  • Reducing text : summarizing, auto-excerpting
  • Ranking text : bot-assisted reviewing/vetting/rating, metric analysis (apsp, grank, hit-popularity, edit-popularity, expertise, writing style, &c)
  • Metadata : bot-assisted annotation (audience, type, categorization)
  • Spellchecker, grammar checker
  • Copyvio checker
  • Image resizing & compression
  • Metadata extraction
    • History metadata (list of users, freshness, &c)
    • Image/media metadata
  • Index generation (for browsing)
    • Category tree generation

Updates and recommendations[edit]

  • Add links to new moulin and kiwix projects, update links from local wikipedias re: 1.0 (via interwiki links). +sj | help with translation |+
  • Add links to review ideas, including content stamping (since 2000) +sj | help with translation |+ 20:15, 26 February 2008 (UTC)
  • set up a cron job with a script to update static content whenever content dump is done
    This actually sounds like a project for the Wikimedia Toolserver. -- Mathias Schindler 09:19, 3 June 2006 (UTC)

See also[edit]