Wikimedia Blog/Drafts/In just over a year 100000 new Wikipedia articles have been written with Content Translation

From Meta, a Wikimedia project coordination wiki

Title ideas[edit]

  • In just over a year 100000 new Wikipedia articles have been written with Content Translation
  • These are some of the people who translated 100,000 Wikipedia articles in the last year
  • The new Content Translation Tool has created 100,000 new Wikipedia articles in just over one year


100000 new Wikipedia articles added with the Content Translation tool. Some users share their experiences on how this tool helped them edit Wikipedia."

  • ...


This video explains how you can turn on Content Translation and start translating Wikipedia articles for yourself. Also view on and A version with burned-in English-language subtitles is here.

Content Translation, the Wikipedia article translation tool, was introduced in January 2015. A few days ago, the article about the 1950s song Crying, Waiting, Hoping was written for the Spanish Wikipedia with this tool. The reason we noted it was because it was the 100,000th new Wikipedia page that was written with Content Translation. The tool is still a beta feature, but it’s already heavily used by Wikipedia editors. Everyday we, Content Translation’s developers, are making new improvements to iron out problems and bring changes that will help editors write high quality articles conveniently and reduce post publication clean ups.

From the time Content Translation was being designed we have given priority to what users, particularly Wikipedia editors, have highlighted as important features for a translation tool like this. Before actual development work started we conducted research sessions with users. Later, as Content Translation was being made available gradually to more users we continued this practice of collecting feedback and integrating their views with the development plans.

The Catalan Wikipedia community was the earliest group of users of Content Translation. Most speakers of Catalan are bilingual and they quickly adopted this tool to write new articles by translating them from the Spanish Wikipedia. Editors in many other languages have similarly used the tool to translate high quality articles from other wikis.

Ravishankar Ayyakkannu
Nahid Sultan
Àlex Hinojo

We spoke to Ravishankar Ayyakkannu who edits the Tamil Wikipedia. He mentioned that very often articles are translated into Tamil from the English Wikipedia. While Content Translation has not been used extensively for the Tamil Wikipedia translathons yet, Nahid Sultan coordinated a successful online translation campaign for the Bangla Wikipedia to celebrate Wikipedia 15. 600 new articles were translated from English Wikipedia good articles list. 550 users participated in the event and Content Translation was heavily used to introduce new editors on how to write new Wikipedia articles.

Mehtab Solangi, a Sindhi Wikipedia editor from Pakistan started contributing from August 2015. He came across Content Translation while looking around the preference settings and liked the tool so much that he introduced it widely to other editors. He had never translated an article before and prefers using the tool as it offers a clear structure for the new article, adapts links and categories that may have been difficult for new users who are still learning wikitext.

During the many months of design and development we spoke to many editors who liked the interface and the ease of translation that the tool provides. Automatic translation support through machine translation services has been a major advantage for many. The Catalan Wikipedia editors have been using Apertium heavily to translate from Spanish and have observed about significant reduction in the time taken to translate a new article. Àlex Hinojo, in an earlier blog post mentioned that he could write a new article of about 20 lines in less than 5 minutes, with the tool taking care of much of the wikitext changes that were earlier necessary.

Like many other editors, Olena who edits mostly the Ukrainian Wikipedia, said that she would also recommend Content Translation to new editors. However, she adds a caveat that editors should be aware of the fact that the tool does have its shortcomings and it is important to check the accuracy of the content and any other errors that may need correction. She emphasized that to ensure exchange of knowledge, there is need for better machine translation support between languages, to be able to translate from many different wikis about content that is topical for a region or culture and is often less represented in other wikis.

While many new articles are getting written with Content Translation, several users have requested extending the tool to support translation of existing articles, especially stubs that can be improved from well written articles about the same topic in other languages. Better template handling is another common issue that many users have requested. While the former needs more thought, the Language team is already planning for a major overhaul to template support in the coming months.

Aside from the editors, Content Translation has also been able to connect with developers who are enthusiastic about supporting the tool. Kevin Brubeck Unhammer, who often edits the Norwegian Nynorsk Wikipedia proposed a project under the Individual Engagement Grants (IEG) and improved machine translation support for Danish, Swedish, Norwegian Bokmål and Norwegian Nynorsk, that can be used with Content Translation. He earlier used some custom hacks to use Apertium while translating articles. Kevin suggests that Content Translation should be introduced to more people working in the field of language technology to interest other developers to come forward and contribute.

As we continue improvements on Content Translation, we also aim to reach out and introduce the tool to more Wikipedia users and help grow the sum all human knowledge across languages. The Medical Translation Project is successfully using the tool to expand its coverage of essential health content in many languages. They have observed about 17% improved productivity in the efforts taken to coordinate and complete translations, thus helping spread medical information faster around the world. We plan to continue supporting similar initiatives and connect with more individual users and groups who are working on topical projects, both short and long term. We hope to better connect with editathon and translathon events where Content Translation can be used.

Content Translation is developed by the Wikimedia Language team. You can reach us via the Content Translation project page or follow @WhatToTranslate on twitter for updates on Content Translation.


Runa Bhattacharjee, Language team, Wikimedia Foundation