Community Wishlist Survey 2022/Translation/Add some basic statistics about article/section/paragraph being translated

From Meta, a Wikimedia project coordination wiki

Add some basic statistics about article/section/paragraph being translated

  • Problem: When talking about people who learn a language as an additional language, readabililty of a text or section is a big issue. While there are different ways to calculate the readability of a text, most rely on specific word lists ('controlled vocabulary'). Something that would help make a translation understandable is to improve these scores. For this reason, it might make sense to calculate different scores for a text, or a section of text, most notably word length, and sentence length. It might also make sense to highlight sentences that are much longer than the average sentence length; if there's something like a word list, might also make sense to highlgt words that are not part of this list (optional). If there are several such word lists, the user must be able to choose one among those available (optional). The sipmle counting and comparing of word length/sentence length for a given section would already be helpful.
  • Proposed solution: Add word-length/sentence length, for the whole translated text, section being translated, or selection of text. Of course, this should be done for both the source text and the translation.
  • Who would benefit: Translators who want to make sure their translation is easy to understand for a target audience.
  • More comments:
  • Phabricator tickets: https://www.mediawiki.org/wiki/Topic:Wo1y6f8y8p7hfovc
  • Proposer: Eptalon (talk) 22:03, 17 January 2022 (UTC)[reply]

Discussion

  • What about some of the attitude of the readers? Sometimes, we have to translate advance knowledge, which can be hard to implement. Thingofme (talk) 02:37, 22 January 2022 (UTC)[reply]
    Even if you are talking about advanced scientific concepts making shorter sentences will make the aticle easier to understand. Also note, that this is just a number (or a set of numbers) that editors can throw around. At least at the start, the success of Reader's digest wasthat they summarized' and shortened other publications. Calculating and publishing these numbers won't keep anyone from doingg anything, but it might help those who at least try to writie in simpler/easier to understand language. -Eptalon (talk) 16:46, 22 January 2022 (UTC)[reply]

Voting