Community Wishlist Survey 2023/Search and Categories/Sort pages with Chinese Japanese or Korean titles in a more useful order

From Meta, a Wikimedia project coordination wiki

Sort pages with Chinese Japanese or Korean titles in a more useful order

  • Problem: In category pages, pages with titles that begin with a kanji are currently sorted in Unicode order and grouped by the first character in the title. This is not a traditional way to sort and group things in CJK languages, and it’s not very helpful because Unicode order is opaque and does not correspond to anything in any CJK language.
  • Proposed solution: Titles that begin with a kanji should be grouped the traditional way, by stroke count (e.g., the kTotalStrokes field in Unihan_IRGSources.txt if using Unicode 14), then within any stroke count grouping, sorted by Kangxi radical order (e.g., the integer part of the kRSKangXi field in Unihan_RadicalStrokeCounts.txt), any ambiguities that remain can be kept in Unicode order.
  • Who would benefit: native CJK readers who use category pages
  • More comments: I read a proposal to sort these by pronunciation, but that would not work because every kanji is pronounced differently in different languages (and there can be more than one pronunciation even within a single language), so any pronunciation chosen would privilege one language over all others.
  • Phabricator tickets:
  • Proposer: Al12si (talk) 21:24, 5 February 2023 (UTC)[reply]

Discussion

see also: phab:T47443 and Manual:$wgCategoryCollation.

Any collation that is defined by UCA/UCI as a variant, and that is installed on the servers, and for which a wiki community can show a consensus for, can already be switched to. Im not sure if the kanji scripts are significantly different in that (they might be). —TheDJ (talkcontribs) 12:37, 18 February 2023 (UTC)[reply]

Even if the sorting part can already be addressed the grouping still wouldn’t be ideal. Grouping by stroke count will reduce the number of groupings needed (perhaps significantly in some cases), reducing clutter. Al12si (talk) 06:54, 21 February 2023 (UTC)[reply]

Voting