Skip to content

Clarify guidelines for selecting Chinese-related language codes #512

@zih-syuan

Description

@zih-syuan

Chinese-related language codes look confusing because Chinese can be classified along multiple independent axes:

  • By language / variety (e.g. Mandarin, Minnan, Yue, Wu, Hakka)
  • By historical period (e.g. Classical Chinese vs. modern languages)
  • By region or usage standard (e.g. Taiwan, Macao, Hong Kong, Mainland China, Singapore)
  • By writing system (e.g. Traditional vs. Simplified characters, Latin-based romanization)

Wikidata preserves all of these distinctions, even when they overlap in everyday usage.
Therefore, we need to guide users toward consistent and meaningful choices within a flattened table structure.

To-do

  • Documentation / Guidelines
    • Add a “Language Code Guideline” section to WikiPage: Add A New Instrument Name
      • A language code in UMIL
      • Understanding the Chinese (Sinitic) language family in UMIL
      • Guideline to Selecting the Language Code for Sinitic Language Family
    • Update WikiPage: FAQ
      • Add: What is a language code in UMIL?
      • Add: Why are there multiple Chinese-related language codes?
      • Add: Why some language codes are not recommended
  • UI: Add a link to the language code guideline in UMIL website
    • Location: “Add instrument name” button or help text nearby
    • Purpose: allow users to check guidance before selecting a code

Metadata

Metadata

Assignees

Labels

documentationImprovements or additions to documentation

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions