• calcopiritus@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    4 months ago

    The purpose of Unicode is to be able to represent everything humans have written. Doesn’t matter if correct or not.

    There are some Chinese characters that appear only once in written text, but they happen to be just typos of copying other text. They exist in Unicode.

    • renzev@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      4 months ago

      With all of those obscure characters that they keep on adding, you’d think they’d have the decency to have separate sets for japanese and chinese characters. But nope, those are all lumped together into the CJK Unified Ideographs block. Whether a character shows up chinese-style or japanese-style depends on the font.

      • calcopiritus@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        4 months ago

        I have absolutely no idea about Chinese or Japanese characters, but if they did that there’s probably a technical reason like retro compatibility or something. Unicode has free space left for millions or billions of characters.

        • renzev@lemmy.worldOP
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 months ago

          Unicode has free space left for millions or billions of characters.

          I might be wrong, but isn’t unicode essentially unlimited? Like, they’re just assigning numbers (codepoints) to individual characters. Any limitation would come from encodings like utf-8, no?

    • MarcomachtKuchen
      link
      fedilink
      arrow-up
      1
      ·
      4 months ago

      That’s correct. The mistake made was making an error while transcribing the same symbol into Unicode