Wait hold on I just realized.
-
@mcc unfortunately there’s not really a good solution to this problem and Android, like everyone else, just has to pick a resolution method and stick with it. If you’ve heard of “Han Unification,” well it sounds like something that happened violently in 2200 BC but actually it happened quite recently in a Unicode meeting room and it causes this exact specific intractable issue
@0xabad1dea @mcc I suppose the only actually reliable approach would be to store the IME locale per character or something so that it can be accurately rendered as it was written... or are these truly identical graphemes, and there's no chance of confusion in context? Even when people use multiple languages simultaneously?
(late edit after reading a lot more: ah, I see they DID just add a variant-selector character to effectively specify the locale... that seems a bit unlikely to gain major use, but technically I like it I guess)
Maybe one day we'll have UTF-8-2 and it'll just be infinitely extendable, rather than using a limited length prefix.
-
@0xabad1dea @mcc I suppose the only actually reliable approach would be to store the IME locale per character or something so that it can be accurately rendered as it was written... or are these truly identical graphemes, and there's no chance of confusion in context? Even when people use multiple languages simultaneously?
(late edit after reading a lot more: ah, I see they DID just add a variant-selector character to effectively specify the locale... that seems a bit unlikely to gain major use, but technically I like it I guess)
Maybe one day we'll have UTF-8-2 and it'll just be infinitely extendable, rather than using a limited length prefix.
@groxx @0xabad1dea There are various existing solutions but just because the solutions exist does not mean people follow them corectly
-
@groxx @0xabad1dea There are various existing solutions but just because the solutions exist does not mean people follow them corectly
@mcc @0xabad1dea definitely agreed. even technically, it seems very unlikely to me that any IME is going to choose to, like, add variant selectors *to every single character* and confuse their users when it's blended with other text or in a size-limited scenario. those characters already take up a ton of space, making it worse won't go over well.
-
@mcc I have to wonder if this is downstream of Unicode's choices around CJK unification. Because I seem to remember reading that it ended up causing some situations where, in order to correctly render a block of text, you need out-of-band knowledge of which language it's in.
@rachelplusplus
Yeah, I remember this from doing some i18n work, and the Wikipedia article appears to agree. -
Anyways I think the sentence was wrong to start with because it's missing 个s or something
鸡机
Attempting to imagine the Chicken Machine
-
鸡机
Attempting to imagine the Chicken Machine
@mcc fun fact 雞雞 is baby talk for penis lol
-
@mcc fun fact 雞雞 is baby talk for penis lol
@ionchy nice lol
-
@Gaelan I thought this was a good solution https://mastodon.social/@noone2333/116146702971358411
-
@Gaelan I thought this was a good solution https://mastodon.social/@noone2333/116146702971358411
@Gaelan Gaelan how do you feel about wuxia and/or girls who are really really *really* good friends
-
@Gaelan Gaelan how do you feel about wuxia and/or girls who are really really *really* good friends
@mcc uh, I’m not particularly familiar with Wuxia but am myself a girl who is really^3 good friends with a number of girls
-
鸡机
Attempting to imagine the Chicken Machine
-
鸡机
Attempting to imagine the Chicken Machine
@mcc one needs not imagine the chicken machine https://youtu.be/dl9beG4LbJU