Wait hold on I just realized.
-
@mcc unfortunately there’s not really a good solution to this problem and Android, like everyone else, just has to pick a resolution method and stick with it. If you’ve heard of “Han Unification,” well it sounds like something that happened violently in 2200 BC but actually it happened quite recently in a Unicode meeting room and it causes this exact specific intractable issue
@0xabad1dea @mcc also an act of violence, I would argue
-
@mcc it seems like it is actually using the declared language on my end because if I switch the post language here to Japanese I see the Japanese variants of the characters, and if I switch it back to Chinese I see the Chinese variants of the characters.
Test post marked as Japanese: 八人入
@mcc I couldn’t find an Android API to do this right, and I found what seems like a reasonable iOS API but it doesn’t do what I was expecting so I’m not actually sure it’s possible to do this well except with web technologies
-
Wait hold on I just realized. Is
八人入
A reasonable Chinese sentence
@mcc you've been on Chinese fanfiction sites I see
-
@Heliograph @rk The 个 is a friend that you give to a number so that it does not get lonely
@mcc @Heliograph @rk
It's a Totoro umbrella. -
Okay I now believe the problem is neither Tusky nor Lenovo but rather that Android is not a serious product and never has been. It seems Android may outright refuse to show scripts unless you've whitelisted the language. Problem: I think this menu is asking me which version of Chinese I want but the menu is in Chinese. I want to look at Chinese text so I can learn Chinese. I don't know it yet. I feel like I'm playing an adventure game.
* I may explore a PR later anyway.
@mcc i think the first one is japanese, the second simplified chinese.
-
@mcc I couldn’t find an Android API to do this right, and I found what seems like a reasonable iOS API but it doesn’t do what I was expecting so I’m not actually sure it’s possible to do this well except with web technologies
@porglezomp I'm talking to someone and we think we found the android API
-
@mcc unfortunately there’s not really a good solution to this problem and Android, like everyone else, just has to pick a resolution method and stick with it. If you’ve heard of “Han Unification,” well it sounds like something that happened violently in 2200 BC but actually it happened quite recently in a Unicode meeting room and it causes this exact specific intractable issue
@0xabad1dea @mcc I suppose the only actually reliable approach would be to store the IME locale per character or something so that it can be accurately rendered as it was written... or are these truly identical graphemes, and there's no chance of confusion in context? Even when people use multiple languages simultaneously?
(late edit after reading a lot more: ah, I see they DID just add a variant-selector character to effectively specify the locale... that seems a bit unlikely to gain major use, but technically I like it I guess)
Maybe one day we'll have UTF-8-2 and it'll just be infinitely extendable, rather than using a limited length prefix.
-
@0xabad1dea @mcc I suppose the only actually reliable approach would be to store the IME locale per character or something so that it can be accurately rendered as it was written... or are these truly identical graphemes, and there's no chance of confusion in context? Even when people use multiple languages simultaneously?
(late edit after reading a lot more: ah, I see they DID just add a variant-selector character to effectively specify the locale... that seems a bit unlikely to gain major use, but technically I like it I guess)
Maybe one day we'll have UTF-8-2 and it'll just be infinitely extendable, rather than using a limited length prefix.
@groxx @0xabad1dea There are various existing solutions but just because the solutions exist does not mean people follow them corectly
-
@groxx @0xabad1dea There are various existing solutions but just because the solutions exist does not mean people follow them corectly
@mcc @0xabad1dea definitely agreed. even technically, it seems very unlikely to me that any IME is going to choose to, like, add variant selectors *to every single character* and confuse their users when it's blended with other text or in a size-limited scenario. those characters already take up a ton of space, making it worse won't go over well.
-
@mcc I have to wonder if this is downstream of Unicode's choices around CJK unification. Because I seem to remember reading that it ended up causing some situations where, in order to correctly render a block of text, you need out-of-band knowledge of which language it's in.
@rachelplusplus
Yeah, I remember this from doing some i18n work, and the Wikipedia article appears to agree. -
Anyways I think the sentence was wrong to start with because it's missing 个s or something
鸡机
Attempting to imagine the Chicken Machine
-
鸡机
Attempting to imagine the Chicken Machine
@mcc fun fact 雞雞 is baby talk for penis lol
-
@mcc fun fact 雞雞 is baby talk for penis lol
@ionchy nice lol
-
@Gaelan I thought this was a good solution https://mastodon.social/@noone2333/116146702971358411
-
@Gaelan I thought this was a good solution https://mastodon.social/@noone2333/116146702971358411
@Gaelan Gaelan how do you feel about wuxia and/or girls who are really really *really* good friends
-
@Gaelan Gaelan how do you feel about wuxia and/or girls who are really really *really* good friends
@mcc uh, I’m not particularly familiar with Wuxia but am myself a girl who is really^3 good friends with a number of girls
-
鸡机
Attempting to imagine the Chicken Machine
-
鸡机
Attempting to imagine the Chicken Machine
@mcc one needs not imagine the chicken machine https://youtu.be/dl9beG4LbJU