The problem is that even if the source is UTF-8 with BOM, "שלום" would
be encoded according to the locale's 8-bit codepage, like 1255 or 936,
and not as a UTF-8 string (codepage 65001).
It is rather stupid, but this is how MSVC works or understands
the place of UTF-8 in this world.
Unicode and Visual Studio is just broken...
It's a wonder that someone like this ever got the idea of designing a locale library. Here is what I'd like to tell him:
Do us all a favor and cease developing the locale library. It's not only useless, but also harmful. It encourages people to use ASCII.
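For anyone who wants to see what the quoted complaint is about, here is a minimal sketch of how one might check which bytes a compiler actually stores for the narrow literal "שלום". The names `kExpectedUtf8`, `kLiteral`, `literal_is_utf8` and `dump_literal` are made up for this illustration, and the sketch assumes the source file itself is saved as UTF-8.

```cpp
#include <cstddef>
#include <cstdio>
#include <cstring>

// UTF-8 bytes of "שלום" (U+05E9 U+05DC U+05D5 U+05DD), written as hex
// escapes so that this array does not depend on any charset setting.
static const char kExpectedUtf8[] = "\xD7\xA9\xD7\x9C\xD7\x95\xD7\x9D";

// The same word as an ordinary narrow literal. Which bytes end up here
// depends on the compiler's execution character set.
static const char kLiteral[] = "שלום";

// Hypothetical helper: true when the narrow literal was stored as UTF-8.
bool literal_is_utf8() {
    return sizeof(kLiteral) == sizeof(kExpectedUtf8) &&
           std::memcmp(kLiteral, kExpectedUtf8, sizeof(kExpectedUtf8)) == 0;
}

// Hypothetical helper: print the literal's bytes in hex, without the
// terminating NUL.
void dump_literal() {
    for (std::size_t i = 0; i + 1 < sizeof(kLiteral); ++i)
        std::printf("%02X ", static_cast<unsigned char>(kLiteral[i]));
    std::printf("\n");
}
```

Calling `dump_literal()` from `main` and comparing the output against `D7 A9 D7 9C D7 95 D7 9D` shows whether the literal survived as UTF-8. If the quote is right, a compiler that re-encodes the literal into an 8-bit codepage such as 1255 would store one byte per letter instead, and `literal_is_utf8()` would return false.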
So that people who see this entry can follow the discussion easily, here is a link to the discussion in question.
http://thread.gmane.org/gmane.comp.lib.boost.devel/218369/focus=218476
Oh, so there was something that handy.
It is very kind of you to detach the quote from its context.
Yes, MSVC and Unicode is broken, as it does not allow creating a UTF-8 text literal the way all other compilers around do.
It is a HUGE issue whether you like it or not.
Artyom Beilis.
MSVC doesn't support UTF-8 because it isn't needed.
Windows uses UTF-16.