= Сообщение: 1101 из 7128 ====================================== FTSC_PUBLIC = От : Michiel van der Vlist 2:280/5555 14 Jan 14 13:03:47 Кому : Björn Felten 14 Jan 14 13:03:47 Тема : CHRS kludge implementations FGHI : area://FTSC_PUBLIC?msgid=2:280/5555+52d529ce На : area://FTSC_PUBLIC?msgid=2:203/2+52d35980 = Кодировка сообщения определена как: UTF-8 ================================== Ответ: area://FTSC_PUBLIC?msgid=2:203/2+52d54a06 ============================================================================== Hello Björn,
On Monday January 13 2014 04:12, you wrote to Nicholas Boel:
NB>> If it's at a level 4, it's supposed to let other software know NB>> there are multibyte characters in the header or message.
BF> As if UTF-8 in itself doesn't say that, you mean?
UTF-8 has some redundancy. Not all byte combinations are "well formed". The presence of non "well formed" strings is a good indications it is not UTF-8.
In your case he following strategy seems to work:
o When it is a "well formed" string assume the encoding is UTF-8.
o Obviously a single byte in the range 128-255 is not a well formed string.