I was working with my Wiki site the other day, and experienced the following strangeness: I'd enter text in Russian (actually, I'd paste it in an HTML field), and then, upon saving the result, I'd get a set of mostly vowels with diacritical marks (character values 129 and up, ISO-8859-1, if memory serves).
(Recall I have had the same problem with LJ when posting Cyrillic using Semagic 1.2.1.1.)
I tried to document what I did, and in doing so, I pasted some ISO-8859-1 characters (obtained by running ClipConvert, which does things to your Windows clipboard to change Unicode to single-byte chars) into the page.
Suddenly, the Cyrillic Unicode started displaying properly, as did the ISO-8859-1 characters.
I just tried the same trick in Semagic, but it doesn't work. It turns out the text I input/paste already consists of 8-bit characters. Go figure.
Ah, well, it's still a puzzle.
Cheers...
(Recall I have had the same problem with LJ when posting Cyrillic using Semagic 1.2.1.1.)
I tried to document what I did, and in doing so, I pasted some ISO-8859-1 characters (obtained by running ClipConvert, which does things to your Windows clipboard to change Unicode to single-byte chars) into the page.
Suddenly, the Cyrillic Unicode started displaying properly, as did the ISO-8859-1 characters.
I just tried the same trick in Semagic, but it doesn't work. It turns out the text I input/paste already consists of 8-bit characters. Go figure.
Ah, well, it's still a puzzle.
Cheers...
no subject
Date: 2003-07-06 11:50 pm (UTC)