alexpgp | A test...


I was working with my Wiki site the other day and ran into the following strangeness: I'd enter text in Russian (actually, I'd paste it into an HTML field), and then, upon saving the result, I'd get back a string of mostly vowels with diacritical marks (character values 128 and up, ISO-8859-1, if memory serves).
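For anyone who wants to see this kind of garbling on their own machine, here is a minimal Python sketch. It assumes (the post does not confirm this) that the Russian text was stored as Windows-1251 bytes and then displayed as ISO-8859-1; the sample word is hypothetical as well. Under that assumption, every Cyrillic letter lands in the upper range of ISO-8859-1, which is mostly accented vowels.

```python
# Hypothetical sample text; the post does not say what was actually pasted.
text = "Привет"

# Assumption: the text was saved as single-byte Windows-1251 (cp1251).
raw = text.encode("cp1251")

# Reading those same bytes as ISO-8859-1 maps each one to a Latin-1
# character, most of them letters with diacritical marks.
garbled = raw.decode("latin-1")
print(garbled)

# Reversing the two steps recovers the original, because ISO-8859-1
# maps every byte value to exactly one character.
restored = garbled.encode("latin-1").decode("cp1251")
print(restored)
```

If the real culprit was a different pairing (say, UTF-8 bytes read as ISO-8859-1), the garbled output would look different, so this is only one plausible reconstruction.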

(Recall I have had the same problem with LJ when posting Cyrillic using Semagic 1.2.1.1.)

I tried to document what I did, and in doing so, I pasted some ISO-8859-1 characters into the page (obtained by running ClipConvert, which converts Unicode text on your Windows clipboard to single-byte characters).

Suddenly, the Cyrillic Unicode started displaying properly, as did the ISO-8859-1 characters.

I just tried the same trick in Semagic, but it doesn't work. It turns out the text I input/paste already consists of 8-bit characters. Go figure.

Ah, well, it's still a puzzle.

Cheers...


From:

brenk.livejournal.com

This is way beyond the bounds of my geek-powers and I am thus impressed *g*. One thing that struck me, though, is that if I remember rightly Semagic was developed by a Russian? Probably in English, I suppose...?