strange letters / encoding prob?

  • Unknown's avatar

    about a week ago some (not all) of the non-English letters on my blog went crazy … i waited a bit, thinking it would get repaired, but nothing has changed … you can view it here: http://horizonts.wordpress.com/ … i've started changing them back slowly, and new ones appear ok, but why did it happen?

  • Unknown's avatar

    Somehow your charset is set to UTF-8. Is this correct? I’m guessing that it’s not. You may want to change this in your Dashboard and send in feedback to see if your previous posts can be changed over, if that’s possible.

    Good luck,
    -drmike

  • Unknown's avatar

    utf-8 is ok and *recommended* in most cases, including Lettish.

  • Unknown's avatar

    That looks like Russian to me. (It’s been way too long.) Isn’t that koi8-r? To get most foreign characters into UTF-8, they usually have to be encoded. If you look at the source code of the page, you’ll see lots of encoding. I was thinking that was horizonts’ issue. If it’s not, please forgive me. I don’t usually deal with foreign languages. I normally stay out of these.

  • Unknown's avatar

    well, utf-8 certainly is ok … and it was ok, until one day it all changed to those question marks (at least that’s what they look like from here) …

  • Unknown's avatar

    AH! Question marks! That’s a different issue. :)

    That usually means your browser doesn’t support the charset or font used in the webpage. That’s why I and others are seeing the actual characters. Something’s different with your setup. Have you done any updating on your home computer lately?

  • Unknown's avatar

    Since last week I have had a similar problem, except that I write in Chinese. I sent feedback about a week ago but the problem still exists.

    I don’t think it’s about the browser or encoding, since I didn’t change any settings on my blog or my computers. I use different computers but the problem is the same.

    I remember that similar problems have happened before on wordpress.com.

  • Unknown's avatar

    Russian uses the Cyrillic alphabet (which is derived from Greek, so its letters look a bit similar); this blog mostly uses Latin letters with some extensions.

    KOI8-R is a default Russian encoding for e-mail/newsgroup exchange (designed so that it can still be read, transliterated into Latin chars, on a dumb terminal w/o Cyrillic fonts installed).
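
    To illustrate the transliteration trick, here is a minimal Python sketch. It assumes the "dumb terminal" simply drops the eighth bit of every byte; the helper name `strip_high_bit` is made up for this example.

```python
def strip_high_bit(text: str) -> str:
    """Encode text as KOI8-R, clear the top bit of each byte, read it as ASCII."""
    koi8_bytes = text.encode("koi8_r")
    # Simulate a 7-bit channel or terminal: keep only the low 7 bits.
    seven_bit = bytes(b & 0x7F for b in koi8_bytes)
    return seven_bit.decode("ascii")


# The result is a rough Latin transliteration with the letter case flipped.
print(strip_high_bit("Привет"))  # pRIWET
```

    The case comes out inverted, but the text stays readable, which is the property described above.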

    there are some HTML entities encoded (apostrophes, ampersands and such) in the source; this is normal.

    I don’t blame you, I just wanted to point out that changing the encoding won’t solve horizonts’ (and other people’s) problem w/ garbled foreign chars.

    as was already discussed, smth went wrong with the db encoding just before the last major downtime. I am afraid that the last backup was made in Latin-1 (instead of UTF-8), so…
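
    A rough Python sketch of why a Latin-1 backup would be bad news. It assumes the conversion silently replaced every character Latin-1 cannot represent (nobody here has confirmed what actually happened on the server); `roundtrip_through_latin1` is a made-up helper name.

```python
def roundtrip_through_latin1(text: str) -> str:
    """Store text as Latin-1, replacing unencodable characters, then read it back."""
    stored = text.encode("latin-1", errors="replace")  # unencodable chars become b"?"
    return stored.decode("latin-1")


for sample in ("café", "Rīga", "中文"):
    print(sample, "->", roundtrip_through_latin1(sample))
# café -> café   (é exists in Latin-1, so it survives)
# Rīga -> R?ga   (ī does not, so it is lost)
# 中文 -> ??
```

    Once a character has been turned into a plain "?", the original cannot be recovered from the stored bytes, which would explain why only some letters broke and why retyping a post fixes it.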

    (lucky you, staying out of this crazy convoluted stuff ;-)

  • Unknown's avatar

    The question marks mean missing information / fonts / support / etc. in the browser. That’s what you get. A simple Google search will support that.

  • Unknown's avatar

    I still believe this is not because of the browser.

    I just re-typed the title of my blog and the latest post, and they turned out fine. If WordPress cannot recover my posts, I think I might repost them when I have time later. Luckily, I don’t blog very often…

    It’s not only the browser; the server can screw up the page display too…

  • Unknown's avatar

    if we could see the MySQL tables now, we’d see that the garbled accented characters are already stored as diamond question marks there.

    a simple Google search gives us:

    If an accented character displays as a diamond question mark, Movable Type is publishing a ISO-8859-1 database with UTF-8 pages.

    Chinese and other non-Latin scripts just can’t be encoded properly in a database that uses ISO-8859-1 (less formally known as Latin-1), which essentially is “a standard character encoding of the Latin alphabet”.
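
    For anyone curious, the mismatch in that quote can be reproduced in a few lines of Python. This is only a sketch of the general mechanism, not of what WordPress.com or Movable Type actually do internally; all three function names are made up for the example.

```python
def serve_latin1_as_utf8(text: str) -> str:
    """Latin-1 bytes interpreted as UTF-8: invalid bytes become U+FFFD."""
    return text.encode("latin-1").decode("utf-8", errors="replace")


def read_utf8_as_latin1(text: str) -> str:
    """The opposite mix-up: UTF-8 bytes interpreted as Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(mojibake: str) -> str:
    """Undo read_utf8_as_latin1, which works because no bytes were lost."""
    return mojibake.encode("latin-1").decode("utf-8")


print(serve_latin1_as_utf8("café"))          # caf\ufffd, shown as the diamond question mark
print(read_utf8_as_latin1("café"))           # cafÃ©
print(repair(read_utf8_as_latin1("café")))   # café
```

    The second kind of garbling is reversible; the first one, and anything already saved as "?", is not.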

  • Unknown's avatar

    I’m posting in Korean and I have the same problem. I’ve been using UTF-8 as the default encoding from the beginning and I haven’t changed any encoding settings or browser options. My friends are reporting broken characters on my blog. I’m sure this isn’t an end-user problem but a server-side problem.

  • The topic ‘strange letters / encoding prob?’ is closed to new replies.