----- Original Message -----
From: "Ignacio Vazquez-Abrams" <ivazqueznet gmail com>
To: "For participants of the Documentation Project" <fedora-docs-list redhat com>
Sent: Monday, March 30, 2009 3:51 PM
Subject: Re: More Publican Pain
0xc2 is the first byte of a 2-byte UTF-8 sequence for a character between U+0080 and U+00BF.
Ahh, that makes sense then. It is always followed by 0xa0, and together 0xc2 0xa0 is the UTF-8 encoding of a non-breaking space (U+00A0). I don't think the Western European single-byte codepages (such as ISO-8859-1) have any multi-byte sequences, so the problem is entirely in the codepage.
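
For what it's worth, here is a small Python sketch of what I think is happening. The variable names are just illustrative, and cp1252 is my assumption for the Western European codepage involved; I have not confirmed which one the toolchain actually uses.

```python
# A non-breaking space is U+00A0. In UTF-8 it takes two bytes.
nbsp = "\u00a0"
utf8_bytes = nbsp.encode("utf-8")
print(utf8_bytes.hex())  # c2a0

# Assumption: the reader treats the file as cp1252, a single-byte codepage.
# Each byte then becomes its own character, so 0xc2 shows up as a stray
# U+00C2 (A with circumflex) in front of the real non-breaking space.
misread = utf8_bytes.decode("cp1252")
print(len(misread))    # 2
print(ascii(misread))  # '\xc2\xa0'

# Every character from U+0080 to U+00BF has 0xc2 as its first UTF-8 byte;
# U+00C0 is the first one that starts with 0xc3 instead.
lead_bytes = {chr(cp).encode("utf-8")[0] for cp in range(0x80, 0xC0)}
print(lead_bytes == {0xC2})  # True
```

If that is right, decoding the same bytes as UTF-8 instead of cp1252 should give back a single U+00A0 with no extra character in front of it.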
--McD