Mail Archives: djgpp/2000/06/01/11:01:00
> From: Martin Str|mberg <ams AT ludd DOT luth DOT se>
> Date: Thu, 1 Jun 2000 14:04:17 +0200 (MET DST)
> In emacs in Linux if I insert an "ö" (octal 366) and look at the file
> in NOTEPAD in WINDOZE I see an "ö". But if I use emacs in WINDOZE I
> see another character (something I'd call a divide symbol) similar to
> a ":" with the dots sperated by a "-".
(This is better posted to gnu.emacs.help.)
Short answer: if you want Notepad to see the same character, tell
Emacs to save the buffer with latin-1 encoding, like this:
C-x C-m f latin-1 RET C-x C-s
(You only need the "C-x C-m f latin-1 RET" part once.)
Long answer: The problem is that non-ASCII characters can be encoded
in several different ways. The general issue is too broad to explain,
but for Western Europe, the encoding used by Unix and Linux is ISO8859-1,
a.k.a. Latin-1. Windows uses codepage 1252 which just happens to be
identical with Latin-1 (other locales, such as Cyrillic and East
European, are not so lucky). OTOH, the DJGPP port of Emacs by default
uses the DOS codepage set up on your system; for West Europe this is
typically either cp850 or cp437. Thus, Emacs encodes the same
character differently than Windows programs expect, and therefore
Windows programs display a different glyph for that code. If you look
at the file with another DJGPP program, such as Less, you *will* see the
glyph you expect.
You can force Emacs to use specific encoding with "C-x C-m f" or
"C-x C-m c" key sequences. These are described in the Emacs manual.
For more info about codepage support in Emacs, read the node "MS-DOS
and MULE" in the on-line manual.
(I'm assuming you use Emacs 20.x; if not, you don't have an easy way
of producing different encodings of the same character.)
- Raw text -