delorie.com/archives/browse.cgi   search  
Mail Archives: djgpp/2000/06/01/11:01:00

Date: Thu, 1 Jun 2000 16:17:13 +0300 (IDT)
From: Eli Zaretskii <eliz AT is DOT elta DOT co DOT il>
Message-Id: <200006011317.QAA20532@is.elta.co.il>
To: Martin Str|mberg <ams AT ludd DOT luth DOT se>
CC: djgpp AT delorie DOT com
In-reply-to: <200006011204.OAA09202@father.ludd.luth.se> (message from Martin Str|mberg on Thu, 1 Jun 2000 14:04:17 +0200 (MET DST))
Subject: Re: Character differences
Reply-To: djgpp AT delorie DOT com
Errors-To: nobody AT delorie DOT com
X-Mailing-List: djgpp AT delorie DOT com
X-Unsubscribes-To: listserv AT delorie DOT com

> From: Martin Str|mberg <ams AT ludd DOT luth DOT se>
> Date: Thu, 1 Jun 2000 14:04:17 +0200 (MET DST)

> In emacs in Linux if I insert an "ö" (octal 366) and look at the file
> in NOTEPAD in WINDOZE I see an "ö". But if I use emacs in WINDOZE I
> see another character (something I'd call a divide symbol) similar to
> a ":" with the dots sperated by a "-".

(This is better posted to gnu.emacs.help.)

Short answer: if you want Notepad to see the same character, tell
Emacs to save the buffer with latin-1 encoding, like this:

	C-x C-m f latin-1 RET C-x C-s

(You only need the "C-x C-m f latin-1 RET" part once.)

Long answer:  The problem is that non-ASCII characters can be encoded
in several different ways.  The general issue is too broad to explain,
but for Western Europe, the encoding used by Unix and Linux is ISO8859-1,
a.k.a. Latin-1.  Windows uses codepage 1252 which just happens to be
identical with Latin-1 (other locales, such as Cyrillic and East
European, are not so lucky).  OTOH, the DJGPP port of Emacs by default
uses the DOS codepage set up on your system; for West Europe this is
typically either cp850 or cp437.  Thus, Emacs encodes the same
character differently than Windows programs expect, and therefore
Windows programs display a different glyph for that code.  If you look
at the file with another DJGPP program, such as Less, you *will* see the
glyph you expect.

You can force Emacs to use specific encoding with "C-x C-m f" or 
"C-x C-m c" key sequences.  These are described in the Emacs manual.

For more info about codepage support in Emacs, read the node "MS-DOS
and MULE" in the on-line manual.

(I'm assuming you use Emacs 20.x; if not, you don't have an easy way
of producing different encodings of the same character.)

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019