public inbox for [email protected]  
help / color / mirror / Atom feed
From: Preston Landers <[email protected]>
To: [email protected]
Subject: Suggestion for Encodings table
Date: Wed, 16 Feb 2005 11:30:33 -0600 (CST)
Message-ID: <[email protected]> (raw)


http://www.postgresql.org/docs/8.0/interactive/multibyte.html#CHARSET-TABLE

I would humbly suggest a few improvements to that Encodings table to
improve the clarity.

Many of the entries clearly indicate the language or writing system, such
as WIN1256 = "Windows CP1256 (Arabic)"

I would suggest that every single entry should be described that way with
the common language or writing system name.  Even Unicode could say "All
languages".

In particular, the "WIN" encoding just says "CP1251" -- this is Cyrillic
(Russian) but some people might just see the WIN and assume it's the
character set that Western/US Windows uses (CP 1252).

It's an easy mistake to make and one I see repeated frequently on other
web pages (calling Windows "Western" CP 1251.)  Someone reading English
language docs and seeing a "WIN" character set might naturally assume that
it is the English Windows character set.  (Which BTW is not currently
supported by PG for conversions.)

Some more examples that might improve clarity:

 LATIN5 should say "Turkish"

 LATIN6 should say "Nordic"

 ALT and KOI8 should say "Cyrillic"   (or Russian)




view thread (3+ messages)  latest in thread

reply

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Reply to all the recipients using the --to and --cc options:
  reply via email

  To: [email protected]
  Cc: [email protected]
  Subject: Re: Suggestion for Encodings table
  In-Reply-To: <[email protected]>

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox