Date: Wed, 16 Feb 2005 11:30:33 -0600 (CST)
From: Preston Landers
To: pgsql-docs@postgresql.org
Subject: Suggestion for Encodings table

http://www.postgresql.org/docs/8.0/interactive/multibyte.html#CHARSET-TABLE

I would humbly suggest a few improvements to the Encodings table at that link to make it clearer.

Many of the entries clearly indicate the language or writing system, such as WIN1256 = "Windows CP1256 (Arabic)". I would suggest that every single entry be described that way, with the common name of its language or writing system. Even Unicode could say "All languages".

In particular, the "WIN" encoding just says "CP1251". This is Cyrillic (Russian), but some people might just see the WIN and assume it's the character set that Western/US Windows uses (CP 1252). It's an easy mistake to make, and one I see repeated frequently on other web pages (which call the Windows "Western" code page CP 1251).
Someone reading English-language docs and seeing a "WIN" character set might naturally assume that it is the English Windows character set. (Which, BTW, is not currently supported by PG for conversions.)

Some more examples that might improve clarity:

  LATIN5 should say "Turkish"
  LATIN6 should say "Nordic"
  ALT and KOI8 should say "Cyrillic" (or Russian)
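
To illustrate why the CP1251 / CP 1252 mix-up matters, here is a small Python sketch (not PostgreSQL code). The six sample bytes are my own choice for the example: they spell a Russian greeting in CP1251, and Python's standard "cp1251" and "cp1252" codecs stand in for the two Windows code pages.

```python
import unicodedata

# Six bytes chosen for this illustration (an assumption, not taken from the
# PostgreSQL docs): they spell a Russian greeting when read as CP1251.
raw = bytes([0xCF, 0xF0, 0xE8, 0xE2, 0xE5, 0xF2])

as_cyrillic = raw.decode("cp1251")  # intended reading: Cyrillic
as_western = raw.decode("cp1252")   # mistaken reading: Western/US Windows

# Print the Unicode name of each character so the output stays plain ASCII.
for label, text in (("cp1251", as_cyrillic), ("cp1252", as_western)):
    print(label)
    for ch in text:
        print("   U+%04X %s" % (ord(ch), unicodedata.name(ch)))
```

The same bytes come out as Cyrillic letters under CP1251 and as accented Latin letters under CP 1252, so a reader who assumes "WIN" means the Western code page ends up with unreadable text.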