Received: from maia.hub.org (maia-5.hub.org [200.46.204.29]) by mail.postgresql.org (Postfix) with ESMTP id 2FE1DB5DBE1 for ; Fri, 20 May 2011 05:45:03 -0300 (ADT) Received: from mail.postgresql.org ([200.46.204.86]) by maia.hub.org (mx1.hub.org [200.46.204.29]) (amavisd-maia, port 10024) with ESMTP id 87506-03 for ; Fri, 20 May 2011 08:44:56 +0000 (UTC) X-Greylist: from auto-whitelisted by SQLgrey-1.7.6 Received: from outmail149080.authsmtp.com (outmail149080.authsmtp.com [62.13.149.80]) by mail.postgresql.org (Postfix) with ESMTP id C86C2B5D8B2 for ; Fri, 20 May 2011 05:44:55 -0300 (ADT) Received: from mail-c193.authsmtp.com (mail-c193.authsmtp.com [62.13.128.118]) by punt10.authsmtp.com (8.14.2/8.14.2/Kp) with ESMTP id p4K8irdC000696; Fri, 20 May 2011 09:44:53 +0100 (BST) Received: from [192.168.23.4] (212.100.42.202.fixip.bitel.net [212.100.42.202]) (authenticated bits=0) by mail.authsmtp.com (8.14.2/8.14.2) with ESMTP id p4K8inht028555; Fri, 20 May 2011 09:44:49 +0100 (BST) Message-ID: <4DD62A01.6010206@2ndQuadrant.com> Date: Fri, 20 May 2011 10:44:49 +0200 From: Susanne Ebrecht User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.17) Gecko/20110424 Thunderbird/3.1.10 MIME-Version: 1.0 To: Peter Eisentraut CC: pgsql-docs@postgresql.org Subject: Re: non-ASCII characters in SGML documentation (and elsewhere) References: <1305841740.3952.32.camel@vanquo.pezone.net> In-Reply-To: <1305841740.3952.32.camel@vanquo.pezone.net> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Server-Quench: 6c4b344a-82bd-11e0-97bb-002264978518 X-AuthReport-Spam: If SPAM / abuse - report it at: http://www.authsmtp.com/abuse X-AuthRoute: OCdxZQATClZOUR8T DCUJJHVGRTw4LxFW CBkfawBdJUwMSABN M15eIxoJcUtGHRZ8 Ui8UWVRVUU1xUGlx bwBVZEtfY0hQXgVq TkxLXFBSFhpqBAMB SF4aKRErKV0CeHx4 YENnEHRSWkI0cxV5 EE1QETwbbWUyPH0e URVYagpTIlFXfh9H aFZ7XXQFZGQPKBg1 TUcAExcaBgAXLiNT Byg2BnMzCU0LFyQ1 TAwPBy5nHEtNfAoW C1QqLVUBGUsXLkwu MFwnERIRNRNaBwlF El5NDTVUZjH/ X-Authentic-SMTP: 61633235383639.1014:706 X-AuthFastPath: 0 (Was 255) X-AuthVirus-Status: No virus detected - but ensure you scan with your own anti-virus system. X-Virus-Scanned: Maia Mailguard 1.0.1 X-Spam-Status: No, hits=-1.9 tagged_above=-5 required=5 tests=BAYES_00=-1.9 X-Spam-Level: X-Archive-Number: 201105/64 X-Sequence-Number: 6739 Hello Peter, On 19.05.2011 23:49, Peter Eisentraut wrote: > So, some questions: > > * Should we consistently use entities for encoding non-ASCII > characters in SGML? Or use LATIN1 freely? > * Should we allow/use non-ASCII characters in the release notes? > * What encoding should the HISTORY file have? > * Should we allow non-ASCII characters in general source files? > * If so, what should the encoding be? one more argument for switching to XML? :) I guess we will get some more non-ASCII signs in documentation. How do you want to document the collation stuff? Collations are for all that isn't ASCII. Our docs usually have small examples. I can imagine that you want to place German or Russian letters or whatever else as examples into doc. Do you have another idea then using utf8? What do you expect what not would fit into utf8? I would expect words like déjà vu - means words that English just copied from French and still use the French accents. Or even personal names with e.g. umlauts, accents, and other special signs from special languages. Also consider - usually editors (vi, emacs) use utf8 today. Btw. For German docs I use utf8. The HTML output works well with both 'ö' and 'ö'. I not yet tested other outputs. I just changed to utf8 in stylsheets and use export SP_ENCODING=XML before compiling. Unfortunately index sorting neither works with 'ö' nor 'ö' yet. We are still fighting with it and try to figure out how we can force that it will sort correct. Just changing makefile didn't help. But - in English docs - I doubt that you have to deal with indexes on special words using non-ASCII characters. Means very small and low effort changes already might help. Susanne -- Susanne Ebrecht - 2ndQuadrant PostgreSQL Development, 24x7 Support, Training and Services www.2ndQuadrant.com