Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bI26C-0001ge-Pb for pgsql-docs@arkaria.postgresql.org; Tue, 28 Jun 2016 23:07:20 +0000 Received: from localhost ([127.0.0.1] helo=postgresql.org) by malur.postgresql.org with smtp (Exim 4.84_2) (envelope-from ) id 1bI26B-0003cG-Dv for pgsql-docs@arkaria.postgresql.org; Tue, 28 Jun 2016 23:07:19 +0000 Received: from makus.postgresql.org ([2001:4800:1501:1::229]) by malur.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA384:256) (Exim 4.84_2) (envelope-from ) id 1bI26A-0003bi-9Y for pgsql-docs@postgresql.org; Tue, 28 Jun 2016 23:07:18 +0000 Received: from mail-ob0-x22f.google.com ([2607:f8b0:4003:c01::22f]) by makus.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.84_2) (envelope-from ) id 1bI262-0005Qs-J7 for pgsql-docs@postgresql.org; Tue, 28 Jun 2016 23:07:16 +0000 Received: by mail-ob0-x22f.google.com with SMTP id mu6so1960087obc.3 for ; Tue, 28 Jun 2016 16:07:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bowt-ie.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=l+0A+tA0RFu2tDPdtkhGP4DSARqxtEQJmEjEddAbBrI=; b=whXPHbcWM2Ybmt1OR+KWme6E9Fl+UiI+cYIf6eDD1yyfUHLxN3636hGPMuUg0So6rT 6MbXAnTQsmgbouScrjnQE3yvWyZ7b8VvdT2IWaNQoDKY2+W51zBNSi5GxE6xtj5AsLqU P6RndoO2VaqIftTj61vPFg3FDyC9Odo4ok380OEpqTlhqyiTqN6/c24ryQat+uQABwI1 94vHxIrLIvusfrYkDeu5Ns4ehGOSZGajV030MMI6vzvLSon68yp7+gnkgR4xgXT4Lg7l OWiOwF7fszeFhdo4xsDowpNLZLzOCMgS5RBw+lG2e8WDfYAHQjzeiqHY1W+Xi0S2u999 rZqA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=l+0A+tA0RFu2tDPdtkhGP4DSARqxtEQJmEjEddAbBrI=; b=PwLB6+cQKgxJTdeh4FF94Pt2raXLPJc9MQwoEyWwgDlnDgJ0q0W6IyOA/8Tqop/7m3 yBbJVLyhEo9E5s5Dasq4lTvqajkT/kWFl8gHSeKI9NVHG8SbcjMwhW+Ns+h82KBZjR/W aii9R9+9HJjz99Cd9QkaBE3kVHtojEBUhQrxF/UGPWIDTMhk/T+qHhY6dlCtnOVS8+zQ LoJ231dg4Es3xmZ8gytjWVGA1KgMLHUbdKRbHG9ygEccK3mcpjGcdcUaoaADjOy6CRa1 TFgLNx9yjrWGSWbryNw2G91AHVByPo3deSk+Gd7tJ5YHLbg4qTSTKF23GVDcix4OdXQf xiiA== X-Gm-Message-State: ALyK8tJRIvP92FkI+Yr1DR20H65fooQbE7FfX5kM+kXBWKcmLDg3uSrqaN9SBRIZZHcQGSut5dQfzev/AK8qQQ== X-Received: by 10.157.52.183 with SMTP id g52mr3431389otc.12.1467155229153; Tue, 28 Jun 2016 16:07:09 -0700 (PDT) MIME-Version: 1.0 Received: by 10.157.17.57 with HTTP; Tue, 28 Jun 2016 16:06:49 -0700 (PDT) X-Originating-IP: [75.101.100.201] In-Reply-To: <20160628225015.GA82106@alvherre.pgsql> References: <20160628225015.GA82106@alvherre.pgsql> From: Peter Geoghegan Date: Tue, 28 Jun 2016 16:06:49 -0700 Message-ID: Subject: Re: Pg_upgrade and collation To: Alvaro Herrera Cc: Bruce Momjian , PostgreSQL-documentation Content-Type: text/plain; charset=UTF-8 X-Pg-Spam-Score: -2.6 (--) List-Archive: List-Help: List-ID: List-Owner: List-Post: List-Subscribe: List-Unsubscribe: X-Mailing-List: pgsql-docs Precedence: bulk Sender: pgsql-docs-owner@postgresql.org On Tue, Jun 28, 2016 at 3:50 PM, Alvaro Herrera wrote: > Uhmm. Until now I saw all this ICU thing as having fringe benefit on > strange platforms only, but it is seeming more and more like we need to > take it seriously. I'm not prepared to spend effort on it myself, > though. Let me put it this way: If we lived in a world where internationalization was a new idea, and someone proposed collation support that relied on the OS today, the patch would be rejected in about 2 minutes. The author would be pointed in the direction of "Notes to Operator Class Implementors" within the nbtree README. There are numerous user-visible benefits to ICU support, too, like: * Case-insensitive collations become possible (with work in other areas). No more contrib/citext hack. This is something that we seem to want to work towards. * Abbreviated keys in indexes with collated text becomes possible. (Already mentioned that abbreviated keys for collated text + sorting are effectively reintroduced.) * More useful collations available for certain languages, such as Japanese. Apparently, the JIS X 4061 algorithm produces results that Japanese people find more useful, but glibc doesn't support it, and never will. * We might be able to document WAL compatibility usefully, now. The documentation never gets around to explaining what two instances are compatible for the purposes of physical replication. I can't think of any other factor that prevents us from locking that down. * Upgrade major OS versions without difficulty. * User-defined collations, where you can mix and match certain facets of how text is sorted as you please. Basically, ICU offers rich functionality that we can bubble up to our users without too much effort, as other database systems have. -- Peter Geoghegan -- Sent via pgsql-docs mailing list (pgsql-docs@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-docs