Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1w0miA-002CoN-2O for pgsql-hackers@arkaria.postgresql.org; Thu, 12 Mar 2026 20:32:50 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1w0mi8-00HaFc-14 for pgsql-hackers@arkaria.postgresql.org; Thu, 12 Mar 2026 20:32:48 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1w0mi8-00HaFT-02 for pgsql-hackers@lists.postgresql.org; Thu, 12 Mar 2026 20:32:48 +0000 Received: from mail-ed1-x52f.google.com ([2a00:1450:4864:20::52f]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.98.2) (envelope-from ) id 1w0mi5-00000002M5w-3czf for pgsql-hackers@postgresql.org; Thu, 12 Mar 2026 20:32:47 +0000 Received: by mail-ed1-x52f.google.com with SMTP id 4fb4d7f45d1cf-662fc12ac5bso2842378a12.0 for ; Thu, 12 Mar 2026 13:32:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cybertec.at; s=google; t=1773347565; x=1773952365; darn=postgresql.org; h=mime-version:user-agent:content-transfer-encoding:references :in-reply-to:date:cc:to:from:subject:message-id:from:to:cc:subject :date:message-id:reply-to; bh=n4t5BFhTmiXcXeQnIpMYyeRMSqiytpK9qR3b/zkJYmY=; b=rgxvFHyuKhTB+MLCp7PJKrjwr0Mwa7Kgqn5ZayEDnFar6+kauayTMOdEoWkg64piqs u3z47BWRVFmIniAM6m34BopjLc5v4bwsux+Fdxhz9oGdgS+Px0oTr5RlN52p9/BYpQSk vMGZYjTdbC75kUyZXDIQA3z0cIaQ2bYGUEAt6amKmNLvHl2WQidzK10etzEQKCu9ezCW +yrfBM4K4hmm8MC8nmBFQbDjJ/boZvCWpYfetJXlnwWOlHjmRRbGDx3mtYP41oNfL/7E BkDnrcCQdjs29ToNLLs4F8+DXFi6punIvMY3vh/0oHhX0bcDVPropyG3sw/LeDC7GUlt 6Eiw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1773347565; x=1773952365; h=mime-version:user-agent:content-transfer-encoding:references :in-reply-to:date:cc:to:from:subject:message-id:x-gm-gg :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=n4t5BFhTmiXcXeQnIpMYyeRMSqiytpK9qR3b/zkJYmY=; b=Uyoqm8rVvwc5xSxG1qaDt8ovlwV+DY5yFo6ncWl1BrqByphnHYs4/DjDz2EfN1kuro wG5YEak9KIPtBGpnNs5viEelJLvjFrlqFcvYTfsKzCuz/hsh1quk03KmiNgJSYL30RE4 F037p5i8NlF/4DYAe3N9DvDkx4PQNWB16j8idY40jz2lkN/6We5XwbEkC78ZXQoOpxHJ Hol106o6DLUTTxIwrSXAaG4croXtnJ2jCyLHuqtASPX9jVNIwqsoWwdbbSh2wA4neliu i0YTZVCHlCe9MgkGhUvvfUKiuWwtoQJVLerN2BL+Kk4Vtfe4n6vrFMWCnTjp97+1utid DgdQ== X-Gm-Message-State: AOJu0Yw+Ebty5v8Eu++mNZyGAh2tiHvSQD2nXFspMJa4THBi2GdTtYXi DkdvX5+MVH20aW3Z/b2V5QgCDJTirboGxsi0n0WuB1MfCWBUVjtegAsd6gpKYR2+ZWU= X-Gm-Gg: ATEYQzyaEte5yX/FoRh/LlomAPI2Jc00O149r2KzGt38o/USmLXVTGgchbJyN2+jkVp bh0MzCNnC19gXDdaa23ur2S+MV/Sq0kmrjMJk106Rzz1oGxPVI69KJ4vpWV/95C2Zcz5iDK8JLQ wgrz2/KtE0iECAq4LRZInR6EdiLE7KBAIxU5L7vdwn+5x0GD51smogt94oPSuq2vxELXPdZx+xl 1I7fAd+rTBH2Ok8q++DggDTYVra/e3T/1HtLzd2BeVtw0slIp214XragZA0AvGwSgAEq9Ow1iCd jPcUmLPkUyVjWg5pE0aS/jRUg7LU4I39Rdd3+V7mrSnC5mRDzcjXpO2wBrMgnIzp5v8RUxgp3DI 5JxBTH2Z8yI2aN+vtkbELUJ42AZSOAkL0QO+hLjQYzYIHYGvDkLR6Ey6RHKQmonfeww4e77a+yo SMaTTJ5ym6WY9nL1wd9mu3pgEGNzOS55Mw5d9jJz2NR8pY0BvlDnNBW81Q907jK/d9khlOg1r/b dnGKU749dpFi5B/HdxHfdoSC0IdqmTnmIQ9 X-Received: by 2002:a17:907:ec8a:b0:b97:4e42:23e3 with SMTP id a640c23a62f3a-b976504907amr41692666b.26.1773347564705; Thu, 12 Mar 2026 13:32:44 -0700 (PDT) Received: from laurenz.albe-K4N0CV00F97414D (dynamic-046-114-168-089.46.114.pool.telefonica.de. [46.114.168.89]) by smtp.gmail.com with ESMTPSA id a640c23a62f3a-b972e14aa23sm162139166b.40.2026.03.12.13.32.44 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 12 Mar 2026 13:32:44 -0700 (PDT) Message-ID: <75c43ae49dba00a3af7a4d0cb06498bef9508c4c.camel@cybertec.at> Subject: Re: Change initdb default to the builtin collation provider From: Laurenz Albe To: Robert Haas , Jeff Davis Cc: pgsql-hackers@postgresql.org Date: Thu, 12 Mar 2026 21:32:43 +0100 In-Reply-To: References: <47e1b4f72fe732c5ae85c6cf2c085b4e99a10120.camel@j-davis.com> <4309879ac305b1cf6b4d7b5fb85bc7b62c6ab768.camel@j-davis.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable User-Agent: Evolution 3.58.3 (3.58.3-1.fc43) MIME-Version: 1.0 List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk On Thu, 2026-03-12 at 10:04 -0400, Robert Haas wrote: > So maybe the people who are saying that defaulting to C is fine and > that making people make an explicit choice if they want something else > are right. but my personal guess is that we will make a bunch of > people unhappy. Defaulting to C will make a bunch of people unhappy, I agree. However, that is a kind of problem that they will discover early on during development or testing, and it is easily remedied for those columns where alphabetical sorting order matters. But a good number of people are already unhappy because they have index corruption (today I dealt with another support case). And the people who are diligent enough to know that they should reindex after an OS update are unhappy because of the additional down time. I maintain that the second kind of unhappiness weighs heavier, and I find this need to reindex to be one of the most embarrassing flaws in an otherwise great piece of software. This is a value judgement, and I may be wrong. > If even a small fraction of users create a > database using "C" unintentionally and load a terabyte of data into it > before realizing that all their text indexes are sorting "wrong", I > suspect that's not going to be much fun. Hm. So reindexing a large table after an OS upgrade is acceptable, but reindexing a large table after a bulk load is not? The problem you describe could have been avoided with a bit of testing, but the need to reindex after an OS upgrade sometimes cannot be avoided. > Obviously, I could be wildly incorrect. Maybe people will just be > super-happy about faster sorting and life will be great. I have not experienced sort speed as a frequent problem, and the life with the C collation won't be all that great. But I believe that it would be better than a life with index corruption. Yours, Laurenz Albe