Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1vGYub-000gxJ-QI for pgsql-general@arkaria.postgresql.org; Wed, 05 Nov 2025 08:30:37 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1vGYua-001Dnb-QT for pgsql-general@arkaria.postgresql.org; Wed, 05 Nov 2025 08:30:35 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1vGWzZ-000erW-Da for pgsql-general@lists.postgresql.org; Wed, 05 Nov 2025 06:27:36 +0000 Received: from mail-il1-x129.google.com ([2607:f8b0:4864:20::129]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from ) id 1vGWzW-005XEF-2O for pgsql-general@lists.postgresql.org; Wed, 05 Nov 2025 06:27:35 +0000 Received: by mail-il1-x129.google.com with SMTP id e9e14a558f8ab-4330d2ea04eso22185295ab.3 for ; Tue, 04 Nov 2025 22:27:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1762324054; x=1762928854; darn=lists.postgresql.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=tmzzbpzNS0+S8xzIj7FwNfH4Ug0SduPBR/uTSTv5iao=; b=f+NGtchVQU0lg6FXtRjSz5TQDQjXsU+UlOh28kN00FCHUqBsKGR4s2YAx+WjQ57K1P LiquJ4JV+PR+BUhlxnXXeM9ZpbX6yFGDcvar555nHLTjnZyrezKxQlDycJc9gEOzGm/v 7y9EP2o0Zkqv8SDSAodE7dqkglCJTngxncCZG4SrnCtprox2gV1gLlFKAkEoodFtU+ci LR/JRSIV7aXezo00nnTbLMDgHtsSqJJ/BFgcrrlg6rU/C8AAhmSqX7ZjBpPWzs4phmou sueDCHbM6CvQTZ6LSHowjv4heoXAAuPMfOltDy0bqrQTjgok9p+4gFH3oliws8dgV8bA SvRw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1762324054; x=1762928854; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=tmzzbpzNS0+S8xzIj7FwNfH4Ug0SduPBR/uTSTv5iao=; b=FJLbZeIn0TA4sz0JYPMWDq9sS2nDeRJ6uLvwhVmdM0gcPQfuPo/8VTiStHXRbfYSFf w09YAHR8k7il5O5cDD/jTuB55t+rCZb3pm138esrIp4nqN27o98hRJIjlERGc+yQmIw7 AIiOWSN01wYpRo1pMIm3zR5oWU3pRrfeLKo6QkY2msauOSU7+uxEZip6rR/7ee7L7yfZ kcp8CuV1ntatm8gs/W3D4V+fhs/jZ86VVlYuIvFuuI7Y0kn7p5y7kpPc9h5yPcm0w2NK qNzrjq91f6NHWwRgGBJbL4pxXoZQsIxiwQ9HBOhYgVPrbebxqZrrFxrygLZK2Vy50lGw 4u0Q== X-Forwarded-Encrypted: i=1; AJvYcCUrrJKr89esxn+TDo01ehM0AbJBSExDilBDQ8nJR+bxAopbCvgS/ag8uexccal6OoaXJOMxnbpNMIfe4jMP@lists.postgresql.org X-Gm-Message-State: AOJu0YxWGl69xTCJwhLEdd8xnLdtvJyiNxq2qvMqMAuD62RTH5bVlHRG sQldNyn85xpgnLIDdOlwffnox7JxTP7MHWiVBJ5mXKpWZlIkoEcCkF00tCF4RngXn+nEzSRVQps yaTgnpW2zJB+KcJU03+qfqZpg+pPpEXKzgdch X-Gm-Gg: ASbGncvoghiPElFsoqE/Rd1cGPY/owAQMIDbJ7MsHXJxBYGAWldxjaLsbT20mbyU4LO lzaq5qRdExbWDZS/AAJ7yxwdporW3UkTlvKYssyEjE3OnVi4inFbB8lFvqLRqXPltEAscxETdg5 hGE/erxVqn+oVn0Szbhzxh30hPwWwpNcxWLEbgJU59Nn5qpVuHWyrlfjv6EQnc0N/hpftEk+ZmA FqMja8Oy2prYBxX8hwhdSpkqOt2vd7ajXqfU/AtVMma1Kg7AZOo/IsxwttqXBR3EUp5IyFJ7e/p bno4cEet0y9+K3wlwoE= X-Google-Smtp-Source: AGHT+IGA4c8/NWz2PsM2BGs/w24y0yjB1fSefn7GTmGzpajDp1YmKPUmpZViOwyBbkLsIKsN/KK2K6yDDhsR+uKipv0= X-Received: by 2002:a05:6e02:1c23:b0:430:d061:d9f7 with SMTP id e9e14a558f8ab-433407c519amr25304825ab.23.1762324054009; Tue, 04 Nov 2025 22:27:34 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Bala M Date: Wed, 5 Nov 2025 11:57:22 +0530 X-Gm-Features: AWmQ_bn886KJ14Db3piPwvvF5gXkV_UGzBYlWO8_5MLuNdGOL3t0XD0BXszrLt0 Message-ID: Subject: Re: Index corruption issue after migration from RHEL 7 to RHEL 9 (PostgreSQL 11 streaming replication) To: Greg Sabino Mullane , Francisco Olarte Cc: "adrian.klaver@aklaver.com" , chris+google@qwirx.com, pgsql-general@lists.postgresql.org Content-Type: multipart/alternative; boundary="00000000000037edc40642d30bc6" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --00000000000037edc40642d30bc6 Content-Type: text/plain; charset="UTF-8" Thank you all for your suggestions, Thanks for your quick response and for sharing the details. After reviewing the options, the logical replication approach seems to be the most feasible one with minimal downtime. However, we currently have 7 streaming replication setups running from production, with a total database size of around 15 TB. Out of this, there are about 10 large tables ranging from 1 TB (max) to 50 GB (min) each, along with approximately 150+ sequences. Could you please confirm if there are any successful case studies or benchmarks available for a similar setup? Additionally, please share any recommended parameter tuning or best practices for handling logical replication at this scale. Current server configuration: CPU: 144 cores RAM: 512 GB Thanks & Regards Krishna. On Fri, 24 Oct 2025 at 21:55, Francisco Olarte wrote: > > On Thu, 23 Oct 2025 at 17:21, Greg Sabino Mullane > wrote > > pg_dump is the most reliable, and the slowest. Keep in mind that only the >> actual data needs to move over (not the indexes, which get rebuilt after >> the data is loaded). You could also mix-n-match pg_logical and pg_dump if >> you have a few tables that are super large. Whether either approach fits in >> your 24 hour window is hard to say without you running some tests. >> > > Long time ago I had a similar problem and did a "running with scissors" > restore. This means: > > 1.- Prepare normal configuration, test, etc for the new version. > 2.- Prepare a restore configuration, with fsync=off, wallevel=minimal, > whatever option gives you any speed advantage. > > As the target was empty, if restore failed we could just clean and restart. > > 3.- Dump, boot with the restore configuration, restore, clean shutdown, > switch to production configuration, boot again and follow on. > > Time has passed and I lost my notes, but I remember the restore was much > faster than doing it with the normal production configuration. Given > current machine speeds, it maybe doable. > > > Francisco Olarte. > > --00000000000037edc40642d30bc6 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Thank you all for your suggestions,= =C2=A0

Thanks for your quick response and for shar= ing the details.
After reviewing the options, the logical replication ap= proach seems to be the most feasible one with minimal downtime.

Howe= ver, we currently have 7 streaming replication setups running from producti= on, with a total database size of around 15 TB. Out of this, there are abou= t 10 large tables ranging from 1 TB (max) to 50 GB (min) each, along with a= pproximately 150+ sequences.

Could you please confirm if there are a= ny successful case studies or benchmarks available for a similar setup?
= Additionally, please share any recommended parameter tuning or best practic= es for handling logical replication at this scale.

Current server co= nfiguration:

CPU: 144 cores

RAM: 512 GB


Th= anks & Regards
Krishna.


=
On Fri, 24 Oct 2025 at 21:55, Francisco Olarte <folarte@peoplecall.com> wrote:

On Thu, 23 O= ct 2025 at 17:21, Greg Sabino Mullane <htamfids@gmail.com> wrote

pg_dump is the = most reliable, and the slowest. Keep in mind that only the actual data need= s to move over (not the indexes, which get rebuilt after the data is loaded= ). You could also mix-n-match pg_logical and pg_dump if you have a few tabl= es that are super large. Whether either approach fits in your 24 hour windo= w is hard to say without you running some tests.

Long= time ago I had a similar problem and did a "running with scissors&quo= t; restore. This means:

1.- Prepare norma= l configuration, test, etc for the new version.
2.- Prepare a restore configuration, with fsync=3D= off, wallevel=3Dminimal, whatever option gives you any speed advantage.

As the target was empty, if restore failed w= e could just clean and restart.

3.- Dump,= boot with the restore configuration, restore, clean shutdown, switch to pr= oduction configuration, boot again and follow on.

Time has passed and I lost my notes, but I remember the restore wa= s much faster than doing it with the normal production configuration. Given= current machine speeds, it maybe doable.


Francisco Olarte.=

--00000000000037edc40642d30bc6--