Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1urGo4-003Lgx-D3 for pgsql-general@arkaria.postgresql.org; Wed, 27 Aug 2025 14:07:21 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1urGo2-00FW7Z-Hy for pgsql-general@arkaria.postgresql.org; Wed, 27 Aug 2025 14:07:19 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1urGo2-00FW6w-72 for pgsql-general@lists.postgresql.org; Wed, 27 Aug 2025 14:07:18 +0000 Received: from mail-il1-x129.google.com ([2607:f8b0:4864:20::129]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from ) id 1urGo0-0021fP-0b for pgsql-general@lists.postgresql.org; Wed, 27 Aug 2025 14:07:17 +0000 Received: by mail-il1-x129.google.com with SMTP id e9e14a558f8ab-3ea8b3a6426so54095525ab.2 for ; Wed, 27 Aug 2025 07:07:16 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1756303636; x=1756908436; darn=lists.postgresql.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=00HEwfH73D4etKL+4hNH5okfx6IX49vkdL5sOG+lmf4=; b=MmgkhFpxa69bIA2cApTLV3l3izBtbxq1+ZM2KjkzDDdg+fOE79qeycs82y/8xO4El5 45FYPjaanxb53EAj0G61YWnCDoUI+zqHLNdiI55KxeBLW7RPx2c/FOdmI5RfvoWBxq3T Q+xnu+QhF/i2H3wxuRs1wPST7zTbuQv1ZrwMXQo19gghxn4rrwG6o/kI6r4xGEw2j4Lu jM8wUsytw6SPDyNQVN3tA6cPe+kaAhBmZ1Sf/nMsnU1N3kZmAdPQpW0gpVDN868Im6F2 5hygUSviyPHtEg4yY4xGWa1BAcHsmyyIQDrXE5vy1U9wbN2Lnt84AMTK1/pQnvqHezb5 3gZQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1756303636; x=1756908436; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=00HEwfH73D4etKL+4hNH5okfx6IX49vkdL5sOG+lmf4=; b=EvDKu6Ve/LO5g2wua9qj2i0giMNX8oinhdvjpuVMPg+siP19EFU+RC/R6ceih84TuB ATt3ljhsDRRivO/a2wdNXQWf2zdrQwQFWaGqjWIznrf9eha6nbyZ1+v6zIVtNxhGh2bi MRVLqvB7UPRoTEE/8zmdXS/l1QFWFC526P+Eq3pBLRcolaHCmaET6SEiDQZfQZgH1lx3 nP/wt/f7TkuOgvq7F8V1SnTHbKgQm2edoQXnZJ5PznxBN2idAEjITbHkAlyN4xt2hbt6 HfR+Sv99WILvjxrw5IzO9STdWUSadpz8g+JQKeQ16BNvg5v52vQSHQ2dxnst5MSzBoz+ iNlA== X-Gm-Message-State: AOJu0YxSnoWcUwTf76x0tEbnVGZwvzgl7i3p2wZxBJLGbHHkOebE0Eim DPGzR3i5rOUef+NAbstkQLELFaZkiBFa2UAs0BjT/whh3TpQ/Sx4YPWgXC9u+xnbarxtZCuuzSF kYOYO1SUZDnadPQPyVpyBS5meo5OThu0= X-Gm-Gg: ASbGncsLGOmqPWQCYRk8DBHGeb3fq4TsKZMXN92jZFEl36oSoImmJll6xYYJ62Khfli 5ZrV9JVrhmfarv12Rr4qWSq7h9bJpWsqruUVg0rtHs1yiZ2EVsVa+E+s9NMS38LfekPQOrh0e/d hV4sWieuMTuFXoQw3Nf8VFlltyscjyjcFGRoaFZn6DjlTle5jLpYRSO2+C4hrST/g5DU4bxbHda V9YEbC+dnkrHxJtun19FtJY6d6VxfhDWp+MUvPpQKd0sLsz+sNGFtrP0Rtst6Vp+zLt2a3FG68t ninU+0YfIxwceuXrMyNk/JkgQYHU+MJjG/zyzQ== X-Google-Smtp-Source: AGHT+IHKcYuGZnfSGCYMQMTDZjH4TYUuDw1zgh6x6ZzjyhK+3666dU89ua/aihL+7+luAJ9GMMa//8fUZU18cnETvzg= X-Received: by 2002:a92:ca4d:0:b0:3ee:ebf7:3948 with SMTP id e9e14a558f8ab-3eeebf73a55mr80184395ab.11.1756303636165; Wed, 27 Aug 2025 07:07:16 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Greg Sabino Mullane Date: Wed, 27 Aug 2025 10:06:40 -0400 X-Gm-Features: Ac12FXwX16o90yuskFGtUcgTSy8fM-xHSsVqSA1oKdw54CLzNl4OrGi8h4IZ67w Message-ID: Subject: Re: In-order pg_dump (or in-order COPY TO) To: Dimitrios Apostolou Cc: pgsql-general@lists.postgresql.org Content-Type: multipart/alternative; boundary="00000000000059f719063d594e33" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --00000000000059f719063d594e33 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Tue, Aug 26, 2025 at 3:44=E2=80=AFPM Dimitrios Apostolou = wrote: > I am storing dumps of a database (pg_dump custom format) in a > de-duplicating backup server. Each dump is many terabytes in size, so > deduplication is very important. And de-duplication itself is based on > rolling checksums which is pretty flexible, it can compensate for blocks > moving by some offset. > I suggest looking into pgBackRest, and it's block incremental feature, which sounds similar to what you are doing. But it also does it with parallel processes, and can do things like grab backup files from your replicas, plus a lot of other features. Cheers, Greg -- Crunchy Data - https://www.crunchydata.com Enterprise Postgres Software Products & Tech Support --00000000000059f719063d594e33 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
On Tue, Aug 26, 2025 at 3:44=E2=80=AFPM D= imitrios Apostolou <jimis@gmx.net&g= t; wrote:
I am storing dumps of a database (pg= _dump custom format) in a
de-duplicating backup server. Each dump is many terabytes in size, so
deduplication is very important. And de-duplication itself is based on
rolling checksums which is pretty flexible, it can compensate for blocks moving by some offset.

I suggest lookin= g into pgBackRest, and it's block incremental feature, which sounds=C2= =A0similar to what you are doing. But it also does it with parallel process= es, and can do things like grab backup files from your replicas, plus a lot= of other features.
=C2=A0
Cheers,
Greg

=
--
Enterprise = Postgres Software Products & Tech Support

--00000000000059f719063d594e33--