MIME-Version: 1.0
References: <CAAccyYKpNsQMD+S-A7a8YtDevFN0uRXkzg4tYWWBOFsv_jASNg@mail.gmail.com>
 <fd5c6708-3791-4339-83a2-e5fc389cd9cb@aklaver.com> <CAAccyYLYZmwQiNMoJcQgo5t+E24rDtu1ZeBUrER7ZTKNAcZesw@mail.gmail.com>
 <d3622bf1-6a62-4703-b14e-295f47b5e348@aklaver.com>
In-Reply-To: <d3622bf1-6a62-4703-b14e-295f47b5e348@aklaver.com>
From: px shi <spxlyy123@gmail.com>
Date: Tue, 12 Aug 2025 16:24:44 +0800
Message-ID: <CAAccyYJ-07SzCRAEkGJ2Qa8EAPCHQM4qcpB=OvD8P0zDbCJ0KQ@mail.gmail.com>
Subject: Re: Questions about the continuity of WAL archiving
To: Adrian Klaver <adrian.klaver@aklaver.com>
Cc: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="00000000000071ecb1063c26c66e"
Archived-At: <https://www.postgresql.org/message-id/CAAccyYJ-07SzCRAEkGJ2Qa8EAPCHQM4qcpB%3DOvD8P0zDbCJ0KQ%40mail.gmail.com>
Precedence: bulk

--00000000000071ecb1063c26c66e
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

> 1) What is the current archiving setup on the primary and why is lagging?

 The archive command uses pgBackRest to archive to S3. Because it is
uploaded to S3, the archiving speed is slow, which has caused lagging.

2) Have you looked at archiving off the standby node while it is in standby
> per:

Yes, archiving on the standby node is disabled. Is it recommended to share
the WAL archive between the primary and standby nodes to avoid
interruptions in archiving?

Adrian Klaver <adrian.klaver@aklaver.com> =E4=BA=8E2025=E5=B9=B48=E6=9C=888=
=E6=97=A5=E5=91=A8=E4=BA=94 23:23=E5=86=99=E9=81=93=EF=BC=9A

> On 8/7/25 22:50, px shi wrote:
> > Thank you for your reply.
> > The archived files can be used for PITR (Point-In-Time Recovery),
> > allowing recovery to any point between WAL 80 and 100 on timeline 1.
> > Additionally, if there's a backup taken during timeline 1 and a
> > switchover to a new primary has occurred without taking a new full
> > backup yet, these WAL logs can still be used to recover to any point on
> > timeline 2.
>
> Alright I see.
>
> Two things:
>
> 1) What is the current archiving setup on the primary and why is lagging?
>
> 2) Have you looked at archiving off the standby node while it is in
> standby per:
>
>
> https://www.postgresql.org/docs/current/warm-standby.html#CONTINUOUS-ARCH=
IVING-IN-STANDBY
>
> >
> > Regards,
> > Pixian Shi
> >
> > Adrian Klaver <adrian.klaver@aklaver.com
> > <mailto:adrian.klaver@aklaver.com>> =E4=BA=8E2025=E5=B9=B48=E6=9C=888=
=E6=97=A5=E5=91=A8=E4=BA=94 12:25=E5=86=99=E9=81=93=EF=BC=9A
> >
> >     On 8/7/25 20:20, px shi wrote:
> >      > Hi,
> >      > There is a scenario: the current timeline of the PostgreSQL
> >     primary node
> >      > is 1, and the latest WAL file is 100. The standby node has also
> >     received
> >      > up to WAL file 100. However, the latest WAL file archived is onl=
y
> >     file
> >      > 80. If the primary node crashes at this point and the standby is
> >      > promoted to the new primary, archiving will resume from file 100
> on
> >      > timeline 2. As a result, WAL files from 81 to 100 on timeline 1
> >     will be
> >      > missing from the archive.
> >
> >     What are you planning to do with the archived files?
> >
> >     Also is not the case that once the primary crashes you are in a spl=
it
> >     brain case and can't really trust it's timeline anymore?
> >
> >
> >      > Is there a good solution to prevent this situation?
> >      >
> >      > Regards,
> >      > Pixian Shi
> >
> >
> >     --
> >     Adrian Klaver
> >     adrian.klaver@aklaver.com <mailto:adrian.klaver@aklaver.com>
> >
>
>
> --
> Adrian Klaver
> adrian.klaver@aklaver.com
>

--00000000000071ecb1063c26c66e
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-col=
or:rgb(204,204,204);padding-left:1ex">1) What is the current archiving setu=
p on the primary and why is lagging?</blockquote><div>=C2=A0The archive com=
mand uses pgBackRest to archive to S3. Because it is uploaded to S3, the ar=
chiving speed is slow, which has caused lagging.</div><div><br></div><block=
quote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-w=
idth:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding=
-left:1ex">2) Have you looked at archiving off the standby node while it is=
 in standby per:</blockquote><div>Yes, archiving on the standby node is dis=
abled. Is it recommended to share the WAL archive between the primary and s=
tandby nodes to avoid interruptions in archiving?<br></div></div><br><div c=
lass=3D"gmail_quote gmail_quote_container"><div dir=3D"ltr" class=3D"gmail_=
attr">Adrian Klaver &lt;<a href=3D"mailto:adrian.klaver@aklaver.com">adrian=
.klaver@aklaver.com</a>&gt; =E4=BA=8E2025=E5=B9=B48=E6=9C=888=E6=97=A5=E5=
=91=A8=E4=BA=94 23:23=E5=86=99=E9=81=93=EF=BC=9A<br></div><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;bo=
rder-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">=
On 8/7/25 22:50, px shi wrote:<br>
&gt; Thank you for your reply.<br>
&gt; The archived files can be used for PITR (Point-In-Time Recovery), <br>
&gt; allowing recovery to any point between WAL 80 and 100 on timeline 1.<b=
r>
&gt; Additionally, if there&#39;s a backup taken during timeline 1 and a <b=
r>
&gt; switchover to a new primary has occurred without taking a new full <br=
>
&gt; backup yet, these WAL logs can still be used to recover to any point o=
n <br>
&gt; timeline 2.<br>
<br>
Alright I see.<br>
<br>
Two things:<br>
<br>
1) What is the current archiving setup on the primary and why is lagging?<b=
r>
<br>
2) Have you looked at archiving off the standby node while it is in <br>
standby per:<br>
<br>
<a href=3D"https://www.postgresql.org/docs/current/warm-standby.html#CONTIN=
UOUS-ARCHIVING-IN-STANDBY" rel=3D"noreferrer" target=3D"_blank">https://www=
.postgresql.org/docs/current/warm-standby.html#CONTINUOUS-ARCHIVING-IN-STAN=
DBY</a><br>
<br>
&gt; <br>
&gt; Regards,<br>
&gt; Pixian Shi<br>
&gt; <br>
&gt; Adrian Klaver &lt;<a href=3D"mailto:adrian.klaver@aklaver.com" target=
=3D"_blank">adrian.klaver@aklaver.com</a> <br>
&gt; &lt;mailto:<a href=3D"mailto:adrian.klaver@aklaver.com" target=3D"_bla=
nk">adrian.klaver@aklaver.com</a>&gt;&gt; =E4=BA=8E2025=E5=B9=B48=E6=9C=888=
=E6=97=A5=E5=91=A8=E4=BA=94 12:25=E5=86=99=E9=81=93=EF=BC=9A<br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0On 8/7/25 20:20, px shi wrote:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; Hi,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; There is a scenario: the current timeline of =
the PostgreSQL<br>
&gt;=C2=A0 =C2=A0 =C2=A0primary node<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; is 1, and the latest WAL file is 100. The sta=
ndby node has also<br>
&gt;=C2=A0 =C2=A0 =C2=A0received<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; up to WAL file 100. However, the latest WAL f=
ile archived is only<br>
&gt;=C2=A0 =C2=A0 =C2=A0file<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; 80. If the primary node crashes at this point=
 and the standby is<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; promoted to the new primary, archiving will r=
esume from file 100 on<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; timeline 2. As a result, WAL files from 81 to=
 100 on timeline 1<br>
&gt;=C2=A0 =C2=A0 =C2=A0will be<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; missing from the archive.<br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0What are you planning to do with the archived files=
?<br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0Also is not the case that once the primary crashes =
you are in a split<br>
&gt;=C2=A0 =C2=A0 =C2=A0brain case and can&#39;t really trust it&#39;s time=
line anymore?<br>
&gt; <br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; Is there a good solution to prevent this situ=
ation?<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; Regards,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 &gt; Pixian Shi<br>
&gt; <br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0-- <br>
&gt;=C2=A0 =C2=A0 =C2=A0Adrian Klaver<br>
&gt;=C2=A0 =C2=A0 =C2=A0<a href=3D"mailto:adrian.klaver@aklaver.com" target=
=3D"_blank">adrian.klaver@aklaver.com</a> &lt;mailto:<a href=3D"mailto:adri=
an.klaver@aklaver.com" target=3D"_blank">adrian.klaver@aklaver.com</a>&gt;<=
br>
&gt; <br>
<br>
<br>
-- <br>
Adrian Klaver<br>
<a href=3D"mailto:adrian.klaver@aklaver.com" target=3D"_blank">adrian.klave=
r@aklaver.com</a><br>
</blockquote></div>

--00000000000071ecb1063c26c66e--