MIME-Version: 1.0
References: <CAGbX52FxL9eR=jmS3ACgRC=tEm=xj_xMmFoOxO+wVA+oURW=kA@mail.gmail.com>
 <1628637f-419f-4f6a-9cb6-07af90cd0bc4@aklaver.com> <CAGbX52HTgb2jYYO5eVDu=xB5-bjAra+34=AUFg4Xk0NpCRjnyg@mail.gmail.com>
 <1c0273f5-a90a-48f6-b51f-fe15c16fa1c6@aklaver.com> <CAGbX52EMG+d-BR5SzDP+rQsrdxT32=CXD5XCT2YCmkTpr3jesw@mail.gmail.com>
In-Reply-To: <CAGbX52EMG+d-BR5SzDP+rQsrdxT32=CXD5XCT2YCmkTpr3jesw@mail.gmail.com>
From: Kashif Zeeshan <kashi.zeeshan@gmail.com>
Date: Fri, 7 Jun 2024 09:20:30 +0500
Message-ID: <CAAPsdhd4y2z7F0uHsw3ifEYGexu=Pyj7DFQQZ5FQtdoghsgLWA@mail.gmail.com>
Subject: Re: Questions on logical replication
To: Koen De Groote <kdg.dev@gmail.com>
Cc: Adrian Klaver <adrian.klaver@aklaver.com>, 
	PostgreSQL General <pgsql-general@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="0000000000006311a2061a451f17"
Archived-At: <https://www.postgresql.org/message-id/CAAPsdhd4y2z7F0uHsw3ifEYGexu%3DPyj7DFQQZ5FQtdoghsgLWA%40mail.gmail.com>
Precedence: bulk

--0000000000006311a2061a451f17
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Fri, Jun 7, 2024 at 3:19=E2=80=AFAM Koen De Groote <kdg.dev@gmail.com> w=
rote:

> I'll give them a read, though it might take a few weekends
>
> Meanwhile, this seems to be what I'm looking for:
>
> From
> https://www.postgresql.org/docs/current/warm-standby.html#STREAMING-REPLI=
CATION-SLOTS
>
> " Replication slots provide an automated way to ensure that the primary
> does not remove WAL segments until they have been received by all standby=
s,
> and that the primary does not remove rows which could cause a recovery
> conflict
> <https://www.postgresql.org/docs/current/hot-standby.html#HOT-STANDBY-CON=
FLICT>
> even when the standby is disconnected."
>
> I'm reading that as: "if there is a replication slot, if the standby is
> disconnected, WAL is kept"
>
> And if we know WAL is kept in the "pg_wal" directory, that sounds like it
> could slowly but surely fill up disk space.
>

Hi

Yes that is a consideration with logical replication but the possible cast
out weight the benefit.
The kept WAL file size will only increase if the standby is offline.

Regards
Kashif Zeeshan
Bitnine Global

>
>
> But again, I'll give them a read. I've read all of logical replication
> already, and I feel like I didn't get my answer there.
>
> Thanks for the help
>
>
> Regards,
> Koen De Groote
>
> On Thu, Jun 6, 2024 at 12:19=E2=80=AFAM Adrian Klaver <adrian.klaver@akla=
ver.com>
> wrote:
>
>> On 6/5/24 14:54, Koen De Groote wrote:
>> >     https://www.postgresql.org/docs/current/wal-configuration.html
>> >     <https://www.postgresql.org/docs/current/wal-configuration.html>
>> >
>> >     "Checkpoints are points in the sequence of transactions at which i=
t
>> is
>> >     guaranteed that the heap and index data files have been updated wi=
th
>> >     all
>> >     information written before that checkpoint. At checkpoint time, al=
l
>> >     dirty data pages are flushed to disk and a special checkpoint
>> record is
>> >     written to the WAL file. (The change records were previously
>> flushed to
>> >     the WAL files.) In the event of a crash, the crash recovery
>> procedure
>> >     looks at the latest checkpoint record to determine the point in th=
e
>> WAL
>> >     (known as the redo record) from which it should start the REDO
>> >     operation. Any changes made to data files before that point are
>> >     guaranteed to be already on disk. Hence, after a checkpoint, WAL
>> >     segments preceding the one containing the redo record are no longe=
r
>> >     needed and can be recycled or removed. (When WAL archiving is bein=
g
>> >     done, the WAL segments must be archived before being recycled or
>> >     removed.)"
>> >
>> >
>> > And this is the same for logical replication and physical replication,
>> I
>> > take it.
>>
>> High level explanation, both physical and logical replication use the
>> WAL files as the starting point. When the recycling is done is dependent
>> on various factors. My suggestion would be to read through the below to
>> get a better idea of what is going. There is a lot to cover, but if you
>> really want to understand it you will need to go through it.
>>
>> Physical replication
>>
>> https://www.postgresql.org/docs/current/high-availability.html
>>
>> 27.2.5. Streaming Replication
>> 27.2.6. Replication Slots
>>
>> Logical replication
>>
>> https://www.postgresql.org/docs/current/logical-replication.html
>>
>> WAL
>>
>> https://www.postgresql.org/docs/current/wal.html
>>
>>
>>
>> >
>> > Thus, if a leader has a standby of the same version, and meanwhile
>> > logical replication is being done to a newer version, both those
>> > replications are taken into account, is that correct?
>>
>> Yes, see links above.
>>
>>
>> > And if it cannot sync them, due to connectivity loss for instance, the
>> > WAL records will not be removed, then?
>>
>> Depends on the type of replication being done. It is possible for
>> physical replication to have WAL records removed that are still needed
>> downstream.
>>
>> From
>>
>>
>> https://www.postgresql.org/docs/current/warm-standby.html#STREAMING-REPL=
ICATION
>>
>> "If you use streaming replication without file-based continuous
>> archiving, the server might recycle old WAL segments before the standby
>> has received them. If this occurs, the standby will need to be
>> reinitialized from a new base backup. You can avoid this by setting
>> wal_keep_size to a value large enough to ensure that WAL segments are
>> not recycled too early, or by configuring a replication slot for the
>> standby. If you set up a WAL archive that's accessible from the standby,
>> these solutions are not required, since the standby can always use the
>> archive to catch up provided it retains enough segments."
>>
>> This is why it is good idea to go through the links I posted above.
>>
>> >
>> > Regards,
>> > Koen De Groote
>> >
>>
>>
>> --
>> Adrian Klaver
>> adrian.klaver@aklaver.com
>>
>>

--0000000000006311a2061a451f17
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote">=
<div dir=3D"ltr" class=3D"gmail_attr">On Fri, Jun 7, 2024 at 3:19=E2=80=AFA=
M Koen De Groote &lt;<a href=3D"mailto:kdg.dev@gmail.com">kdg.dev@gmail.com=
</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:=
0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">=
<div dir=3D"ltr"><div>I&#39;ll give them a read, though it might take a few=
 weekends</div><div><br></div><div>Meanwhile, this seems to be what I&#39;m=
 looking for:</div><div><br></div><div>From <a href=3D"https://www.postgres=
ql.org/docs/current/warm-standby.html#STREAMING-REPLICATION-SLOTS" target=
=3D"_blank">https://www.postgresql.org/docs/current/warm-standby.html#STREA=
MING-REPLICATION-SLOTS</a></div><div><br></div><div>&quot;
Replication slots provide an automated way to ensure that the primary=20
does not remove WAL segments until they have been received by all=20
standbys, and that the primary does not remove rows which could cause a <a =
href=3D"https://www.postgresql.org/docs/current/hot-standby.html#HOT-STANDB=
Y-CONFLICT" title=3D"27.4.2.=C2=A0Handling Query Conflicts" target=3D"_blan=
k">recovery conflict</a> even when the standby is disconnected.&quot;</div>=
<div><br></div><div>I&#39;m reading that as: &quot;if there is a replicatio=
n slot, if the standby is disconnected, WAL is kept&quot;</div><div><br></d=
iv><div>And if we know WAL is kept in the &quot;pg_wal&quot; directory, tha=
t sounds like it could slowly but surely fill up disk space.</div></div></b=
lockquote><div><br></div><div>Hi</div><div><br></div><div>Yes that is a con=
sideration with logical replication but the possible cast out weight the be=
nefit.</div><div>The kept WAL file size will only increase if the standby i=
s offline.</div><div><br></div><div>Regards</div><div>Kashif Zeeshan</div><=
div>Bitnine Global=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"ma=
rgin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:=
1ex"><div dir=3D"ltr"><div><br></div><div><br></div><div>But again, I&#39;l=
l give them a read. I&#39;ve read all of logical replication already, and I=
 feel like I didn&#39;t get my answer there.</div><div><br></div><div>Thank=
s for the help<br></div><div><br></div><div><br></div><div>Regards,</div><d=
iv>Koen De Groote<br></div></div><br><div class=3D"gmail_quote"><div dir=3D=
"ltr" class=3D"gmail_attr">On Thu, Jun 6, 2024 at 12:19=E2=80=AFAM Adrian K=
laver &lt;<a href=3D"mailto:adrian.klaver@aklaver.com" target=3D"_blank">ad=
rian.klaver@aklaver.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_=
quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,=
204);padding-left:1ex">On 6/5/24 14:54, Koen De Groote wrote:<br>
&gt;=C2=A0 =C2=A0 =C2=A0<a href=3D"https://www.postgresql.org/docs/current/=
wal-configuration.html" rel=3D"noreferrer" target=3D"_blank">https://www.po=
stgresql.org/docs/current/wal-configuration.html</a><br>
&gt;=C2=A0 =C2=A0 =C2=A0&lt;<a href=3D"https://www.postgresql.org/docs/curr=
ent/wal-configuration.html" rel=3D"noreferrer" target=3D"_blank">https://ww=
w.postgresql.org/docs/current/wal-configuration.html</a>&gt;<br>
&gt; <br>
&gt;=C2=A0 =C2=A0 =C2=A0&quot;Checkpoints are points in the sequence of tra=
nsactions at which it is<br>
&gt;=C2=A0 =C2=A0 =C2=A0guaranteed that the heap and index data files have =
been updated with<br>
&gt;=C2=A0 =C2=A0 =C2=A0all<br>
&gt;=C2=A0 =C2=A0 =C2=A0information written before that checkpoint. At chec=
kpoint time, all<br>
&gt;=C2=A0 =C2=A0 =C2=A0dirty data pages are flushed to disk and a special =
checkpoint record is<br>
&gt;=C2=A0 =C2=A0 =C2=A0written to the WAL file. (The change records were p=
reviously flushed to<br>
&gt;=C2=A0 =C2=A0 =C2=A0the WAL files.) In the event of a crash, the crash =
recovery procedure<br>
&gt;=C2=A0 =C2=A0 =C2=A0looks at the latest checkpoint record to determine =
the point in the WAL<br>
&gt;=C2=A0 =C2=A0 =C2=A0(known as the redo record) from which it should sta=
rt the REDO<br>
&gt;=C2=A0 =C2=A0 =C2=A0operation. Any changes made to data files before th=
at point are<br>
&gt;=C2=A0 =C2=A0 =C2=A0guaranteed to be already on disk. Hence, after a ch=
eckpoint, WAL<br>
&gt;=C2=A0 =C2=A0 =C2=A0segments preceding the one containing the redo reco=
rd are no longer<br>
&gt;=C2=A0 =C2=A0 =C2=A0needed and can be recycled or removed. (When WAL ar=
chiving is being<br>
&gt;=C2=A0 =C2=A0 =C2=A0done, the WAL segments must be archived before bein=
g recycled or<br>
&gt;=C2=A0 =C2=A0 =C2=A0removed.)&quot;<br>
&gt; <br>
&gt; <br>
&gt; And this is the same for logical replication and physical replication,=
 I <br>
&gt; take it.<br>
<br>
High level explanation, both physical and logical replication use the <br>
WAL files as the starting point. When the recycling is done is dependent <b=
r>
on various factors. My suggestion would be to read through the below to <br=
>
get a better idea of what is going. There is a lot to cover, but if you <br=
>
really want to understand it you will need to go through it.<br>
<br>
Physical replication<br>
<br>
<a href=3D"https://www.postgresql.org/docs/current/high-availability.html" =
rel=3D"noreferrer" target=3D"_blank">https://www.postgresql.org/docs/curren=
t/high-availability.html</a><br>
<br>
27.2.5. Streaming Replication<br>
27.2.6. Replication Slots<br>
<br>
Logical replication<br>
<br>
<a href=3D"https://www.postgresql.org/docs/current/logical-replication.html=
" rel=3D"noreferrer" target=3D"_blank">https://www.postgresql.org/docs/curr=
ent/logical-replication.html</a><br>
<br>
WAL<br>
<br>
<a href=3D"https://www.postgresql.org/docs/current/wal.html" rel=3D"norefer=
rer" target=3D"_blank">https://www.postgresql.org/docs/current/wal.html</a>=
<br>
<br>
<br>
<br>
&gt; <br>
&gt; Thus, if a leader has a standby of the same version, and meanwhile <br=
>
&gt; logical replication is being done to a newer version, both those <br>
&gt; replications are taken into account, is that correct?<br>
<br>
Yes, see links above.<br>
<br>
<br>
&gt; And if it cannot sync them, due to connectivity loss for instance, the=
 <br>
&gt; WAL records will not be removed, then?<br>
<br>
Depends on the type of replication being done. It is possible for <br>
physical replication to have WAL records removed that are still needed <br>
downstream.<br>
<br>
From<br>
<br>
<a href=3D"https://www.postgresql.org/docs/current/warm-standby.html#STREAM=
ING-REPLICATION" rel=3D"noreferrer" target=3D"_blank">https://www.postgresq=
l.org/docs/current/warm-standby.html#STREAMING-REPLICATION</a><br>
<br>
&quot;If you use streaming replication without file-based continuous <br>
archiving, the server might recycle old WAL segments before the standby <br=
>
has received them. If this occurs, the standby will need to be <br>
reinitialized from a new base backup. You can avoid this by setting <br>
wal_keep_size to a value large enough to ensure that WAL segments are <br>
not recycled too early, or by configuring a replication slot for the <br>
standby. If you set up a WAL archive that&#39;s accessible from the standby=
, <br>
these solutions are not required, since the standby can always use the <br>
archive to catch up provided it retains enough segments.&quot;<br>
<br>
This is why it is good idea to go through the links I posted above.<br>
<br>
&gt; <br>
&gt; Regards,<br>
&gt; Koen De Groote<br>
&gt; <br>
<br>
<br>
-- <br>
Adrian Klaver<br>
<a href=3D"mailto:adrian.klaver@aklaver.com" target=3D"_blank">adrian.klave=
r@aklaver.com</a><br>
<br>
</blockquote></div>
</blockquote></div></div>

--0000000000006311a2061a451f17--