MIME-Version: 1.0
References: <CAKeSjMhHx1cMSTPTP7szhyV5Mt21YuA98Q+hS9+W2nrj0U9N3Q@mail.gmail.com>
 <CAKkG4_=V2UN62_CjRohbpZ_uxP56ai465ohNp1bt-+r41wCxig@mail.gmail.com> <CAKeSjMh3YKDkcDORDL8cE4DbwaAuWrRz_aoS1GoS-XX=Ki5KvQ@mail.gmail.com>
In-Reply-To: <CAKeSjMh3YKDkcDORDL8cE4DbwaAuWrRz_aoS1GoS-XX=Ki5KvQ@mail.gmail.com>
From: =?UTF-8?Q?Torsten_F=C3=B6rtsch?= <tfoertsch123@gmail.com>
Date: Thu, 23 May 2024 14:15:47 +0200
Message-ID: <CAKkG4_nK1DYU=DTe+hb-TQnS5+05hJ1MXTf5QNKJSfsLrZ-7iA@mail.gmail.com>
Subject: Re: Backup failure Postgres
To: Jethish Jethish <jethish777@gmail.com>
Cc: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="000000000000801f1e06191e03b8"
Archived-At: <https://www.postgresql.org/message-id/CAKkG4_nK1DYU%3DDTe%2Bhb-TQnS5%2B05hJ1MXTf5QNKJSfsLrZ-7iA%40mail.gmail.com>
Precedence: bulk

--000000000000801f1e06191e03b8
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Look, you have to compromise somewhere. Let me explain the problem. PG uses
MVCC. That means if you update or delete rows, rows are not actually
modified or added back to free space. They are just marked for later
removal. That actual removal is VACUUM's task. The reason for doing so is
that a concurrent transaction might still need to see the modified or
deleted row. Now vacuum comes along and wants to actually add things back
to free space. On the master that works fine because the master knows all
concurrent transactions and what they might still need. So, vacuum will
simply skip those rows.

However, that knowledge does not extend to the replica. The master does not
know which transactions are running on the replica. So a vacuum operation
on the master might remove something that's still needed on the replica.
Now, that modification made by vacuum also needs to be replayed on the
replica. The way that works is by adding all modifications including insert
or vacuum or any other change in sequential order to a log (write-ahead-log
or WAL). This log is then simply shipped to the replica and replayed.

It's not difficult to understand that these changes must be replayed in the
same sequential order. Otherwise you get chaos. Now imagine a vacuum
operation at the replica which removes stuff that is still needed by a
transaction running on the replica like your COPY. Now the replica has 2
choices:

- abort the transaction and prefer replaying WAL
- pause replaying WAL and wait for the long running transaction

The 1st case is obviously bad for the transaction. The 2nd choice is bad
for everybody else because WAL can be replayed only in the same order as it
is generated. So, nothing that happened after that vacuum can be replayed
which leads to stale data on the replica.

One way to mitigate this is hot_standby_feedback. That way the replica
tells the master from time to time which old rows it still needs to see.
The drawback of this is that your tables on the master might accumulate
garbage that would normally be removed by vacuum earlier. That can affect
query performance.

Then you have the option to pause WAL replay one or the other way.
max_standby_streaming_delay, disconnecting from the master or explicitly
pausing replay, all fall in that category.

The last option I know of would be to use logical replication. That comes
with other problems. DDL becomes a bit finicky. Initial setup can be
tricky. The process applying the changes can become a bottleneck.

If you are really time-critical and you just want the COPY job to be done
and neither lag nor bloat are acceptable, then maybe you create another
streaming replica, disconnect it from the master, run your COPY job and
destroy the replica. If 3TB is the database size, then that does not look
unsurmountable. Of course, you need the resources. In my environment I'd
estimate 3-4 hours.

If you want a simple solution, then try hot_standby_feedback.

On Thu, May 23, 2024 at 12:46=E2=80=AFPM Jethish Jethish <jethish777@gmail.=
com>
wrote:

> Hi Torsten,
>
> I have tried by increasing the max_standby_streaming_delay but I'm facing
> lag issues on the replica server.
>
> When i increase the max_standby_streaming_delay even if a query runs for =
2
> minutes I'm facing lag issues for 2 minutes.
>
> Please suggest here.
> Data size is 3TB
>
> On Thu, May 23, 2024, 3:53=E2=80=AFPM Torsten F=C3=B6rtsch <tfoertsch123@=
gmail.com>
> wrote:
>
>> As the error message says, your query was aborted due to it conflicting
>> with recovery. There are many ways to deal with that. You could enable
>> hot_standby_feedback on the replica. You could disconnect the replica fr=
om
>> the master for the time the COPY takes (reset primary_conninfo). You cou=
ld
>> increase max_standby_streaming_delay. Perhaps you could also wrap the CO=
PY
>> operation in pg_wal_replay_pause() / pg_wal_replay_resume().
>>
>> On Thu, May 23, 2024 at 11:59=E2=80=AFAM Jethish Jethish <jethish777@gma=
il.com>
>> wrote:
>>
>>> I'm frequently facing the below error while performing backup. Someone
>>> please tell how solve this issues.
>>>
>>>
>>> Failed : pg_dump: error: Dumping the contents of table "botsession"
>>> failed: PQgetResult() failed. pg_dump: error: Error message from server=
:
>>> ERROR: canceling statement due to conflict with recovery DETAIL: User q=
uery
>>> might have needed to see row versions that must be removed. pg_dump: er=
ror:
>>> The command was: COPY public.botsession (id, userid, data, iscompressed=
) TO
>>> stdout;
>>>
>>

--000000000000801f1e06191e03b8
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Look, you have=C2=A0to compromise somewhere. Let me explai=
n the problem. PG uses MVCC. That means if you update or delete rows, rows =
are not actually modified or added back to free space. They are just marked=
 for later removal. That actual removal is VACUUM&#39;s task. The reason fo=
r doing so is that a concurrent transaction might still need to see the mod=
ified or deleted row. Now vacuum comes along and wants to actually add thin=
gs back to free space. On the master that works fine because the master kno=
ws all concurrent transactions and what they might still need. So, vacuum w=
ill simply skip those rows.<div><br></div><div>However, that knowledge does=
 not extend to the replica. The master does not know which transactions are=
 running on the replica. So a vacuum operation on the master might remove s=
omething that&#39;s still needed on the replica. Now, that modification mad=
e by vacuum also needs to be replayed on the replica. The way that works is=
 by adding all modifications including insert or vacuum or any other change=
 in sequential order to a log (write-ahead-log or WAL). This log is then si=
mply shipped to the replica and replayed.</div><div><br></div><div>It&#39;s=
 not difficult to understand that these changes must be replayed in the sam=
e sequential order. Otherwise you get chaos. Now imagine a vacuum operation=
 at the replica which removes stuff that is still needed by a transaction r=
unning on the replica like your COPY. Now the replica has 2 choices:</div><=
div><br></div><div>- abort the transaction and prefer replaying WAL</div><d=
iv>- pause replaying WAL and wait for the long running transaction</div><di=
v><br></div><div>The 1st case is obviously bad for the transaction. The 2nd=
 choice is bad for everybody else because WAL can be replayed only in the s=
ame order as it is generated. So, nothing that happened after that vacuum c=
an be replayed which leads to stale data on the replica.</div><div><br></di=
v><div>One way to mitigate this is hot_standby_feedback. That way the repli=
ca tells the master from time to time which old rows it still needs to see.=
 The drawback of this is that your tables on the master might accumulate ga=
rbage that would normally be removed by vacuum earlier. That can affect que=
ry performance.</div><div><br></div><div>Then you have the option to pause =
WAL replay one or the other way. max_standby_streaming_delay, disconnecting=
 from the master or explicitly pausing replay, all fall in that category.</=
div><div><br></div><div>The last option I know of would be to use logical r=
eplication. That comes with other problems. DDL becomes a bit finicky. Init=
ial setup can be tricky. The process applying the changes can become a bott=
leneck.</div><div><br></div><div>If you are really time-critical and you ju=
st want the COPY job to be done and neither lag nor bloat are acceptable, t=
hen maybe you create another streaming replica, disconnect it from the mast=
er, run your COPY job and destroy the replica. If 3TB is the database size,=
 then that does not look unsurmountable. Of course, you need the=C2=A0resou=
rces. In my environment I&#39;d estimate 3-4 hours.</div><div><br></div><di=
v>If you want a simple solution, then try hot_standby_feedback.</div></div>=
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Thu=
, May 23, 2024 at 12:46=E2=80=AFPM Jethish Jethish &lt;<a href=3D"mailto:je=
thish777@gmail.com">jethish777@gmail.com</a>&gt; wrote:<br></div><blockquot=
e class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px s=
olid rgb(204,204,204);padding-left:1ex"><div dir=3D"auto">Hi Torsten,<div d=
ir=3D"auto"><br></div><div dir=3D"auto">I have tried by increasing the max_=
standby_streaming_delay but I&#39;m facing lag issues on the replica server=
.</div><div dir=3D"auto"><br></div><div dir=3D"auto">When i increase the ma=
x_standby_streaming_delay even if a query runs for 2 minutes I&#39;m facing=
 lag issues for 2 minutes.</div><div dir=3D"auto"><br></div><div dir=3D"aut=
o">Please suggest here.</div><div dir=3D"auto">Data size is 3TB</div></div>=
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Thu=
, May 23, 2024, 3:53=E2=80=AFPM Torsten F=C3=B6rtsch &lt;<a href=3D"mailto:=
tfoertsch123@gmail.com" target=3D"_blank">tfoertsch123@gmail.com</a>&gt; wr=
ote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px=
 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D=
"ltr">As the error message says, your query was aborted due to it conflicti=
ng with recovery. There are many ways to deal with that. You could enable h=
ot_standby_feedback on the replica. You could disconnect the replica from t=
he master for the time the COPY takes (reset primary_conninfo). You could i=
ncrease max_standby_streaming_delay. Perhaps you could also wrap the COPY o=
peration in pg_wal_replay_pause() / pg_wal_replay_resume().</div><br><div c=
lass=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Thu, May 23, =
2024 at 11:59=E2=80=AFAM Jethish Jethish &lt;<a href=3D"mailto:jethish777@g=
mail.com" rel=3D"noreferrer" target=3D"_blank">jethish777@gmail.com</a>&gt;=
 wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px =
0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=
=3D"auto">I&#39;m frequently facing the below error while performing backup=
. Someone please tell how solve this issues.<div dir=3D"auto"><br></div><di=
v dir=3D"auto"><br></div><div dir=3D"auto"><p style=3D"margin-top:0px;margi=
n-bottom:0px">Failed : pg_dump: error: Dumping the contents of table &quot;=
botsession&quot; failed: PQgetResult() failed. pg_dump: error: Error messag=
e from server: ERROR: canceling statement due to conflict with recovery DET=
AIL: User query might have needed to see row versions that must be removed.=
 pg_dump: error: The command was: COPY public.botsession (id, userid, data,=
 iscompressed) TO stdout;</p></div></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>

--000000000000801f1e06191e03b8--