MIME-Version: 1.0
References: 
 <SEZPR01MB574850C01356D4ABD2F5BB0AEAF12@SEZPR01MB5748.apcprd01.prod.exchangelabs.com>
In-Reply-To: 
 <SEZPR01MB574850C01356D4ABD2F5BB0AEAF12@SEZPR01MB5748.apcprd01.prod.exchangelabs.com>
From: =?UTF-8?Q?Raphael_Salguero_Arag=C3=B3n?=
 <raphael.salguero@enterprisedb.com>
Date: Fri, 7 Feb 2025 08:21:50 +0100
Message-ID: 
 <CAA2=wKb-XE+t7DpksUNwoiFYN8pxjQBEbS3ab72cffs4cLAfMg@mail.gmail.com>
Subject: Re: Postgresql replication failed in Patroni
To: Mendbayar Alzakhgui <mendbayar.alz@unitel.mn>
Cc: "pgsql-admin@lists.postgresql.org" <pgsql-admin@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="000000000000fd39b4062d8836d3"
Archived-At: 
 <https://www.postgresql.org/message-id/CAA2%3DwKb-XE%2Bt7DpksUNwoiFYN8pxjQBEbS3ab72cffs4cLAfMg%40mail.gmail.com>
Precedence: bulk

--000000000000fd39b4062d8836d3
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi Mendbayar,

On Fri, 7 Feb 2025 at 07:04, Mendbayar Alzakhgui
<mendbayar.alz@unitel.mn> wrote:

> Hello everybody,
> I urgently need help with my Patroni-managed Postgres cluster.
>
> The PostgreSQL leader managed by Patroni crashed and is down. When we
> try to start PostgreSQL, it shows this error log:
>
> 2025-02-07 12:31:18 +08 [2354332]: [4-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> listening on IPv4 address "ip_address", port 5432
>
> 2025-02-07 12:31:18 +08 [2354332]: [5-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> listening on Unix socket "./.s.PGSQL.5432"
>
> 2025-02-07 12:31:18 +08 [2354337]: [1-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> database system was shut down in recovery at 2025-02-07 11:56:50 +08
>
> 2025-02-07 12:31:18 +08 [2354337]: [2-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> entering standby mode
>
> 2025-02-07 12:31:18 +08 [2354337]: [3-1] user=3D,db=3D,app=3D,client=3DFA=
TAL:
> requested timeline 20 is not a child of this server's history
>
> 2025-02-07 12:31:18 +08 [2354337]: [4-1] user=3D,db=3D,app=3D,client=3DDE=
TAIL:
> Latest checkpoint is at 71/4D8BB8C0 on timeline 19, but in the history of
> the requested timeline, the server forked off from that timeline at
> 71/4D793220.
>
> 2025-02-07 12:31:18 +08 [2354332]: [6-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> startup process (PID 2354337) exited with exit code 1
>
> 2025-02-07 12:31:18 +08 [2354332]: [7-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> aborting startup due to startup process failure
>
> 2025-02-07 12:31:18 +08 [2354332]: [8-1] user=3D,db=3D,app=3D,client=3DLO=
G:
> database system is shut down
>
>
> What should we check? Is this because the leader node has already
> deleted the WAL this node needs to start? Also, Debezium was connected
> to this node. Once we recover it, will Debezium resume automatically
> from where its sessions were disconnected? Please help me.
>
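
You're right, the crashed DB is not able to recover on its own. Its WAL
history has diverged from the new leader's: the latest checkpoint
(71/4D8BB8C0 on timeline 19) lies beyond the point where timeline 20
forked off (71/4D793220).
What is your DB size?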

The easiest way is to stop Patroni on the crashed instance (systemctl stop
patroni), then remove and recreate the data directory (also take care of
any tablespaces if they're in use).
Afterwards, restart the Patroni service on the crashed instance and run a
reinit for that node, which rebuilds it from the current leader:

patronictl -c /etc/patroni.yml reinit your_cluster_name replica_node

That should do the trick :)
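
Spelled out, the steps above could look roughly like the sketch below. It
only prints the commands so you can review them before running anything.
The helper name plan_reinit, the data directory path, the postgres owner
and the 0700 mode are assumptions for illustration, and moving the old
directory aside is a cautious stand-in for removing it.

```shell
#!/bin/sh
# Prints the recovery commands instead of running them, so the plan can
# be reviewed first. All argument values are placeholders (assumptions).
# $1: Patroni config file   $2: cluster name
# $3: member to rebuild     $4: PostgreSQL data directory
plan_reinit() {
  if [ "$#" -ne 4 ]; then
    echo "usage: plan_reinit CONFIG CLUSTER MEMBER DATADIR" >&2
    return 1
  fi
  # On the crashed node: stop Patroni so it does not restart PostgreSQL.
  echo "systemctl stop patroni"
  # Set the old data directory aside (a cautious stand-in for removal).
  echo "mv $4 $4.broken"
  # Recreate an empty data directory; owner and mode are assumptions.
  echo "install -d -m 0700 -o postgres -g postgres $4"
  echo "systemctl start patroni"
  # Check the member list, then rebuild the member from the leader.
  echo "patronictl -c $1 list"
  echo "patronictl -c $1 reinit $2 $3"
}

plan_reinit /etc/patroni.yml your_cluster_name replica_node /path/to/pgdata
```

Replace the placeholders with your own values. If tablespaces are in use,
their directories need the same treatment as the data directory.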


> Sincerely,
>
>
> * Mendbayar A. *| Database Administrator
>
> Information technology department
>
>
>
> +976 8611-2165
>
> mendbayar.alz@unitel.mn
>
> Central Tower, 11th floor
>
> www.unitel.mn
>
>
>
Best regards
 Raphael

--000000000000fd39b4062d8836d3--