MIME-Version: 1.0
From: Adam Blomeke <adam.blomeke@gmail.com>
Date: Fri, 12 Dec 2025 16:25:59 -0500
Message-ID: 
 <CAG9Amsj61Oy7YGJw40uv7fedZwabLu2b5dVmJK4aDrZLDMVj+w@mail.gmail.com>
Subject: Pgpool can't detect database status properly
To: pgpool-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="000000000000fbed0f0645c7e8cb"
Archived-At: 
 <https://www.postgresql.org/message-id/CAG9Amsj61Oy7YGJw40uv7fedZwabLu2b5dVmJK4aDrZLDMVj%2Bw%40mail.gmail.com>
Precedence: bulk

--000000000000fbed0f0645c7e8cb
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

I'm resending this as it's been sitting in the moderation queue for a
while. Possibly because I didn't have a subject line? Anyways, any help
would be great. Thanks!

I=E2=80=99m setting up a pgpool cluster to replace a single node database i=
n my
environment. The single node is separate from the cluster at the moment.
When it=E2=80=99s time to implement the DB I=E2=80=99m going to redo the ba=
ckup/restore,
throw an upgrade from pg15->18, and then bring the cluster and take over
the old IP.


*Environment:*

   - pgpool-II version: 4.6.3 (chirikoboshi)
   - PostgreSQL version: 18
   - OS: RHEL9
   - Cluster topology: 3 pgpool nodes (10.6.1.196, 10.6.1.197, 10.6.1.198)
   + 2 PostgreSQL nodes (10.6.1.199 primary, 10.6.1.200 standby)


*Issue:*

I have pgpool configured and I=E2=80=99ve set it up using the scripts and c=
onfig
files from a different instance, one which has been running just fine for a
year and a half or so. The issue I=E2=80=99m experiencing is that when I
detach/reattach a node, it sits in waiting constantly. It never transitions
to up. I have to manually change the status file to up for it to get to
agree that it is, and when I try to drop the node it doesn't actually drop
it. It just goes into waiting again. I also don=E2=80=99t see any connectio=
n
attempts from the pgpool server to the postgres nodes if I look at postgres
logs. I've confirmed that it can run the postgres commands from the command
line. I've tried this both running pgpool as a service and running it
directly from the command line. No difference in behavior.


Here=E2=80=99s the log output:

2025-12-03 14:20:49.037: main pid 1085028: LOG:  =3D=3D=3D Starting fail ba=
ck.
reconnect host 10.6.1.200(5432) =3D=3D=3D

2025-12-03 14:20:49.037: main pid 1085028: LOCATION:  pgpool_main.c:4169

2025-12-03 14:20:49.037: main pid 1085028: LOG:  Node 0 is not down
(status: 2)

2025-12-03 14:20:49.037: main pid 1085028: LOCATION:  pgpool_main.c:1524

2025-12-03 14:20:49.038: main pid 1085028: LOG:  Do not restart children
because we are failing back node id 1 host: 10.6.1.200 port: 5432 and we
are in streaming replication mode and not all backends were down

2025-12-03 14:20:49.038: main pid 1085028: LOCATION:  pgpool_main.c:4370

2025-12-03 14:20:49.038: main pid 1085028: LOG:
find_primary_node_repeatedly: waiting for finding a primary node

2025-12-03 14:20:49.038: main pid 1085028: LOCATION:  pgpool_main.c:2896

2025-12-03 14:20:49.189: main pid 1085028: LOG:  find_primary_node: primary
node is 0

2025-12-03 14:20:49.189: main pid 1085028: LOCATION:  pgpool_main.c:2815

2025-12-03 14:20:49.189: main pid 1085028: LOG:  find_primary_node: standby
node is 1

2025-12-03 14:20:49.189: main pid 1085028: LOCATION:  pgpool_main.c:2821

2025-12-03 14:20:49.189: main pid 1085028: LOG:  failover: set new primary
node: 0

2025-12-03 14:20:49.189: main pid 1085028: LOCATION:  pgpool_main.c:4660

2025-12-03 14:20:49.189: main pid 1085028: LOG:  failover: set new main
node: 0

2025-12-03 14:20:49.189: main pid 1085028: LOCATION:  pgpool_main.c:4667

2025-12-03 14:20:49.189: main pid 1085028: LOG:  =3D=3D=3D Failback done.
reconnect host 10.6.1.200(5432) =3D=3D=3D

2025-12-03 14:20:49.189: main pid 1085028: LOCATION:  pgpool_main.c:4763

2025-12-03 14:20:49.189: sr_check_worker pid 1085088: LOG:  worker process
received restart request

2025-12-03 14:20:49.189: sr_check_worker pid 1085088: LOCATION:
pool_worker_child.c:182

2025-12-03 14:20:50.189: pcp_main pid 1085087: LOG:  restart request
received in pcp child process

2025-12-03 14:20:50.189: pcp_main pid 1085087: LOCATION:  pcp_child.c:173

2025-12-03 14:20:50.193: main pid 1085028: LOG:  PCP child 1085087 exits
with status 0 in failover()

2025-12-03 14:20:50.193: main pid 1085028: LOCATION:  pgpool_main.c:4850

2025-12-03 14:20:50.193: main pid 1085028: LOG:  fork a new PCP child pid
1085089 in failover()

2025-12-03 14:20:50.193: main pid 1085028: LOCATION:  pgpool_main.c:4854

2025-12-03 14:20:50.193: pcp_main pid 1085089: LOG:  PCP process: 1085089
started

2025-12-03 14:20:50.193: pcp_main pid 1085089: LOCATION:  pcp_child.c:165

2025-12-03 14:20:50.194: sr_check_worker pid 1085090: LOG:  process started

2025-12-03 14:20:50.194: sr_check_worker pid 1085090: LOCATION:
pgpool_main.c:905

2025-12-03 14:22:31.460: pcp_main pid 1085089: LOG:  forked new pcp worker,
pid=3D1085093 socket=3D7

2025-12-03 14:22:31.460: pcp_main pid 1085089: LOCATION:  pcp_child.c:327

2025-12-03 14:22:31.721: pcp_main pid 1085089: LOG:  PCP process with pid:
1085093 exit with SUCCESS.

2025-12-03 14:22:31.721: pcp_main pid 1085089: LOCATION:  pcp_child.c:384

2025-12-03 14:22:31.721: pcp_main pid 1085089: LOG:  PCP process with pid:
1085093 exits with status 0

2025-12-03 14:22:31.721: pcp_main pid 1085089: LOCATION:  pcp_child.c:398

2025-12-03 14:25:39.480: child pid 1085050: LOG:  failover or failback
event detected

2025-12-03 14:25:39.480: child pid 1085050: DETAIL:  restarting myself

2025-12-03 14:25:39.480: child pid 1085050: LOCATION:  child.c:1524

2025-12-03 14:25:39.480: child pid 1085038: LOG:  failover or failback
event detected

2025-12-03 14:25:39.481: child pid 1085038: DETAIL:  restarting myself

2025-12-03 14:25:39.481: child pid 1085038: LOCATION:  child.c:1524

2025-12-03 14:25:39.481: child pid 1085035: LOG:  failover or failback
event detected

2025-12-03 14:25:39.481: child pid 1085035: DETAIL:  restarting myself

2025-12-03 14:25:39.481: child pid 1085035: LOCATION:  child.c:1524

2025-12-03 14:25:39.481: child pid 1085061: LOG:  failover or failback
event detected

2025-12-03 14:25:39.481: child pid 1085061: DETAIL:  restarting myself

2025-12-03 14:25:39.481: child pid 1085061: LOCATION:  child.c:1524

2025-12-03 14:25:39.483: child pid 1085053: LOG:  failover or failback
event detected

2025-12-03 14:25:39.483: child pid 1085053: DETAIL:  restarting myself

2025-12-03 14:25:39.483: child pid 1085053: LOCATION:  child.c:1524

2025-12-03 14:25:39.483: child pid 1085059: LOG:  failover or failback
event detected

......over and over and over again.


pcp_node_info output:

10.6.1.199 5432 1 0.500000 waiting up primary primary 0 none none
2025-12-03 14:04:39

10.6.1.200 5432 1 0.500000 waiting up standby standby 0 streaming async
2025-12-03 14:04:39

Logs show:

node status[0]: 1

node status[1]: 2

Node 0 (primary) gets status 1 (waiting), node 1 (standby) gets status 2
(up).

*auto_failback behavior:*

   - When a node is detached (pcp_detach_node), it goes to status 3 (down)
   - auto_failback triggers and moves it to status 1 (waiting)
   - Node never transitions from waiting to up

*Key configuration:*

backend_clustering_mode =3D 'streaming_replication'

backend_hostname0 =3D '10.6.1.199'

backend_hostname1 =3D '10.6.1.200'

backend_application_name0 =3D 'nasdw_users_1'

backend_application_name1 =3D 'nasdw_users_2'


use_watchdog =3D on

# 3 watchdog nodes configured


auto_failback =3D on

auto_failback_interval =3D 1


sr_check_period =3D 10

sr_check_user =3D 'pgpool'

sr_check_database =3D 'nasdw_users'


health_check_period =3D 1

health_check_user =3D 'pgpool'

health_check_database =3D 'nasdw_users'


failover_when_quorum_exists =3D on (default)

failover_require_consensus =3D on (default)
Cheers,
Adam

--000000000000fbed0f0645c7e8cb
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div><p class=3D"MsoNormal" style=3D"margin:0in;font-size:=
11pt;font-family:Aptos,sans-serif">I&#39;m resending this as it&#39;s been =
sitting in the moderation queue for a while. Possibly because I didn&#39;t =
have a subject line? Anyways, any help would be great. Thanks!<br><br>I=E2=
=80=99m setting up a pgpool cluster to replace a single node database in my=
 environment. The single node is separate from the cluster at the moment. W=
hen it=E2=80=99s time to implement the DB I=E2=80=99m going to redo the bac=
kup/restore, throw an upgrade from pg15-&gt;18, and then bring the cluster =
and take over the old IP.</p><p class=3D"MsoNormal" style=3D"margin:0in;fon=
t-size:11pt;font-family:Aptos,sans-serif"><b>=C2=A0</b></p><p class=3D"MsoN=
ormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif"><b>=
Environment:</b></p><ul type=3D"disc" style=3D"margin-top:0in;margin-bottom=
:0in"><li class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-famil=
y:Aptos,sans-serif">pgpool-II version: 4.6.3 (chirikoboshi)</li><li class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif">PostgreSQL version: 18</li><li class=3D"MsoNormal" style=3D"margin:0in=
;font-size:11pt;font-family:Aptos,sans-serif">OS: RHEL9</li><li class=3D"Ms=
oNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">C=
luster topology: 3 pgpool nodes (10.6.1.196, 10.6.1.197, 10.6.1.198) + 2 Po=
stgreSQL nodes (10.6.1.199 primary, 10.6.1.200 standby)</li></ul><p class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif"><b>=C2=A0</b></p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:=
11pt;font-family:Aptos,sans-serif"><b>Issue:</b></p><p class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">I have pgp=
ool configured and I=E2=80=99ve set it up using the scripts and config file=
s from a different instance, one which has been running just fine for a yea=
r and a half or so. The issue I=E2=80=99m experiencing is that when I detac=
h/reattach a node, it sits in waiting constantly. It never transitions to u=
p. I have to manually change the status file to up for it to get to agree t=
hat it is, and when I try to drop the node it doesn&#39;t actually drop it.=
 It just goes into waiting again. I also don=E2=80=99t see any connection a=
ttempts from the pgpool server to the postgres nodes if I look at=C2=A0post=
gres logs. I&#39;ve confirmed that it can run the postgres commands from th=
e command line. I&#39;ve tried this both running pgpool as a service and ru=
nning it directly from the command line. No difference in behavior.</p><p c=
lass=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,san=
s-serif">=C2=A0</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11p=
t;font-family:Aptos,sans-serif">Here=E2=80=99s the log output:<br><br>2025-=
12-03 14:20:49.037: main pid 1085028: LOG:=C2=A0 =3D=3D=3D Starting fail ba=
ck. reconnect host 10.6.1.200(5432) =3D=3D=3D</p><p class=3D"MsoNormal" sty=
le=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14=
:20:49.037: main pid 1085028: LOCATION:=C2=A0 pgpool_main.c:4169</p><p clas=
s=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-s=
erif">2025-12-03 14:20:49.037: main pid 1085028: LOG:=C2=A0 Node 0 is not d=
own (status: 2)</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11p=
t;font-family:Aptos,sans-serif">2025-12-03 14:20:49.037: main pid 1085028: =
LOCATION:=C2=A0 pgpool_main.c:1524</p><p class=3D"MsoNormal" style=3D"margi=
n:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.038:=
 main pid 1085028: LOG:=C2=A0 Do not restart children because we are failin=
g back node id 1 host: 10.6.1.200 port: 5432 and we are in streaming replic=
ation mode and not all backends were down</p><p class=3D"MsoNormal" style=
=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:2=
0:49.038: main pid 1085028: LOCATION:=C2=A0 pgpool_main.c:4370</p><p class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif">2025-12-03 14:20:49.038: main pid 1085028: LOG:=C2=A0 find_primary_nod=
e_repeatedly: waiting for finding a primary node</p><p class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03=
 14:20:49.038: main pid 1085028: LOCATION:=C2=A0 pgpool_main.c:2896</p><p c=
lass=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,san=
s-serif">2025-12-03 14:20:49.189: main pid 1085028: LOG:=C2=A0 find_primary=
_node: primary node is 0</p><p class=3D"MsoNormal" style=3D"margin:0in;font=
-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.189: main pid =
1085028: LOCATION:=C2=A0 pgpool_main.c:2815</p><p class=3D"MsoNormal" style=
=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:2=
0:49.189: main pid 1085028: LOG:=C2=A0 find_primary_node: standby node is 1=
</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:A=
ptos,sans-serif">2025-12-03 14:20:49.189: main pid 1085028: LOCATION:=C2=A0=
 pgpool_main.c:2821</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size=
:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.189: main pid 10850=
28: LOG:=C2=A0 failover: set new primary node: 0</p><p class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03=
 14:20:49.189: main pid 1085028: LOCATION:=C2=A0 pgpool_main.c:4660</p><p c=
lass=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,san=
s-serif">2025-12-03 14:20:49.189: main pid 1085028: LOG:=C2=A0 failover: se=
t new main node: 0</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:=
11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.189: main pid 108502=
8: LOCATION:=C2=A0 pgpool_main.c:4667</p><p class=3D"MsoNormal" style=3D"ma=
rgin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.1=
89: main pid 1085028: LOG:=C2=A0 =3D=3D=3D Failback done. reconnect host 10=
.6.1.200(5432) =3D=3D=3D</p><p class=3D"MsoNormal" style=3D"margin:0in;font=
-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:49.189: main pid =
1085028: LOCATION:=C2=A0 pgpool_main.c:4763</p><p class=3D"MsoNormal" style=
=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:2=
0:49.189: sr_check_worker pid 1085088: LOG:=C2=A0 worker process received r=
estart request</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt=
;font-family:Aptos,sans-serif">2025-12-03 14:20:49.189: sr_check_worker pid=
 1085088: LOCATION:=C2=A0 pool_worker_child.c:182</p><p class=3D"MsoNormal"=
 style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-0=
3 14:20:50.189: pcp_main pid 1085087: LOG:=C2=A0 restart request received i=
n pcp child process</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size=
:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:50.189: pcp_main pid 1=
085087: LOCATION:=C2=A0 pcp_child.c:173</p><p class=3D"MsoNormal" style=3D"=
margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:50=
.193: main pid 1085028: LOG:=C2=A0 PCP child 1085087 exits with status 0 in=
 failover()</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;fo=
nt-family:Aptos,sans-serif">2025-12-03 14:20:50.193: main pid 1085028: LOCA=
TION:=C2=A0 pgpool_main.c:4850</p><p class=3D"MsoNormal" style=3D"margin:0i=
n;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:20:50.193: mai=
n pid 1085028: LOG:=C2=A0 fork a new PCP child pid 1085089 in failover()</p=
><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Apto=
s,sans-serif">2025-12-03 14:20:50.193: main pid 1085028: LOCATION:=C2=A0 pg=
pool_main.c:4854</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11=
pt;font-family:Aptos,sans-serif">2025-12-03 14:20:50.193: pcp_main pid 1085=
089: LOG:=C2=A0 PCP process: 1085089 started</p><p class=3D"MsoNormal" styl=
e=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:=
20:50.193: pcp_main pid 1085089: LOCATION:=C2=A0 pcp_child.c:165</p><p clas=
s=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-s=
erif">2025-12-03 14:20:50.194: sr_check_worker pid 1085090: LOG:=C2=A0 proc=
ess started</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;fo=
nt-family:Aptos,sans-serif">2025-12-03 14:20:50.194: sr_check_worker pid 10=
85090: LOCATION:=C2=A0 pgpool_main.c:905</p><p class=3D"MsoNormal" style=3D=
"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:22:3=
1.460: pcp_main pid 1085089: LOG:=C2=A0 forked new pcp worker, pid=3D108509=
3 socket=3D7</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;f=
ont-family:Aptos,sans-serif">2025-12-03 14:22:31.460: pcp_main pid 1085089:=
 LOCATION:=C2=A0 pcp_child.c:327</p><p class=3D"MsoNormal" style=3D"margin:=
0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:22:31.721: p=
cp_main pid 1085089: LOG:=C2=A0 PCP process with pid: 1085093 exit with SUC=
CESS.</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-fam=
ily:Aptos,sans-serif">2025-12-03 14:22:31.721: pcp_main pid 1085089: LOCATI=
ON:=C2=A0 pcp_child.c:384</p><p class=3D"MsoNormal" style=3D"margin:0in;fon=
t-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:22:31.721: pcp_main=
 pid 1085089: LOG:=C2=A0 PCP process with pid: 1085093 exits with status 0<=
/p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Ap=
tos,sans-serif">2025-12-03 14:22:31.721: pcp_main pid 1085089: LOCATION:=C2=
=A0 pcp_child.c:398</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size=
:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39.480: child pid 1085=
050: LOG:=C2=A0 failover or failback event detected</p><p class=3D"MsoNorma=
l" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12=
-03 14:25:39.480: child pid 1085050: DETAIL:=C2=A0 restarting myself</p><p =
class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sa=
ns-serif">2025-12-03 14:25:39.480: child pid 1085050: LOCATION:=C2=A0 child=
.c:1524</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-f=
amily:Aptos,sans-serif">2025-12-03 14:25:39.480: child pid 1085038: LOG:=C2=
=A0 failover or failback event detected</p><p class=3D"MsoNormal" style=3D"=
margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39=
.481: child pid 1085038: DETAIL:=C2=A0 restarting myself</p><p class=3D"Mso=
Normal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">20=
25-12-03 14:25:39.481: child pid 1085038: LOCATION:=C2=A0 child.c:1524</p><=
p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,=
sans-serif">2025-12-03 14:25:39.481: child pid 1085035: LOG:=C2=A0 failover=
 or failback event detected</p><p class=3D"MsoNormal" style=3D"margin:0in;f=
ont-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39.481: child =
pid 1085035: DETAIL:=C2=A0 restarting myself</p><p class=3D"MsoNormal" styl=
e=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:=
25:39.481: child pid 1085035: LOCATION:=C2=A0 child.c:1524</p><p class=3D"M=
soNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">=
2025-12-03 14:25:39.481: child pid 1085061: LOG:=C2=A0 failover or failback=
 event detected</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11p=
t;font-family:Aptos,sans-serif">2025-12-03 14:25:39.481: child pid 1085061:=
 DETAIL:=C2=A0 restarting myself</p><p class=3D"MsoNormal" style=3D"margin:=
0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39.481: c=
hild pid 1085061: LOCATION:=C2=A0 child.c:1524</p><p class=3D"MsoNormal" st=
yle=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 1=
4:25:39.483: child pid 1085053: LOG:=C2=A0 failover or failback event detec=
ted</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-famil=
y:Aptos,sans-serif">2025-12-03 14:25:39.483: child pid 1085053: DETAIL:=C2=
=A0 restarting myself</p><p class=3D"MsoNormal" style=3D"margin:0in;font-si=
ze:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39.483: child pid 10=
85053: LOCATION:=C2=A0 child.c:1524</p><p class=3D"MsoNormal" style=3D"marg=
in:0in;font-size:11pt;font-family:Aptos,sans-serif">2025-12-03 14:25:39.483=
: child pid 1085059: LOG:=C2=A0 failover or failback event detected</p><p c=
lass=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,san=
s-serif">......over and over and over again.</p><p class=3D"MsoNormal" styl=
e=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif"><br></p><p cla=
ss=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-=
serif">=C2=A0</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;=
font-family:Aptos,sans-serif">pcp_node_info output:</p><p class=3D"MsoNorma=
l" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">10.6.1.=
199 5432 1 0.500000 waiting up primary primary 0 none none 2025-12-03 14:04=
:39</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-famil=
y:Aptos,sans-serif">10.6.1.200 5432 1 0.500000 waiting up standby standby 0=
 streaming async 2025-12-03 14:04:39</p><p class=3D"MsoNormal" style=3D"mar=
gin:0in;font-size:11pt;font-family:Aptos,sans-serif">Logs show:</p><p class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif">node status[0]: 1</p><p class=3D"MsoNormal" style=3D"margin:0in;font-s=
ize:11pt;font-family:Aptos,sans-serif">node status[1]: 2</p><p class=3D"Mso=
Normal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">No=
de 0 (primary) gets status 1 (waiting), node 1 (standby) gets status 2 (up)=
.</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:=
Aptos,sans-serif"><b>auto_failback behavior:</b></p><ul type=3D"disc" style=
=3D"margin-top:0in;margin-bottom:0in"><li class=3D"MsoNormal" style=3D"marg=
in:0in;font-size:11pt;font-family:Aptos,sans-serif">When a node is detached=
 (pcp_detach_node), it goes to status 3 (down)</li><li class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">auto_failb=
ack triggers and moves it to status 1 (waiting)</li><li class=3D"MsoNormal"=
 style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">Node neve=
r transitions from waiting to up</li></ul><p class=3D"MsoNormal" style=3D"m=
argin:0in;font-size:11pt;font-family:Aptos,sans-serif"><b>Key configuration=
:</b></p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-fam=
ily:Aptos,sans-serif">backend_clustering_mode =3D &#39;streaming_replicatio=
n&#39;</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-fa=
mily:Aptos,sans-serif">backend_hostname0 =3D &#39;10.6.1.199&#39;</p><p cla=
ss=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-=
serif">backend_hostname1 =3D &#39;10.6.1.200&#39;</p><p class=3D"MsoNormal"=
 style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">backend_a=
pplication_name0 =3D &#39;nasdw_users_1&#39;</p><p class=3D"MsoNormal" styl=
e=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">backend_applic=
ation_name1 =3D &#39;nasdw_users_2&#39;</p><p class=3D"MsoNormal" style=3D"=
margin:0in;font-size:11pt;font-family:Aptos,sans-serif">=C2=A0</p><p class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif">use_watchdog =3D on</p><p class=3D"MsoNormal" style=3D"margin:0in;font=
-size:11pt;font-family:Aptos,sans-serif"># 3 watchdog nodes configured</p><=
p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,=
sans-serif">=C2=A0</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:=
11pt;font-family:Aptos,sans-serif">auto_failback =3D on</p><p class=3D"MsoN=
ormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">aut=
o_failback_interval =3D 1</p><p class=3D"MsoNormal" style=3D"margin:0in;fon=
t-size:11pt;font-family:Aptos,sans-serif">=C2=A0</p><p class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">sr_check_p=
eriod =3D 10</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;f=
ont-family:Aptos,sans-serif">sr_check_user =3D &#39;pgpool&#39;</p><p class=
=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-se=
rif">sr_check_database =3D &#39;nasdw_users&#39;</p><p class=3D"MsoNormal" =
style=3D"margin:0in;font-size:11pt;font-family:Aptos,sans-serif">=C2=A0</p>=
<p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font-family:Aptos=
,sans-serif">health_check_period =3D 1</p><p class=3D"MsoNormal" style=3D"m=
argin:0in;font-size:11pt;font-family:Aptos,sans-serif">health_check_user =
=3D &#39;pgpool&#39;</p><p class=3D"MsoNormal" style=3D"margin:0in;font-siz=
e:11pt;font-family:Aptos,sans-serif">health_check_database =3D &#39;nasdw_u=
sers&#39;</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:11pt;font=
-family:Aptos,sans-serif">=C2=A0</p><p class=3D"MsoNormal" style=3D"margin:=
0in;font-size:11pt;font-family:Aptos,sans-serif">failover_when_quorum_exist=
s =3D on (default)</p><p class=3D"MsoNormal" style=3D"margin:0in;font-size:=
11pt;font-family:Aptos,sans-serif">failover_require_consensus =3D on (defau=
lt)</p></div><div><div dir=3D"ltr" class=3D"gmail_signature" data-smartmail=
=3D"gmail_signature"><div dir=3D"ltr"><div><div dir=3D"ltr"><div>Cheers,</d=
iv>Adam<br></div></div></div></div></div></div>

--000000000000fbed0f0645c7e8cb--