MIME-Version: 1.0
References: <20220805.114916.994654810780821553.horikyota.ntt@gmail.com>
 <CALj2ACWPMYoPSC3t-9uW+0gqDUcJf1mLww6hHzo2V2AvE-Tu+w@mail.gmail.com>
 <20220809.161236.1486509314201074910.horikyota.ntt@gmail.com>
 <CALj2ACXmMWtpmuT-=v8F+Lk4QCbdkeN+yHKXeRGKFfjG96YbKA@mail.gmail.com>
 <CALj2ACUO6oz-43ryqfMOVZ_Q-N10C5tkzKku12+QV02NnXsDrw@mail.gmail.com>
 <YzYh3NpCQAFkA6lF@momjian.us>
 <CAAhFRxjFGSk-hVTjnpFwm1XBUcHL8Obugt=P+ixV5AD9H+Kkrw@mail.gmail.com>
 <CAAhFRxgcBy-UCvyJ1ZZ1UKf4Owrx4J2X1F4tN_FD=fh5wZgdkw@mail.gmail.com>
 <CALj2ACVG5KCoPD_5AF2_u07HuZe4ajaLWKycB6OBYsGuj67OhA@mail.gmail.com>
 <CAHg+QDf9sMJ-r9JqFQTALRy8dX8Mr6SoFEvXx8V-Tto10VcFPA@mail.gmail.com>
 <Y4YzWeRgDYOj5Rod@momjian.us>
 <CAAhFRxi5f+2hB7X-y0MZLnC96EQYbTLucovyg27vjAUeaWJuGQ@mail.gmail.com>
In-Reply-To: 
 <CAAhFRxi5f+2hB7X-y0MZLnC96EQYbTLucovyg27vjAUeaWJuGQ@mail.gmail.com>
From: SATYANARAYANA NARLAPURAM <satyanarlapuram@gmail.com>
Date: Tue, 29 Nov 2022 11:20:19 -0800
Message-ID: 
 <CAHg+QDf9V-aMi0su9k5X8ru8KEQjLWRVQrGO7KYQUVpYKMObmw@mail.gmail.com>
Subject: Re: An attempt to avoid
 locally-committed-but-not-replicated-to-standby-transactions
 in synchronous replication
To: Andrey Borodin <amborodin86@gmail.com>
Cc: Bruce Momjian <bruce@momjian.us>,
	Bharath Rupireddy <bharath.rupireddyforpostgres@gmail.com>,
	Kyotaro Horiguchi <horikyota.ntt@gmail.com>,
 Laurenz Albe <laurenz.albe@cybertec.at>,
	PostgreSQL Hackers <pgsql-hackers@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="000000000000ab9c9105eea0e191"
Archived-At: 
 <https://www.postgresql.org/message-id/CAHg%2BQDf9V-aMi0su9k5X8ru8KEQjLWRVQrGO7KYQUVpYKMObmw%40mail.gmail.com>
Precedence: bulk

--000000000000ab9c9105eea0e191
Content-Type: text/plain; charset="UTF-8"

On Tue, Nov 29, 2022 at 10:52 AM Andrey Borodin <amborodin86@gmail.com>
wrote:

> On Tue, Nov 29, 2022 at 8:29 AM Bruce Momjian <bruce@momjian.us> wrote:
> >
> > On Tue, Nov 29, 2022 at 08:14:10AM -0800, SATYANARAYANA NARLAPURAM wrote:
> > >     2. Process proc die immediately when a backend is waiting for sync
> > >     replication acknowledgement, as it does today, however, upon
> restart,
> > >     don't open up for business (don't accept ready-only connections)
> > >     unless the sync standbys have caught up.
> > >
> > >
> > > Are you planning to block connections or queries to the database? It
> would be
> > > good to allow connections and let them query the monitoring views but
> block the
> > > queries until sync standby have caught up. Otherwise, this leaves a
> monitoring
> > > hole. In cloud, I presume superusers are allowed to connect and
> monitor (end
> > > customers are not the role members and can't query the data). The same
> can't be
> > > true for all the installations. Could you please add more details on
> your
> > > approach?
> >
> > I think ALTER SYSTEM should be allowed, particularly so you can modify
> > synchronous_standby_names, no?
>
> We don't allow SQL access during crash recovery until it's caught up
> to consistency point. And that's for a reason - the cluster may have
> invalid system catalog.
> So no, after crash without a quorum of standbys you can only change
> auto.conf and send SIGHUP. Accessing the system catalog during crash
> recovery is another unrelated problem.
>

In the crash recovery case, catalog is inconsistent but in this case, the
cluster has remote uncommitted changes (consistent). Accepting a superuser
connection is no harm. The auth checks performed are still valid after
standbys fully caught up. I don't see a reason why superuser / pg_monitor
connections are required to be blocked.


> But I'd propose to treat these two points differently, they possess
> drastically different scales of danger. Query Cancels are issued here
> and there during failovers\switchovers. Crash amidst network
> partitioning is not that common.
>

Supportability and operability are more important in corner cases to
quickly troubleshoot an issue,


>
> Best regards, Andrey Borodin.
>

--000000000000ab9c9105eea0e191
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote">=
<div dir=3D"ltr" class=3D"gmail_attr">On Tue, Nov 29, 2022 at 10:52 AM Andr=
ey Borodin &lt;<a href=3D"mailto:amborodin86@gmail.com">amborodin86@gmail.c=
om</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margi=
n:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex=
">On Tue, Nov 29, 2022 at 8:29 AM Bruce Momjian &lt;<a href=3D"mailto:bruce=
@momjian.us" target=3D"_blank">bruce@momjian.us</a>&gt; wrote:<br>
&gt;<br>
&gt; On Tue, Nov 29, 2022 at 08:14:10AM -0800, SATYANARAYANA NARLAPURAM wro=
te:<br>
&gt; &gt;=C2=A0 =C2=A0 =C2=A02. Process proc die immediately when a backend=
 is waiting for sync<br>
&gt; &gt;=C2=A0 =C2=A0 =C2=A0replication acknowledgement, as it does today,=
 however, upon restart,<br>
&gt; &gt;=C2=A0 =C2=A0 =C2=A0don&#39;t open up for business (don&#39;t acce=
pt ready-only connections)<br>
&gt; &gt;=C2=A0 =C2=A0 =C2=A0unless the sync standbys have caught up.<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; Are you planning to block connections or queries to the database?=
 It would be<br>
&gt; &gt; good to allow connections and let them query the monitoring views=
 but block the<br>
&gt; &gt; queries until sync standby have caught up. Otherwise, this leaves=
 a monitoring<br>
&gt; &gt; hole. In cloud, I presume superusers are allowed to connect and m=
onitor (end<br>
&gt; &gt; customers are not the role members and can&#39;t query the data).=
 The same can&#39;t be<br>
&gt; &gt; true for all the installations. Could you please add more details=
 on your<br>
&gt; &gt; approach?<br>
&gt;<br>
&gt; I think ALTER SYSTEM should be allowed, particularly so you can modify=
<br>
&gt; synchronous_standby_names, no?<br>
<br>
We don&#39;t allow SQL access during crash recovery until it&#39;s caught u=
p<br>
to consistency point. And that&#39;s for a reason - the cluster may have<br=
>
invalid system catalog.<br>
So no, after crash without a quorum of standbys you can only change<br>
auto.conf and send SIGHUP. Accessing the system catalog during crash<br>
recovery is another unrelated problem.<br></blockquote><div><br></div><div>=
In the crash recovery case, catalog is inconsistent but in this case, the c=
luster has remote uncommitted changes (consistent). Accepting a superuser c=
onnection is no harm. The auth checks performed are still valid after stand=
bys fully caught up. I don&#39;t see a reason why superuser / pg_monitor co=
nnections are required to be blocked.</div><div><br></div><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rg=
b(204,204,204);padding-left:1ex">
<br>
But I&#39;d propose to treat these two points differently, they possess<br>
drastically different scales of danger. Query Cancels are issued here<br>
and there during failovers\switchovers. Crash amidst network<br>
partitioning is not that common.<br></blockquote><div><br></div><div>Suppor=
tability and operability are more important in corner cases to quickly trou=
bleshoot an issue,<br></div><div>=C2=A0</div><blockquote class=3D"gmail_quo=
te" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204=
);padding-left:1ex">
<br>
Best regards, Andrey Borodin.<br>
</blockquote></div></div>

--000000000000ab9c9105eea0e191--