MIME-Version: 1.0
References: <18DEA0D4-1D33-48CA-A037-FF01AFE37471@leisi.net>
In-Reply-To: <18DEA0D4-1D33-48CA-A037-FF01AFE37471@leisi.net>
From: Zahid Rahman <zahidr1000@gmail.com>
Date: Thu, 7 Nov 2024 23:49:16 +0000
Message-ID: <CAPGSW3TdsjQZykFrB=hR4fnZCaSacdi_EOuWZndOBWfGWGNAUQ@mail.gmail.com>
Subject: Re: Advice on cluster architecture for two related, but distinct, use cases
To: Matthias Leisi <matthias@leisi.net>
Cc: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="000000000000e2a0d006265b4855"
Archived-At: <https://www.postgresql.org/message-id/CAPGSW3TdsjQZykFrB%3DhR4fnZCaSacdi_EOuWZndOBWfGWGNAUQ%40mail.gmail.com>
Precedence: bulk

--000000000000e2a0d006265b4855
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Perhaps a 14-minute investment in this article may prove fruitful.

https://medium.com/@martin.hodges/adding-a-postgres-high-availability-database-to-your-kubernetes-cluster-634ea5d6e4a1


On Thu, 7 Nov 2024, 21:06 Matthias Leisi, <matthias@leisi.net> wrote:

> Dear all,
>
> (This is a follow-up to a question I asked almost exactly a year ago,
> https://postgrespro.com/list/thread-id/2670756#726F3765-858C-4AC0-A7B0-5CB6720E4B37@leisi.net -
> the requirements have changed since then, and the platform has changed from
> OpenBSD to Linux, which may make some things easier.)
>
>
> I’m looking for advice on Postgres cluster architecture(s) for two related
> but distinct use cases. Ideally, the approaches for the two use cases would
> not differ too widely.
>
> The goal of clustering is low RPO (I guess we need sync clustering) and
> low RTO (ideally almost-instant failover, but a failover process of up to a
> minute in the worst case could be acceptable); throughput is not a concern
> (it’s relatively low transaction volume except for some often-written
> statistics data, which is still moderate). Latency (due to the distance
> between datacenters for georedundancy) is a fact we are willing to accept.
>
>
> The first use case is in an environment under our own control (and where
> e.g. a DBA could intervene). We can theoretically run any number of cluster
> instances, but assume we would use an even number (split over the two
> datacenters), or potentially an odd number of nodes (e.g. with an arbiter).
> We could use a load balancer, but I guess this would strongly deviate from
> the second use case:
>
>
> In the second use case, the environment is not under our control, so we
> can only assume basic network connectivity from the application to the DB,
> and between the DBs (the latter potentially through an SSH tunnel if
> needed). In this use case, we cannot assume a person will intervene if a
> node goes down, and would prefer some automated failover to the other node
> (this automation would also be welcome for the first use case, e.g. if
> something happens while nobody is watching). We cannot assume e.g. a load
> balancer.
>
> There could be various ways the environment in the second use case is
> set up, ranging from "application and database running on the same box"
> (well, no clustering for you then…), to a dedicated two- or three-node
> database cluster serving a number of application machines.
>
>
> In both use cases, we have full control over the application and the
> database code and environment.
>
> From reading various docs, it seems we would need something like Patroni
> (/Percona), at least for the first use case. However, it seems relatively
> complex to set up and operate.
>
> I would appreciate your experience and input into which approach would
> best fit the two use cases. We are also willing to engage in paid
> consulting.
>
> Thanks,
> — Matthias
>
> --
> Matthias Leisi
> Katzenrütistrasse 68, 8153 Rümlang
> Mobile +41 79 377 04 43
> matthias@leisi.net
>

--000000000000e2a0d006265b4855--