MIME-Version: 1.0
References: 
 <CAL5Gniu4Gp36jaFOQVXmazMCCUMQMcGD_EvRJTvUx7aSOJcbWA@mail.gmail.com>
 <CAL5Gnit8qfT_10cADQnJxbCCWBXkzWqE_xbrUj7TLzeH+oXrog@mail.gmail.com>
In-Reply-To: 
 <CAL5Gnit8qfT_10cADQnJxbCCWBXkzWqE_xbrUj7TLzeH+oXrog@mail.gmail.com>
From: Greg Sabino Mullane <htamfids@gmail.com>
Date: Fri, 4 Jul 2025 09:45:55 -0400
Message-ID: 
 <CAKAnmmL+WtCjx819ZT_YbM_TPKN-SEaXB89kRxiQfrDX5YC3Cg@mail.gmail.com>
Subject: Re: Guidance Needed: Scaling PostgreSQL for 12 TB Data Growth - New
 Feature Implementation
To: Motog Plus <mplus7535@gmail.com>
Cc: pgsql-performance@postgresql.org
Content-Type: multipart/alternative; boundary="0000000000009ddae706391ab82b"
Archived-At: 
 <https://www.postgresql.org/message-id/CAKAnmmL%2BWtCjx819ZT_YbM_TPKN-SEaXB89kRxiQfrDX5YC3Cg%40mail.gmail.com>
Precedence: bulk

--0000000000009ddae706391ab82b
Content-Type: text/plain; charset="UTF-8"

It's hard to give generic recommendations for what really depends on your
specific needs, but here is one attempt:

> using HikariCP for connection pooling.


For better scaling, look into PgBouncer, which has very fast "transaction"
and "statement" pooling modes.

> ... manage 10-12 TB of data in a production environment, considering
> typical transaction loads.


Yes, 10 TB is very doable.

> We are considering splitting database "C" into two new databases: "C1" to
> exclusively house the "acc" schema, and "C2" for the remaining schemas. Is
> this a recommended approach for managing growth, and what are the potential
> pros and cons?


If they are logically connected, then keep them in the same database. Having
to go across databases (or across clusters) adds a lot of complexity
for little gain.


> ...or could both "C1" and "C2" reside on the same database server?


They could, but you would be sharing all the resources anyway, so you don't
gain much.


> is there a general "limit" or best practice for the maximum amount of data
> a single database server should handle (e.g., 10 TB) and similarly general
> limit per database?


No limit per se; it really depends on whether you start seeing effects on
your measured performance. There are lots of indirect things to keep in mind
as well: the time it takes to make backups, autovacuum effort, the time to
spin up replicas. These days 10 TB is not considered particularly huge, but
it really depends on your workload. Don't worry about limits per database -
it's all about the total cluster size; which database things live in is
mostly a housekeeping detail.
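
To put rough numbers on the backup and replica point, here is a minimal
sketch of the arithmetic. The throughput figures are assumptions picked
purely for illustration, not measurements from any real system, and
copy_hours is a hypothetical helper name.

```python
# Rough time to copy a whole cluster at a sustained throughput.
# The throughput values used below are hypothetical, for illustration only.

def copy_hours(size_tb: float, throughput_mb_per_s: float) -> float:
    """Hours needed to move size_tb terabytes at throughput_mb_per_s (decimal units)."""
    size_mb = size_tb * 1_000_000  # 1 TB = 1,000,000 MB
    return size_mb / throughput_mb_per_s / 3600


if __name__ == "__main__":
    for rate in (200, 1000):
        print(f"10 TB at {rate} MB/s: {copy_hours(10, rate):.1f} hours")
```

At an assumed 200 MB/s a full copy of 10 TB takes about 13.9 hours; at
1000 MB/s it takes about 2.8 hours. The same arithmetic applies to a full
backup, a restore, or seeding a new replica.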

> Beyond standard practices like indexing and partitioning, what other best
> practices should we consider implementing to ensure optimal performance and
> manageability with such a large dataset?


This is probably the vaguest question in the email. Obvious things are to
make sure you are doing heavy monitoring, both at the OS level and PG
level, particularly via log_min_duration_statement and pg_stat_statements.
Keep a close eye on bloat. Keep indexes to a minimum and make them all
justify their worth. Use partial and functional indexes. Make sure your
backups are solid (use pgBackRest). Test your restores regularly. Use
PgBouncer. Send simple selects to the read replicas. Automate everything
you can. Be paranoid. Assume the application is going to do everything
wrong and try to destroy your database. Get a seasoned PG DBA who will know
how to do all this and what else to look for (the mailing lists are good,
but mostly reactive and asynchronous, as you are now discovering).
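
As one illustration of the pg_stat_statements part, here is a minimal
sketch that ranks statements by their share of total execution time. It
works on already-fetched rows and never connects to a database. The sample
rows and table names are made up, rank_by_total_time is a hypothetical
helper, and the column names assume PostgreSQL 13 or later (older releases
call the column total_time).

```python
from typing import Dict, List

# The query a real script would run; shown for context only.
TOP_QUERIES_SQL = """
SELECT query, calls, total_exec_time
FROM pg_stat_statements
"""


def rank_by_total_time(rows: List[Dict], limit: int = 3) -> List[Dict]:
    """Return the `limit` most expensive statements with mean time and share of total."""
    grand_total = sum(r["total_exec_time"] for r in rows) or 1.0
    ranked = sorted(rows, key=lambda r: r["total_exec_time"], reverse=True)
    return [
        {
            "query": r["query"],
            "mean_ms": r["total_exec_time"] / max(r["calls"], 1),
            "share": r["total_exec_time"] / grand_total,
        }
        for r in ranked[:limit]
    ]


# Made-up rows standing in for a pg_stat_statements snapshot (times in ms).
SAMPLE_ROWS = [
    {"query": "SELECT * FROM orders WHERE id = $1", "calls": 90000, "total_exec_time": 45000.0},
    {"query": "SELECT count(*) FROM events", "calls": 50, "total_exec_time": 150000.0},
    {"query": "UPDATE balances SET amount = $1 WHERE id = $2", "calls": 1000, "total_exec_time": 5000.0},
]

if __name__ == "__main__":
    for entry in rank_by_total_time(SAMPLE_ROWS):
        print(f"{entry['share']:.1%}  mean {entry['mean_ms']:.1f} ms  {entry['query']}")
```

In this made-up sample the rarely called count(*) accounts for 75% of
total time, which is the kind of thing a per-call view alone tends to hide.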

> Hardware Configuration Recommendations: Based on our projected data growth
> and desired performance, what hardware configurations (e.g., RAM, CPU,
> storage I/O, storage type like NVMe) would you recommend for future
> database servers to efficiently handle 10-12 TB?


Maybe someone else can attempt specifics, but it's too open-ended a
question for me. Storage should be fast but, above all, stable and reliable.
More RAM is always good. More cores are always good. Postgres scales well
vertically. Offload as much work as possible (including backups) to the
replicas. The 10-12 TB figure is a little meaningless except with regard to
backups: what matters is how much of that 10 TB is being actively used.

> Open-Source Horizontal Scaling Solutions: Are there any open-source
> horizontal scaling solutions for PostgreSQL (other than Citus Data) that
> the community recommends or has experience with for managing extremely
> large datasets?


Citus is very good for certain datasets, but it can be overkill for many
situations. Don't overlook streaming replication + Pgpool-II/HAProxy as a
good start for basic horizontal scaling.

The more specific your future questions are, the better the replies you will
get. Database size alone is not a very good metric. Some more useful
things to measure would be WAL rate, txn rate, active data size (i.e.
shared_buffers analysis), number of active connections, and which queries
are the most expensive.
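
For the WAL rate in particular, one simple approach is to sample
pg_current_wal_lsn() twice and divide the difference by the elapsed time.
Below is a minimal sketch of that arithmetic done client-side; the LSN
values are invented for illustration and both function names are
hypothetical. The server can also do the subtraction itself with
pg_wal_lsn_diff().

```python
def lsn_to_bytes(lsn: str) -> int:
    """Convert a pg_lsn string such as '16/B374D848' to an absolute byte position."""
    high, low = lsn.split("/")
    # An LSN is a 64-bit position printed as two 32-bit hexadecimal halves.
    return (int(high, 16) << 32) + int(low, 16)


def wal_rate_mib_per_s(lsn_start: str, lsn_end: str, seconds: float) -> float:
    """Average WAL generation rate in MiB/s between two LSN samples."""
    written = lsn_to_bytes(lsn_end) - lsn_to_bytes(lsn_start)
    return written / seconds / (1024 * 1024)


if __name__ == "__main__":
    # Two invented samples taken 60 seconds apart.
    print(f"{wal_rate_mib_per_s('0/0', '0/3C00000', 60):.2f} MiB/s")
```

Sampled over a full business day rather than a single minute, a number
like this says far more about replication and backup load than the total
database size does.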

Cheers,
Greg

--
Crunchy Data - https://www.crunchydata.com
Enterprise Postgres Software Products & Tech Support

--0000000000009ddae706391ab82b--