MIME-Version: 1.0
References: <CA+mZaOMbOTUrs1QXWKyaxLfWGM2N21dQ=72GRx6jodALL-r4aQ@mail.gmail.com>
 <IA3PR10MB811375110A03F754BA29A0048D4BA@IA3PR10MB8113.namprd10.prod.outlook.com>
In-Reply-To: <IA3PR10MB811375110A03F754BA29A0048D4BA@IA3PR10MB8113.namprd10.prod.outlook.com>
From: Greg Hennessy <greg.hennessy@gmail.com>
Date: Mon, 14 Jul 2025 14:25:19 -0400
Message-ID: <CA+mZaOMRBe4_szsyD-JVTQJ25ah6tOUdE_923ZffT-9zVjxqtQ@mail.gmail.com>
Subject: Re: optimizing number of workers
To: "Weck, Luis" <luis.weck@pismo.io>
Cc: "pgsql-general@lists.postgresql.org" <pgsql-general@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="0000000000005bfd560639e7ca46"
Archived-At: <https://www.postgresql.org/message-id/CA%2BmZaOMRBe4_szsyD-JVTQJ25ah6tOUdE_923ZffT-9zVjxqtQ%40mail.gmail.com>
Precedence: bulk

--0000000000005bfd560639e7ca46
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Setting min_parallel_table_scan_size and min_parallel_index_scan_size to
zero (not something I'd want to do in production) changes the number of
workers from 10 to 13. That is at least something, but if anyone knows
where discussions about using large numbers of CPUs in PostgreSQL are
being held, I'd appreciate learning about them.

Greg


On Fri, Jul 11, 2025 at 2:11=E2=80=AFPM Weck, Luis <luis.weck@pismo.io> wro=
te:

> *From: *Greg Hennessy <greg.hennessy@gmail.com>
> *Date: *Thursday, July 10, 2025 at 4:40=E2=80=AFPM
> *To: * pgsql-general@lists.postgresql.org <
> pgsql-general@lists.postgresql.org>
> *Subject: *optimizing number of workers
>
> Having just received a shiny new dual-CPU machine to use as a PostgreSQL
> server, I'm trying to make a reasonable effort to configure it correctly.
> The hardware has 128 cores, and I am running a VM with Red Hat 9 and
> PostgreSQL 16.9.
>
> In postgresql.conf I have:
> max_worker_processes =3D 90               # (change requires restart)
> max_parallel_workers_per_gather =3D 72    # gsh 26 oct 2022
> max_parallel_maintenance_workers =3D 72   # gsh 12 jun 2025
> max_parallel_workers =3D  72              # gsh 12 jun 2025
> max_logical_replication_workers =3D 72    # gsh 12 jun 2025
> max_sync_workers_per_subscription =3D 72   # gsh 12 jun 2025
> autovacuum_max_workers =3D 12   # max number of autovacuum subprocesses
>
> When I do a simple count of a large table (large being 1.8 billion
> entries), I get about 10 workers used.
>
> prod_v1_0_0_rc1=3D# explain (analyze, buffers) select count(*) from
> gaiadr3.gaia_source;
>
>                QUERY PLAN
>
> ------------------------------------------------------------------------
>  Finalize Aggregate  (cost=3D14379796.81..14379796.82 rows=3D1 width=3D8)
> (actual time=3D16702.806..16705.479 rows=3D1 loops=3D1)
>    Buffers: shared hit=3D2507481
>    ->  Gather  (cost=3D14379795.78..14379796.79 rows=3D10 width=3D8)
> (actual time=3D16702.513..16705.470 rows=3D11 loops=3D1)
>          Workers Planned: 10
>          Workers Launched: 10
>          Buffers: shared hit=3D2507481
>          ->  Partial Aggregate  (cost=3D14379785.78..14379785.79 rows=3D1
> width=3D8) (actual time=3D16691.820..16691.821 rows=3D1 loops=3D11)
>                Buffers: shared hit=3D2507481
>                ->  Parallel Index Only Scan using gaia_source_nest128 on
> gaia_source  (cost=3D0.58..13926632.85 rows=3D181261171 width=3D0)
> (actual time=3D0.025..9559.644 rows=3D164700888 loops=3D11)
>                      Heap Fetches: 0
>                      Buffers: shared hit=3D2507481
>  Planning:
>    Buffers: shared hit=3D163
>  Planning Time: 14.898 ms
>  Execution Time: 16705.592 ms
>
> Postgres has chosen to use only a small fraction of the CPUs I have on
> my machine. Given the query returns an answer in about 8 seconds, it may
> be that PostgreSQL has allocated the proper number of workers. But if I
> wanted to try to tweak some config parameters to see if using more
> workers would give me an answer faster, I don't seem to see any obvious
> knobs to turn. Are there parameters that I can adjust to see if I can
> increase throughput? Would adjusting parallel_setup_cost or
> parallel_tuple_cost be likely to help?
>
> I believe you can decrease min_parallel_table_scan_size (default is 8MB)
> and min_parallel_index_scan_size (default 512kB). The number of workers
> also depends on a multiple of these settings.
>
>

--0000000000005bfd560639e7ca46--
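A rough sketch of why zeroing the two thresholds only moved the plan from
10 to 13 workers. It assumes the planner's size-based heuristic (my reading
of compute_parallel_worker() in the PostgreSQL source) starts at one worker
and adds another each time the scanned relation is three times larger than
the previous threshold. The function name planned_workers and the page
count of 1,400,000 are hypothetical and chosen only for illustration; they
are not taken from the EXPLAIN output above. Thresholds are expressed in
8kB pages, so the 512kB default is 64 pages.

```python
def planned_workers(pages: int, min_scan_size_pages: int, max_per_gather: int) -> int:
    """Approximate the planner's size-based worker count for one scan.

    Sketch only: it ignores the per-table parallel_workers storage
    parameter and any cost-based comparison between candidate plans.
    """
    # Relations smaller than the threshold are not scanned in parallel.
    if pages < min_scan_size_pages:
        return 0
    workers = 1
    threshold = max(min_scan_size_pages, 1)
    # One more worker each time the relation is 3x larger again.
    while pages >= threshold * 3:
        workers += 1
        threshold *= 3
    # The result is capped by max_parallel_workers_per_gather.
    return min(workers, max_per_gather)


# Hypothetical page count, NOT measured on gaia_source.
ASSUMED_PAGES = 1_400_000

print(planned_workers(ASSUMED_PAGES, 64, 72))  # default threshold: 512kB = 64 pages
print(planned_workers(ASSUMED_PAGES, 0, 72))   # threshold set to zero
```

With the assumed page count this prints 10 and then 13. Because every
extra worker requires a relation three times larger, lowering the
thresholds can only add a handful of workers. A per-table override such as
ALTER TABLE ... SET (parallel_workers = N) bypasses the size heuristic,
although the result is still capped by max_parallel_workers_per_gather.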