MIME-Version: 1.0
References: <CAEzWdqd0SPkZMYNaAbERdgczkfQqLmNV5JBMmF-F9s7KjxJ0gw@mail.gmail.com>
 <323794933.277637.1770220093639@mail.yahoo.com>
In-Reply-To: <323794933.277637.1770220093639@mail.yahoo.com>
From: yudhi s <learnerdatabase99@gmail.com>
Date: Thu, 5 Feb 2026 14:05:49 +0530
Message-ID: <CAEzWdqd2ELrUo1xnrFaB9vCicjTuHk+KEbXEaiyX0jp1f3M+tQ@mail.gmail.com>
Subject: Re: Top -N Query performance issue and high CPU usage
To: felix.quintgz@yahoo.com
Cc: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="000000000000fa0cd6064a0f8f06"
Archived-At: <https://www.postgresql.org/message-id/CAEzWdqd2ELrUo1xnrFaB9vCicjTuHk%2BKEbXEaiyX0jp1f3M%2BtQ%40mail.gmail.com>
Precedence: bulk

--000000000000fa0cd6064a0f8f06
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Wed, Feb 4, 2026 at 9:18=E2=80=AFPM <felix.quintgz@yahoo.com> wrote:

>
> Have you tried adding an index to txn_tbl.txn_type?
> And a vacuum on all tables? It seems the visibility map is outdated.
>
> I'm using https://explain.dalibo.com to view the plan visually; it's more
> convenient.
>
> You could use the option to periodically save the results of queries with
> common filters to another table, and then retrieve the results from that
> table when a user performs a query with their own filters.
> You should also store the user's query results somewhere for a while to
> prevent excessive database access.
>
> I imagine this is some kind of dashboard that each user is taken to after
> authenticating. It looks nice in presentations, but after a while in
> production, it can make the system unusable. I had to remove similar char=
ts
> from the homepage of a system because after a year of work, they were
> taking a minute to load.
>
>
>  On Saturday, January 31, 2026 at 08:30:33 AM GMT-5, yudhi s <
> learnerdatabase99@gmail.com> wrote:
>  Hello Experts,
>  We have a "Select" query which is using three to five main transaction
> tables (txn_tbl, txn_status, txn_decision, txn_sale, ath) holding ~2milli=
on
> rows in each of them(which is going to increase to have ~50-100million in
> future) and others(6-7) tables out of which some are master and some othe=
r
> small tables.
>
> When we are running this query , and it's taking ~2-3seconds , however
> when we hit this query from 10-15 session at same time its causing CPU
> spike up to ~50-60% for the DB instance and this is increasing and touchi=
ng
> 90% when we are increasing the hits further to 40-50 times concurrently.
>
> This query is going to be called in the first page of an UI screen and is
> supposed to show the latest 1000 rows based on a certain transaction date=
.
> This query is supposed to allow thousands of users to hit this same query
> at the first landing page at the same time.
>
> Its postgres version 17.  The instance has 2-VCPU and 16GB RAM.
>
> I have the following questions.
>
> 1)Why is this query causing a high cpu spike ,if there is any way in
> postgres to understand what part/line of the query is contributing to the
> high cpu time?
> 2)How can we tune this query to further reduce response time and mainly
> CPU consumption ? Is any additional index or anything will make this plan
> better further?
> 3) Is there any guidance or best practices exists , to create/design top
> N-queries for such UI scenarios where performance is an important factor?
> 4)And based on the CPU core and memory , is there any calculation by usin=
g
> which , we can say that this machine can support a maximum N number of
> concurrent queries of such type beyond which we need more cpu cores
> machines?
> Below is the query and its current plan:-
> https://gist.github.com/databasetech0073/6688701431dc4bf4eaab8d345c1dc65f
> RegardsYudhi
>
>
>
As folks suggested , adding an index on "tran_date" and combining the CTE
to two, and making the data type equal for the "ent_id" has helped reduce
the response to a large extent. Now I am trying to see if we can reduce any
further. As most of the time(100-20=3D~80ms) is now on materialize loop whi=
ch
is happening 43K times.

Also thinking if adding "txn_tbl_type_nm" column to the index i.e.
composite index on (tran_date,txn_tbl_type_nm) will be advisable , in cases
where , ~500K rows will be filtered  by the *txn_tbl_type_nm *filter
criteria (currently its just 17 rows getting filtered though for this case)=
.

https://gist.github.com/databasetech0073/558377c1939a9291e7b72b1cbac7c9f9

-> Nested Loop (cost=3D263.20..1680202.56 rows=3D483106 width=3D20) (actual
time=3D6.421..111.220 rows=3D1000 loops=3D1)
Buffers: shared hit=3D6168
-> Nested Loop (cost=3D262.77..1342550.91 rows=3D579149 width=3D20) (*actua=
l
time=3D6.406..107.946* rows=3D1049 loops=3D1)
Join Filter: (df.ent_id =3D m.ent_id)
Rows Removed by Join Filter: 514648
Buffers: shared hit=3D1972
-> Index Scan Backward using txn_tbl_due_dt_idx on txn_tbl df
(cost=3D0.43..115879.87 rows=3D1419195 width=3D20) (*actual time=3D0.019..2=
0.377*
rows=3D43727 loops=3D1)
*Filter: ((txn_tbl_type_nm)::text =3D ANY ('{TYPE1,TYPE2,TYPE3}'::text[]))*
*Rows Removed by Filter: 17*
Buffers: shared hit=3D1839
-> Materialize (cost=3D262.35..364.01 rows=3D58 width=3D8) (actual
time=3D0.000..0.001 rows=3D12 loops=3D43727)
Buffers: shared hit=3D133


Regards
Yudhi

--000000000000fa0cd6064a0f8f06
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><div class=3D"gmail_quote gmail=
_quote_container"><div dir=3D"ltr" class=3D"gmail_attr">On Wed, Feb 4, 2026=
 at 9:18=E2=80=AFPM &lt;<a href=3D"mailto:felix.quintgz@yahoo.com">felix.qu=
intgz@yahoo.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" s=
tyle=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);pad=
ding-left:1ex"><br>
Have you tried adding an index to txn_tbl.txn_type? <br>
And a vacuum on all tables? It seems the visibility map is outdated.<br>
<br>
I&#39;m using <a href=3D"https://explain.dalibo.com" rel=3D"noreferrer" tar=
get=3D"_blank">https://explain.dalibo.com</a> to view the plan visually; it=
&#39;s more convenient.<br>
<br>
You could use the option to periodically save the results of queries with c=
ommon filters to another table, and then retrieve the results from that tab=
le when a user performs a query with their own filters.<br>
You should also store the user&#39;s query results somewhere for a while to=
 prevent excessive database access.<br>
<br>
I imagine this is some kind of dashboard that each user is taken to after a=
uthenticating. It looks nice in presentations, but after a while in product=
ion, it can make the system unusable. I had to remove similar charts from t=
he homepage of a system because after a year of work, they were taking a mi=
nute to load.<br>
<br>
<br>
=C2=A0On Saturday, January 31, 2026 at 08:30:33 AM GMT-5, yudhi s &lt;<a hr=
ef=3D"mailto:learnerdatabase99@gmail.com" target=3D"_blank">learnerdatabase=
99@gmail.com</a>&gt; wrote:<br>
=C2=A0Hello Experts,<br>
=C2=A0We have a &quot;Select&quot; query which is using three to five main =
transaction tables (txn_tbl, txn_status, txn_decision, txn_sale, ath) holdi=
ng ~2million rows in each of them(which is going to increase to have ~50-10=
0million in future) and others(6-7) tables out of which some are master and=
 some other small tables.<br>
<br>
When we are running this query , and it&#39;s taking ~2-3seconds , however =
when we hit this query from 10-15 session at same time its causing CPU spik=
e up to ~50-60% for the DB instance and this is increasing and touching 90%=
 when we are increasing the hits further to 40-50 times concurrently.<br>
<br>
This query is going to be called in the first page of an UI screen and is s=
upposed to show the latest 1000 rows based on a certain transaction date. T=
his query is supposed to allow thousands of users to hit this same query at=
 the first landing page at the same time.<br>
<br>
Its postgres version 17.=C2=A0 The instance has 2-VCPU and 16GB RAM.<br>
<br>
I have the following=C2=A0questions.<br>
<br>
1)Why is this query causing a high cpu spike ,if there is any way in postgr=
es to understand what part/line of the query is contributing to the high cp=
u time?<br>
2)How can we tune this query to further reduce response time and mainly CPU=
 consumption ? Is any additional index or anything will make this plan bett=
er further?<br>
3) Is there any guidance or best practices exists , to create/design top N-=
queries for such UI scenarios where performance is an important factor?<br>
4)And based on the CPU core and memory , is there any calculation by using =
which , we can say that this machine can support a maximum N number of conc=
urrent queries of such type beyond which we need more cpu cores machines?<b=
r>
Below is the query and its current plan:-<a href=3D"https://gist.github.com=
/databasetech0073/6688701431dc4bf4eaab8d345c1dc65f" rel=3D"noreferrer" targ=
et=3D"_blank">https://gist.github.com/databasetech0073/6688701431dc4bf4eaab=
8d345c1dc65f</a><br>
RegardsYudhi<br>
<br>
<br></blockquote><div><br></div><div>As folks suggested , adding an index o=
n &quot;tran_date&quot; and combining the CTE to two, and making the data t=
ype equal for the &quot;ent_id&quot; has helped reduce the response to a la=
rge extent. Now I am trying to see if we can reduce any further. As most of=
 the time(100-20=3D~80ms) is now on materialize loop which is happening 43K=
 times.</div><div><br></div><div>Also thinking if adding &quot;txn_tbl_type=
_nm&quot; column to the index i.e. composite index on (tran_date,txn_tbl_ty=
pe_nm) will be advisable , in cases where , ~500K rows will be filtered=C2=
=A0 by the=C2=A0<b style=3D"color:rgb(31,35,40);font-family:&quot;Monaspace=
 Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Consolas,=
&quot;Liberation Mono&quot;,monospace;font-size:12px;white-space:pre">txn_t=
bl_type_nm </b><span style=3D"color:rgb(31,35,40);font-family:&quot;Monaspa=
ce Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Consola=
s,&quot;Liberation Mono&quot;,monospace;font-size:12px;white-space:pre">fil=
ter criteria (currently its just 17 rows getting filtered though for this c=
ase).</span></div><div><br></div><div><a href=3D"https://gist.github.com/da=
tabasetech0073/558377c1939a9291e7b72b1cbac7c9f9">https://gist.github.com/da=
tabasetech0073/558377c1939a9291e7b72b1cbac7c9f9</a></div><div><br></div><di=
v><table class=3D"gmail-highlight gmail-tab-size gmail-js-file-line-contain=
er" style=3D"border-spacing:0px;border-collapse:collapse;color:rgb(31,35,40=
);font-family:-apple-system,BlinkMacSystemFont,&quot;Segoe UI&quot;,&quot;N=
oto Sans&quot;,Helvetica,Arial,sans-serif,&quot;Apple Color Emoji&quot;,&qu=
ot;Segoe UI Emoji&quot;;font-size:14px"><tbody style=3D"box-sizing:border-b=
ox"><tr style=3D"box-sizing:border-box;background-color:rgba(0,0,0,0)"><td =
id=3D"gmail-file-gistfile1-txt-LC28" class=3D"gmail-blob-code gmail-blob-co=
de-inner gmail-js-file-line" style=3D"box-sizing:border-box;padding:0px 10p=
x;line-height:20px;vertical-align:top;overflow:visible;font-family:&quot;Mo=
naspace Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Co=
nsolas,&quot;Liberation Mono&quot;,monospace;font-size:12px;white-space:pre=
">-&gt;  Nested Loop  (cost=3D263.20..1680202.56 rows=3D483106 width=3D20) =
(actual time=3D6.421..111.220 rows=3D1000 loops=3D1)</td></tr><tr style=3D"=
box-sizing:border-box"><td id=3D"gmail-file-gistfile1-txt-LC29" class=3D"gm=
ail-blob-code gmail-blob-code-inner gmail-js-file-line" style=3D"box-sizing=
:border-box;padding:0px 10px;line-height:20px;vertical-align:top;overflow:v=
isible;font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&=
quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospace;fon=
t-size:12px;white-space:pre">        Buffers: shared hit=3D6168</td></tr><t=
r style=3D"box-sizing:border-box;background-color:rgba(0,0,0,0)"><td id=3D"=
gmail-file-gistfile1-txt-LC30" class=3D"gmail-blob-code gmail-blob-code-inn=
er gmail-js-file-line" style=3D"box-sizing:border-box;padding:0px 10px;line=
-height:20px;vertical-align:top;overflow:visible;font-family:&quot;Monaspac=
e Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Consolas=
,&quot;Liberation Mono&quot;,monospace;font-size:12px;white-space:pre">    =
    -&gt;  Nested Loop  (cost=3D262.77..1342550.91 rows=3D579149 width=3D20=
) (<b>actual time=3D6.406..107.946</b> rows=3D1049 loops=3D1)</td></tr><tr =
style=3D"box-sizing:border-box"><td id=3D"gmail-file-gistfile1-txt-LC31" cl=
ass=3D"gmail-blob-code gmail-blob-code-inner gmail-js-file-line" style=3D"b=
ox-sizing:border-box;padding:0px 10px;line-height:20px;vertical-align:top;o=
verflow:visible;font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-=
Regular,&quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,mono=
space;font-size:12px;white-space:pre">              Join Filter: (df.ent_id=
 =3D m.ent_id)</td></tr><tr style=3D"box-sizing:border-box;background-color=
:rgba(0,0,0,0)"><td id=3D"gmail-file-gistfile1-txt-LC32" class=3D"gmail-blo=
b-code gmail-blob-code-inner gmail-js-file-line" style=3D"box-sizing:border=
-box;padding:0px 10px;line-height:20px;vertical-align:top;overflow:visible;=
font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF=
 Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospace;font-size:=
12px;white-space:pre">              Rows Removed by Join Filter: 514648</td=
></tr><tr style=3D"box-sizing:border-box"><td id=3D"gmail-file-gistfile1-tx=
t-LC33" class=3D"gmail-blob-code gmail-blob-code-inner gmail-js-file-line" =
style=3D"box-sizing:border-box;padding:0px 10px;line-height:20px;vertical-a=
lign:top;overflow:visible;font-family:&quot;Monaspace Neon&quot;,ui-monospa=
ce,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&=
quot;,monospace;font-size:12px;white-space:pre">              Buffers: shar=
ed hit=3D1972</td></tr><tr style=3D"box-sizing:border-box;background-color:=
rgba(0,0,0,0)"><td id=3D"gmail-file-gistfile1-txt-LC34" class=3D"gmail-blob=
-code gmail-blob-code-inner gmail-js-file-line" style=3D"box-sizing:border-=
box;padding:0px 10px;line-height:20px;vertical-align:top;overflow:visible;f=
ont-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF =
Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospace;font-size:1=
2px;white-space:pre">              -&gt;  Index Scan Backward using txn_tbl=
_due_dt_idx on txn_tbl df  (cost=3D0.43..115879.87 rows=3D1419195 width=3D2=
0) (<b>actual time=3D0.019..20.377</b> rows=3D43727 loops=3D1)</td></tr><tr=
 style=3D"box-sizing:border-box"><td id=3D"gmail-file-gistfile1-txt-LC35" c=
lass=3D"gmail-blob-code gmail-blob-code-inner gmail-js-file-line" style=3D"=
box-sizing:border-box;padding:0px 10px;line-height:20px;vertical-align:top;=
overflow:visible;font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono=
-Regular,&quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,mon=
ospace;font-size:12px;white-space:pre">                    <b>Filter: ((txn=
_tbl_type_nm)::text =3D ANY (&#39;{TYPE1,TYPE2,TYPE3}&#39;::text[]))</b></t=
d></tr><tr style=3D"box-sizing:border-box;background-color:rgba(0,0,0,0)"><=
td id=3D"gmail-file-gistfile1-txt-LC36" class=3D"gmail-blob-code gmail-blob=
-code-inner gmail-js-file-line" style=3D"box-sizing:border-box;padding:0px =
10px;line-height:20px;vertical-align:top;overflow:visible;font-family:&quot=
;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&quot;,Menlo=
,Consolas,&quot;Liberation Mono&quot;,monospace;font-size:12px;white-space:=
pre">                    <b>Rows Removed by Filter: 17</b></td></tr><tr sty=
le=3D"box-sizing:border-box"><td id=3D"gmail-file-gistfile1-txt-LC37" class=
=3D"gmail-blob-code gmail-blob-code-inner gmail-js-file-line" style=3D"box-=
sizing:border-box;padding:0px 10px;line-height:20px;vertical-align:top;over=
flow:visible;font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Reg=
ular,&quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospa=
ce;font-size:12px;white-space:pre">                    Buffers: shared hit=
=3D1839</td></tr><tr style=3D"box-sizing:border-box;background-color:rgba(0=
,0,0,0)"><td id=3D"gmail-file-gistfile1-txt-LC38" class=3D"gmail-blob-code =
gmail-blob-code-inner gmail-js-file-line" style=3D"box-sizing:border-box;pa=
dding:0px 10px;line-height:20px;vertical-align:top;overflow:visible;font-fa=
mily:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&quot;SF Mono&q=
uot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospace;font-size:12px;wh=
ite-space:pre">              -&gt;  Materialize  (cost=3D262.35..364.01 row=
s=3D58 width=3D8) (actual time=3D0.000..0.001 rows=3D12 loops=3D43727)</td>=
</tr><tr style=3D"box-sizing:border-box"><td id=3D"gmail-file-gistfile1-txt=
-LC39" class=3D"gmail-blob-code gmail-blob-code-inner gmail-js-file-line" s=
tyle=3D"box-sizing:border-box;padding:0px 10px;line-height:20px;vertical-al=
ign:top;overflow:visible;font-family:&quot;Monaspace Neon&quot;,ui-monospac=
e,SFMono-Regular,&quot;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&q=
uot;,monospace;font-size:12px;white-space:pre">                    Buffers:=
 shared hit=3D133</td></tr><tr style=3D"box-sizing:border-box;background-co=
lor:rgba(0,0,0,0)"><td id=3D"gmail-file-gistfile1-txt-LC40" class=3D"gmail-=
blob-code gmail-blob-code-inner gmail-js-file-line" style=3D"box-sizing:bor=
der-box;padding:0px 10px;line-height:20px;vertical-align:top;overflow:visib=
le;font-family:&quot;Monaspace Neon&quot;,ui-monospace,SFMono-Regular,&quot=
;SF Mono&quot;,Menlo,Consolas,&quot;Liberation Mono&quot;,monospace;font-si=
ze:12px;white-space:pre"></td></tr></tbody></table><br class=3D"gmail-Apple=
-interchange-newline"></div><div><br></div><div><br></div><div>Regards</div=
><div>Yudhi</div></div></div>

--000000000000fa0cd6064a0f8f06--