MIME-Version: 1.0
References: <CAG-eXHJ+KbQ8_k-jKSGZU9V6HkLKU2Jqz7nYMYGhHuC-Zqm7qQ@mail.gmail.com>
 <CAGsyd8WqPEgoAkNO0Q7rpQpOWOZ-Z6wCM7xh5d6nXCxLH_GM_A@mail.gmail.com>
 <CAFL4M8EmboE4wXBHe2EMFcShxUAxXgWFa4TT-iVD2hJcHumetg@mail.gmail.com>
 <CAGsyd8X7U07UK8hjapwYBfbtK0KnMSxLtH6BFaxe1_i2=BR-+A@mail.gmail.com>
 <CAFL4M8FuS1ivNARaNUjoSgdjec+KH0DLXSqVa11uy1Nscup2+w@mail.gmail.com> <CAGsyd8VAdS+aByHkT=mi5sAgVumQgG4kBiY3kRw49NWWTtYGWA@mail.gmail.com>
In-Reply-To: <CAGsyd8VAdS+aByHkT=mi5sAgVumQgG4kBiY3kRw49NWWTtYGWA@mail.gmail.com>
From: ravi k <ravisql09@gmail.com>
Date: Sat, 9 Nov 2024 17:32:54 +0530
Message-ID: <CAFL4M8HvFUz56ezX-C2QGUONEafwObe3LWeu+MWUby=i8Teh4A@mail.gmail.com>
Subject: Re: Performance Issue with Hash Partition Query Execution in
 PostgreSQL 16
To: David Mullineux <dmullx@gmail.com>
Cc: Ramakrishna m <ram.pgdb@gmail.com>, pgsql-general <pgsql-general@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="0000000000007eb336062679a677"
Archived-At: <https://www.postgresql.org/message-id/CAFL4M8HvFUz56ezX-C2QGUONEafwObe3LWeu%2BMWUby%3Di8Teh4A%40mail.gmail.com>
Precedence: bulk

--0000000000007eb336062679a677
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Thanks for the advice!

I am planing to set session level!

but before that one more observations noticed i.e One more table has same
issue, which is having similar like hash partitions.

And I scheduled manual analyze for all parent hash tables(thus all stats
will update together).

After this change I didn't noticed the issue, not sure does this addressed
issue or not, just monitoring if this not works will set custom plan in
session level.

I have seen in SQL server  parameter sniffing regularly but in postgres I
never  experienced. I am still wondering does this sniffing or not as from
stats I didn't notice any sequence scan.

Best,


On Sat, 9 Nov, 2024, 3:40=E2=80=AFpm David Mullineux, <dmullx@gmail.com> wr=
ote:

> Thanks for correction. At this point I would be trying to modify
> plan_cache_mode
> for the session which uses the bond variable. alter it so that
> plan_cache_mode=3Dforce_custom_plan
> One hypothesis is that, a bad plan got cached for that SQL pattern.
> Obviously, when you run it *manually* you are always getting a *custom*
> plan as it's not a prepared statement.
>
>
>
>
> On Sat, 9 Nov 2024, 03:46 ravi k, <ravisql09@gmail.com> wrote:
>
>> Sorry, it was typo. Bind variable is bigint only.
>>
>> Thanks
>>
>> On Fri, 8 Nov, 2024, 7:09=E2=80=AFpm David Mullineux, <dmullx@gmail.com>=
 wrote:
>>
>>> Just spotted a potential problem. The indexed column is a bigint. Are
>>> you, in your prepared statement passing a string or a big int ?
>>> I notice your plan is doing an implicit type conversion when you run it
>>> manually.
>>> Sometimes the wrong type will make it not use the index.
>>>
>>> On Fri, 8 Nov 2024, 03:07 ravi k, <ravisql09@gmail.com> wrote:
>>>
>>>> Hi ,
>>>>
>>>> Thanks for the suggestions.
>>>>
>>>> Two more observations:
>>>>
>>>> 1) no sequence scan noticed from pg_stat_user_tables ( hope stats are
>>>> accurate in postgres 16) if parameter sniffing happens the possibility=
 of
>>>> going to  sequence scan is more right.
>>>>
>>>> 2) no blockings or IO issue during the time.
>>>>
>>>> 3) even with limit clause if touch all partitions also it could have
>>>> been completed in milliseconds as this is just one record.
>>>>
>>>> 4) auto_explain in prod we cannot enable as this is expensive and with
>>>> high TPS we may face latency issues and lower environment this issue c=
annot
>>>> be reproduced,( this is happening out of Million one case)
>>>>
>>>> This looks puzzle to us, just in case anyone experianced pls share you=
r
>>>> experience.
>>>>
>>>> Regards,
>>>> Ravi
>>>>
>>>> On Thu, 7 Nov, 2024, 3:41=E2=80=AFam David Mullineux, <dmullx@gmail.co=
m> wrote:
>>>>
>>>>> It might be worth eliminating the use of cached plans here. Is your
>>>>> app using prepared statements at all?
>>>>> Point is that if the optimizer sees the same prepared query , 5 times=
,
>>>>> the  it locks the plan that it found at that time. This is a good tra=
de off
>>>>> as it avoids costly planning-time for repetitive queries. But if you =
are
>>>>> manually querying, the  a custom plan will be generated  anew.
>>>>> A quick analyze of the table should reset the stats and invalidate an=
y
>>>>> cached plans.
>>>>> This may not be your problem  just worth eliminating it from the list
>>>>> of potential causes.
>>>>>
>>>>> On Wed, 6 Nov 2024, 17:14 Ramakrishna m, <ram.pgdb@gmail.com> wrote:
>>>>>
>>>>>> Hi Team,
>>>>>>
>>>>>> One of the queries, which retrieves a single record from a table wit=
h
>>>>>> 16 hash partitions, is taking more than 10 seconds to execute. In co=
ntrast,
>>>>>> when we run the same query manually, it completes within millisecond=
s. This
>>>>>> issue is causing exhaustion of the application pools. Do we have any=
 bugs
>>>>>> in postgrs16 hash partitions? Please find the attached log, table, a=
nd
>>>>>> execution plan.
>>>>>>
>>>>>> size of the each partitions : 300GB
>>>>>> Index Size : 12GB
>>>>>>
>>>>>> Postgres Version : 16.x
>>>>>> Shared Buffers : 75 GB
>>>>>> Effective_cache :  175 GB
>>>>>> Work _mem : 4MB
>>>>>> Max_connections : 3000
>>>>>>
>>>>>> OS  : Ubuntu 22.04
>>>>>> Ram : 384 GB
>>>>>> CPU : 64
>>>>>>
>>>>>> Please let us know if you need any further information or if there
>>>>>> are additional details required.
>>>>>>
>>>>>>
>>>>>> Regards,
>>>>>> Ram.
>>>>>>
>>>>>

--0000000000007eb336062679a677
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"auto">Thanks for the advice!<div dir=3D"auto"><br></div><div di=
r=3D"auto">I am planing to set session level!</div><div dir=3D"auto"><br></=
div><div dir=3D"auto">but before that one more observations noticed i.e One=
 more table has same issue, which is having similar like hash partitions.</=
div><div dir=3D"auto"><br></div><div dir=3D"auto">And I scheduled manual an=
alyze for all parent hash tables(thus all stats will update together).</div=
><div dir=3D"auto"><br></div><div dir=3D"auto">After this change I didn&#39=
;t noticed the issue, not sure does this addressed issue or not, just monit=
oring if this not works will set custom plan in session level.</div><div di=
r=3D"auto"><br></div><div dir=3D"auto">I have seen in SQL server=C2=A0 para=
meter sniffing regularly but in postgres I never=C2=A0 experienced. I am st=
ill wondering does this sniffing or not as from stats I didn&#39;t notice a=
ny sequence scan.</div><div dir=3D"auto"><br></div><div dir=3D"auto">Best,<=
/div><div dir=3D"auto"><br></div><div dir=3D"auto"><br></div><div dir=3D"au=
to"><br></div><div dir=3D"auto"><br></div><div dir=3D"auto"><br></div><div =
dir=3D"auto"><br></div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr=
" class=3D"gmail_attr">On Sat, 9 Nov, 2024, 3:40=E2=80=AFpm David Mullineux=
, &lt;<a href=3D"mailto:dmullx@gmail.com">dmullx@gmail.com</a>&gt; wrote:<b=
r></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border=
-left:1px #ccc solid;padding-left:1ex"><div dir=3D"auto">Thanks for correct=
ion. At this point I would be trying to modify=C2=A0<div dir=3D"auto">plan_=
cache_mode</div><div dir=3D"auto">for the session which uses the bond varia=
ble. alter it so that plan_cache_mode=3D<span style=3D"background-color:rgb=
(248,249,250);font-size:14.4px">force_custom_plan</span><br><span style=3D"=
background-color:rgb(248,249,250);font-size:14.4px">One hypothesis is that,=
 a bad plan got cached for that SQL pattern. Obviously, when you run it <i>=
manually</i> you are always getting a <i>custom</i> plan as it&#39;s not a =
prepared statement.=C2=A0=C2=A0</span></div><div dir=3D"auto"><br></div><di=
v dir=3D"auto"><br></div><div dir=3D"auto"><span style=3D"background-color:=
rgb(248,249,250);font-family:monospace,monospace;font-size:14.4px"><br></sp=
an></div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gma=
il_attr">On Sat, 9 Nov 2024, 03:46 ravi k, &lt;<a href=3D"mailto:ravisql09@=
gmail.com" target=3D"_blank" rel=3D"noreferrer">ravisql09@gmail.com</a>&gt;=
 wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8=
ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"auto">Sorry, it=
 was typo. Bind variable is bigint only.<div dir=3D"auto"><br></div><div di=
r=3D"auto">Thanks=C2=A0</div></div><br><div class=3D"gmail_quote"><div dir=
=3D"ltr" class=3D"gmail_attr">On Fri, 8 Nov, 2024, 7:09=E2=80=AFpm David Mu=
llineux, &lt;<a href=3D"mailto:dmullx@gmail.com" rel=3D"noreferrer noreferr=
er" target=3D"_blank">dmullx@gmail.com</a>&gt; wrote:<br></div><blockquote =
class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid=
;padding-left:1ex"><div dir=3D"auto">Just spotted a potential problem. The =
indexed column is a bigint. Are you, in your prepared statement passing a s=
tring or a big int ?<div dir=3D"auto">I notice your plan is doing an implic=
it type conversion when you run it manually.</div><div dir=3D"auto">Sometim=
es the wrong type will make it not use the index.</div></div><br><div class=
=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Fri, 8 Nov 2024, =
03:07 ravi k, &lt;<a href=3D"mailto:ravisql09@gmail.com" rel=3D"noreferrer =
noreferrer noreferrer" target=3D"_blank">ravisql09@gmail.com</a>&gt; wrote:=
<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bord=
er-left:1px #ccc solid;padding-left:1ex"><div dir=3D"auto">Hi ,<div dir=3D"=
auto"><br></div><div dir=3D"auto">Thanks for the suggestions.</div><div dir=
=3D"auto"><br></div><div dir=3D"auto">Two more observations:</div><div dir=
=3D"auto"><br></div><div dir=3D"auto">1) no sequence scan noticed from pg_s=
tat_user_tables ( hope stats are accurate in postgres 16) if parameter snif=
fing happens the possibility of going to=C2=A0 sequence scan is more right.=
</div><div dir=3D"auto"><br></div><div dir=3D"auto">2) no blockings or IO i=
ssue during the time.</div><div dir=3D"auto"><br></div><div dir=3D"auto">3)=
 even with limit clause if touch all partitions also it could have been com=
pleted in milliseconds as this is just one record.</div><div dir=3D"auto"><=
br></div><div dir=3D"auto">4) auto_explain in prod we cannot enable as this=
 is expensive and with high TPS we may face latency issues and lower enviro=
nment this issue cannot be reproduced,( this is happening out of Million on=
e case)</div><div dir=3D"auto"><br></div><div dir=3D"auto">This looks puzzl=
e to us, just in case anyone experianced pls share your experience.</div><d=
iv dir=3D"auto"><br></div><div dir=3D"auto">Regards,</div><div dir=3D"auto"=
>Ravi</div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"g=
mail_attr">On Thu, 7 Nov, 2024, 3:41=E2=80=AFam David Mullineux, &lt;<a hre=
f=3D"mailto:dmullx@gmail.com" rel=3D"noreferrer noreferrer noreferrer noref=
errer" target=3D"_blank">dmullx@gmail.com</a>&gt; wrote:<br></div><blockquo=
te class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc so=
lid;padding-left:1ex"><div dir=3D"auto">It might be worth eliminating the u=
se of cached plans here. Is your app using prepared statements at all?=C2=
=A0=C2=A0<div dir=3D"auto">Point is that if the optimizer sees the same pre=
pared query , 5 times, the=C2=A0 it locks the plan that it found at that ti=
me. This is a good trade off as it avoids costly planning-time for repetiti=
ve queries. But if you are manually querying, the=C2=A0 a custom plan will =
be generated=C2=A0 anew.</div><div dir=3D"auto">A quick analyze of the tabl=
e should reset the stats and invalidate any cached plans.</div><div dir=3D"=
auto">This may not be your problem=C2=A0 just worth eliminating it from the=
 list of potential causes.</div></div><br><div class=3D"gmail_quote"><div d=
ir=3D"ltr" class=3D"gmail_attr">On Wed, 6 Nov 2024, 17:14 Ramakrishna m, &l=
t;<a href=3D"mailto:ram.pgdb@gmail.com" rel=3D"noreferrer noreferrer norefe=
rrer noreferrer noreferrer" target=3D"_blank">ram.pgdb@gmail.com</a>&gt; wr=
ote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;=
border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>Hi Team,=
</div><div><div><div><div><div><div></div></div></div></div></div><div><div=
><div><div dir=3D"auto"><div><div><p>One of the queries, which retrieves a =
single record from a table with 16 hash partitions, is taking more than 10 =
seconds to execute. In contrast, when we run the same query manually, it co=
mpletes within milliseconds. This issue is causing exhaustion of the applic=
ation pools.=C2=A0Do we have any bugs in postgrs16 hash partitions? Please =
find the attached log, table, and execution plan.=C2=A0</p><p><font face=3D=
"arial, sans-serif">size of the each partitions : 300GB=C2=A0<br>Index Size=
 : 12GB</font></p><p><span style=3D"font-family:arial,sans-serif">Postgres =
Version : 16.x</span><font face=3D"arial, sans-serif"><br></font><span styl=
e=3D"font-family:arial,sans-serif">Shared Buffers : 75 GB</span><font face=
=3D"arial, sans-serif"><br></font><span style=3D"font-family:arial,sans-ser=
if">Effective_cache :=C2=A0 175 GB</span><font face=3D"arial, sans-serif"><=
br></font><span style=3D"font-family:arial,sans-serif">Work _mem : 4MB</spa=
n><font face=3D"arial, sans-serif"><br></font><span style=3D"font-family:ar=
ial,sans-serif">Max_connections : 3000</span><font face=3D"arial, sans-seri=
f"></font></p><p><span style=3D"font-family:arial,sans-serif">OS=C2=A0 :=C2=
=A0Ubuntu 22.04</span><br style=3D"font-family:arial,sans-serif"><span styl=
e=3D"font-family:arial,sans-serif">Ram : 384 GB</span><br style=3D"font-fam=
ily:arial,sans-serif"><span style=3D"font-family:arial,sans-serif">CPU : 64=
</span><font face=3D"arial, sans-serif"></font></p><p>Please let us know if=
 you need any further information or if there are additional details requir=
ed.=C2=A0=C2=A0</p><p><br></p></div></div></div></div></div></div></div><di=
v>Regards,</div><div dir=3D"ltr" class=3D"gmail_signature" data-smartmail=
=3D"gmail_signature"><div dir=3D"ltr"><div>Ram.<br></div></div></div></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>
</blockquote></div>

--0000000000007eb336062679a677--