MIME-Version: 1.0
References: <CAEzWdqfUQuKtpqGAwf86dwkjPq2Kkeyj6Pw31GXr92YC8M2Y5g@mail.gmail.com>
 <CAB+=1TUKXYy9yXm+GFQ4qV=fupvAyVsTY1G3deMpz6zBk7xYxA@mail.gmail.com>
In-Reply-To: <CAB+=1TUKXYy9yXm+GFQ4qV=fupvAyVsTY1G3deMpz6zBk7xYxA@mail.gmail.com>
From: yudhi s <learnerdatabase99@gmail.com>
Date: Fri, 27 Sep 2024 12:06:58 +0530
Message-ID: <CAEzWdqfHyAa7=WNyJLiRCXGzJvEjkFP1MHG2ontwCcbx7TdUBQ@mail.gmail.com>
Subject: Re: Suggestion for memory parameters
To: veem v <veema0000@gmail.com>
Cc: pgsql-general <pgsql-general@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="000000000000b09d9f0623141560"
Archived-At: <https://www.postgresql.org/message-id/CAEzWdqfHyAa7%3DWNyJLiRCXGzJvEjkFP1MHG2ontwCcbx7TdUBQ%40mail.gmail.com>
Precedence: bulk

--000000000000b09d9f0623141560
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Fri, Sep 27, 2024 at 9:11=E2=80=AFAM veem v <veema0000@gmail.com> wrote:

>
> On Thu, 26 Sept 2024 at 16:33, yudhi s <learnerdatabase99@gmail.com>
> wrote:
>
>> Hello All,
>>
>> In a RDS postgres we are seeing some select queries when running and
>> doing sorting on 50 million rows(as its having order by clause in it) , =
the
>> significant portion of wait event is showing as "IO:BufFileWrite" and it
>> runs for ~20minutes+.
>>
>> Going through the document in the link below, it states we should monito=
r
>> the "FreeLocalStorage" metric and when monitoring that, I see it showing=
 up
>> to ~535GB as the max limit and when these queries run this goes down til=
l
>> 100GB. Note-  (it's a R7g8xl instance)
>>
>>
>> https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/apg-waits.i=
obuffile.html
>>
>> We were thinking of bumping up the work_mem to a higher value in databas=
e
>> level , which is currently having size 4MB default. But we will also hav=
e
>> ~100 sessions running at same time and majority were from other
>> applications which execute other single row "insert" queries and I hope
>> that will not need high "work_mem" . And setting it at database level wi=
ll
>> consume 100 times that set work_mem value. So how to handle this situati=
on?
>>  Or
>>  Is it fine to let it use "FreeLocalStorage" unless it goes till zero?
>>
>> Also I am confused between the local storage (which is showing as 535GB)
>> vs the memory/RAM which is 256GB for this instance class with ~128TB max
>> storage space restriction, how are these storage different, (mainly the
>> 535GB space which it's showing vs the 128TB storage space restriction)?
>> Appreciate your guidance.
>>
>> select query looks something as below with no Joins but just single tabl=
e
>> fetch:-
>>
>> Select....
>> from <table_name>
>> where
>> order by column1, column2 LIMIT $b1 OFFSET $B2 ;
>>
>>
>>
> My 2 cents
> I think you should set the work_mem on specific session level , if your
> sorting queries are only from specific handful of sessions, as because
> setting it up at database level will eat up your most of RAM(which you sa=
id
> is 256GB) and you said 100+ sessions getting spawned at any point in time=
.
>


Thank you.
When I checked pg_stat_statements for this query , and divided the
temp_blk_read+temp_blk_written with the "calls", it came as ~1million which
means ~7GB. So does that mean ~7GB of work_mem should be allocated for this
query?

--000000000000b09d9f0623141560
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div dir=3D"ltr"><br></div><br><div class=3D"gmail_quote">=
<div dir=3D"ltr" class=3D"gmail_attr">On Fri, Sep 27, 2024 at 9:11=E2=80=AF=
AM veem v &lt;<a href=3D"mailto:veema0000@gmail.com">veema0000@gmail.com</a=
>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px=
 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><di=
v dir=3D"ltr"><div dir=3D"ltr"><br></div><div class=3D"gmail_quote"><div di=
r=3D"ltr" class=3D"gmail_attr">On Thu, 26 Sept 2024 at 16:33, yudhi s &lt;<=
a href=3D"mailto:learnerdatabase99@gmail.com" target=3D"_blank">learnerdata=
base99@gmail.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" =
style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);pa=
dding-left:1ex"><div dir=3D"ltr">Hello All,<br><br>In a RDS postgres we are=
 seeing some select queries when running and doing sorting on 50 million ro=
ws(as its having order by clause in it) , the significant portion of wait e=
vent is showing as &quot;IO:BufFileWrite&quot; and it runs for ~20minutes+.=
 =C2=A0<br><br>Going through the document in the link below, it states we s=
hould monitor the &quot;FreeLocalStorage&quot; metric and when monitoring t=
hat, I see it showing up to ~535GB as the max limit and when these queries =
run this goes down till 100GB. Note-=C2=A0

(it&#39;s a R7g8xl instance)<br><br><a href=3D"https://docs.aws.amazon.com/=
AmazonRDS/latest/AuroraUserGuide/apg-waits.iobuffile.html" target=3D"_blank=
">https://docs.aws.amazon.com/AmazonRDS/latest/AuroraUserGuide/apg-waits.io=
buffile.html</a><br><br>We were thinking of bumping up the work_mem to a hi=
gher value in database level , which is currently having size 4MB default. =
But we will also have ~100 sessions running at same time and majority were =
from other applications which execute other single row &quot;insert&quot; q=
ueries and I hope that will not need high &quot;work_mem&quot; . And settin=
g it at database level will consume 100 times that set work_mem value. So h=
ow to handle this situation?<br>=C2=A0Or<br>=C2=A0Is it fine to let it use =
&quot;FreeLocalStorage&quot; unless it goes till zero?<br><br>Also I am con=
fused between the local storage (which is showing as 535GB) vs the memory/R=
AM which is 256GB for this instance class with ~128TB max storage space res=
triction, how are these storage different, (mainly the 535GB space which it=
&#39;s showing vs the 128TB storage space restriction)?=C2=A0 Appreciate=C2=
=A0your guidance.<br><br>select query looks something as below with no Join=
s but just single table fetch:-<div><br>Select....<br>from &lt;table_name&g=
t;<br>where <br>order by column1, column2 LIMIT $b1 OFFSET $B2 ;<br><div><b=
r></div><div><br></div></div></div></blockquote><div><br></div><div>My 2 ce=
nts=C2=A0</div><div>I think you should=C2=A0set the work_mem on specific se=
ssion level , if your sorting queries are only from specific handful of ses=
sions, as because setting=C2=A0it up at database level will eat up your mos=
t of RAM(which you said is 256GB) and you said 100+ sessions getting spawne=
d at any point in time.</div></div></div></blockquote><div><br></div><div><=
br></div><div>Thank you.</div><div>When I checked pg_stat_statements for th=
is query , and divided the temp_blk_read+temp_blk_written with the &quot;ca=
lls&quot;, it came as ~1million which means ~7GB. So does that mean ~7GB of=
 work_mem should be allocated for this query?<br></div></div></div>

--000000000000b09d9f0623141560--