MIME-Version: 1.0
References: <CAL93h0GtZ5H+X+8CRDOkcN1w0Bha2w3-57CqKSNFC2xrTNhEhA@mail.gmail.com>
 <cec733623111083dbd4d35ddc3950be9c02968cf.camel@cybertec.at>
In-Reply-To: <cec733623111083dbd4d35ddc3950be9c02968cf.camel@cybertec.at>
From: Alexandru Lazarev <alexandru.lazarev@gmail.com>
Date: Mon, 18 Nov 2024 18:11:15 +0200
Message-ID: <CAL93h0GSqz99bJmdxSZgSsqLLj_=HkpUvfHSxSbEBe1Zep-+ag@mail.gmail.com>
Subject: Re: work_mem RAM Accounting in PostgreSQL
To: Laurenz Albe <laurenz.albe@cybertec.at>
Cc: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="0000000000002d6f800627322b72"
Archived-At: <https://www.postgresql.org/message-id/CAL93h0GSqz99bJmdxSZgSsqLLj_%3DHkpUvfHSxSbEBe1Zep-%2Bag%40mail.gmail.com>
Precedence: bulk

--0000000000002d6f800627322b72
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi @Laurenz Albe <laurenz.albe@cybertec.at> & PG Community,
I highly appreciate your response, but I have some additional questions
(inline).

On Thu, Nov 14, 2024 at 4:40 PM Laurenz Albe <laurenz.albe@cybertec.at> wrote:

> On Wed, 2024-11-13 at 21:09 +0200, Alexandru Lazarev wrote:
> > I have some questions regarding how the "work_mem" parameter affects the overall RAM
> > usage of PostgreSQL processes within a physical host or container.
> >
> > Each backend process during SQL execution may allocate N * "work_mem" simultaneously.
> > For example, if "work_mem" is set to 32MB and N=5 (i.e. 5 simultaneous and/or sequential
> > ORDER and hash operations), and the initial RAM usage (RSS - Resident Set Size) of the
> > backend is 10MB, I would expect the backend process to use 160MB (32MB * 5) + 10MB,
> > resulting in a total RAM usage of 170MB.
>
> The limit for a hash is hash_mem_multiplier * work_mem.
>

Yes, I know; I already accounted for it in "N" above.
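
To make explicit how I am counting "N", here is a rough sketch in Python. It
is only my own back-of-the-envelope model, not anything taken from the
PostgreSQL sources: the function name and its parameters are made up for
illustration, and the default of 2.0 for hash_mem_multiplier is an assumption
(check SHOW hash_mem_multiplier on your own version).

```python
def worst_case_backend_mb(work_mem_mb, sort_nodes, hash_nodes,
                          hash_mem_multiplier=2.0, baseline_rss_mb=10):
    """Rough upper bound (in MB) on one backend's memory during a query.

    Hypothetical helper for illustration only. It assumes every sort node
    uses up to work_mem and every hash node uses up to
    hash_mem_multiplier * work_mem, all at the same time.
    """
    sort_mb = sort_nodes * work_mem_mb
    hash_mb = hash_nodes * hash_mem_multiplier * work_mem_mb
    return baseline_rss_mb + sort_mb + hash_mb


# The example from my first mail: 5 operations at 32MB plus 10MB baseline.
print(worst_case_backend_mb(32, sort_nodes=5, hash_nodes=0))

# Same 5 operations, but 2 of them are hashes with the assumed multiplier.
print(worst_case_backend_mb(32, sort_nodes=3, hash_nodes=2))
```

With no hash nodes this gives the 170MB from my example; once some of the
five operations are hashes, the bound grows by the multiplier.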


>
> > My questions are as follows:
> >
> > 1. What happens to the allocated "work_mem" after the execution of query nodes? Are
> >    these memory allocations freed?
>
> Yes.
>
> > 2. If they are freed, do they remain in the RSS of the PostgreSQL backend?
>
> They may, because the C library can choose not to actually free all the memory,
> but retain some to serve future malloc() requests more efficiently.
>

This part is important for me to understand (my apologies, I am not a C
programmer, and reading the PG sources is difficult for me :)): I wanted to
know whether Postgres itself does some kind of pooling of allocated memory
here. So, once memory is freed, it is up to the C library and the underlying
OS whether it is returned to the OS or stays reserved in some way - did I get
that correctly? If so, then most probably this "reserved" memory would be
reclaimed under memory pressure (when other backend processes greedily try
to allocate big chunks of memory)?
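
For anyone who wants to check this empirically on Linux, below is a small
Python sketch of how I would watch a backend's RSS before and after a heavy
query. It relies on the VmRSS, RssAnon, RssFile and RssShmem fields of
/proc/<pid>/status; the function names and the sample text are my own and
purely illustrative (the numbers in SAMPLE are invented, not measurements).

```python
def parse_rss_kb(status_text):
    """Extract the RSS-related fields (in kB) from /proc/<pid>/status text."""
    wanted = ("VmRSS", "RssAnon", "RssFile", "RssShmem")
    result = {}
    for line in status_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key in wanted:
            # The value part looks like "   204800 kB"; keep the number only.
            result[key] = int(value.split()[0])
    return result


def backend_rss_kb(pid):
    """Read the fields for a live process (Linux only)."""
    with open(f"/proc/{pid}/status") as f:
        return parse_rss_kb(f.read())


# Invented sample in the /proc/<pid>/status layout, for demonstration only.
SAMPLE = """Name:\tpostgres
VmRSS:\t  204800 kB
RssAnon:\t  180000 kB
RssFile:\t   20000 kB
RssShmem:\t    4800 kB
"""

print(parse_rss_kb(SAMPLE))
```

The pid can be taken from SELECT pg_backend_pid(); in the session being
observed. As far as I understand, RssAnon is the field that matters for my
question, since RssShmem should mostly reflect shared_buffers pages the
backend has touched.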


>
> > 3. From various sources, I understand that these allocations are freed after
> >    each node execution due to memory contexts, but they might remain in some sort
> >    of backend memory pool for future reuse. Is this correct?
>
> I am not sure what you mean, but perhaps what I wrote above.
>

Yes, I repeated myself; you already covered it above.


>
> > 4. If so, will this memory be accounted for as used RAM on my Linux/Container
> >    system after the backend returns to an idle state (e.g., connection pooling)?
>
> Certainly.
>
> > Additionally: If the above is true, and my PostgreSQL host or container is limited
> > to 16GB of RAM, what would happen if I have 100 pooled connections, each gradually
> > allocating those 160MB? Will this memory be reclaimed (if I understood it correctly
> > as a kind of inactive anon mem), or will the OOM Killer be triggered at some point
> > (because it is real allocated memory)?
>
> The backends won't keep that much memory allocated, so you need not worry.
>

Let me reformulate the use case a bit differently - I would highly
appreciate any input from the community:
1) Let's say 60 connections ran some memory-intensive operations, each one
allocating up to 200MB of work_mem, and then finished and returned to the
'idle' state.
2) After that, the remaining 40 connections start running "work_mem"-consuming
operations in parallel, each one allocating up to 300MB. Will the
N*"work_mem" RAM allocated in step #1 be reclaimed during step #2, when
multiple backends need to aggressively allocate K*"work_mem" of memory?
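
To show why I am worried, here is the arithmetic for the two extreme cases
as a small Python sketch. It is deliberately simplistic and entirely my own
model: it ignores shared_buffers, per-backend baseline memory and everything
else on the host, and the names are hypothetical. The 16GB limit is the one
from my earlier mail.

```python
RAM_LIMIT_MB = 16 * 1024  # 16GB host/container limit


def peak_mb(idle_conns, idle_retained_mb, active_conns, active_mb):
    """Total memory if idle backends still hold idle_retained_mb each."""
    return idle_conns * idle_retained_mb + active_conns * active_mb


# Extreme 1: the 60 idle backends give nothing back to the OS.
nothing_returned = peak_mb(60, 200, 40, 300)  # 12000 + 12000 = 24000

# Extreme 2: the 60 idle backends give everything back.
all_returned = peak_mb(60, 0, 40, 300)  # 0 + 12000 = 12000

# How much each idle backend could retain before the limit is reached
# (roughly 73MB per backend with these numbers).
break_even_mb = (RAM_LIMIT_MB - 40 * 300) / 60

for label, total in (("nothing returned", nothing_returned),
                     ("all returned", all_returned)):
    verdict = "fits" if total <= RAM_LIMIT_MB else "exceeds the limit"
    print(f"{label}: {total} MB of {RAM_LIMIT_MB} MB -> {verdict}")
print(f"break-even retained memory per idle backend: {break_even_mb:.1f} MB")
```

So in this simplified picture the answer depends entirely on how much of the
200MB each idle backend really keeps in its RSS, which is what I am trying
to find out.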

Thank you.


>
> 100 connections are a lot.  With efficient pooling, you could have fewer connections
> and use your resources more efficiently.
>
> Yours,
> Laurenz Albe
>


--0000000000002d6f800627322b72--