MIME-Version: 1.0
References: 
 <CAOC+FBXR59QvEvpnOj0d0Om352i=2gP_Mm+b=aeQ4Zo3w-gNdA@mail.gmail.com>
 <CANzqJaDi90pp=cq5kuTnewstkV5E30ToYu2eW-_yTrN0UJkJuw@mail.gmail.com>
 <CAOC+FBWzTatoqRna_tyiEkqcXnu1AvMUx=1hm0qR7=xi+uPJ7w@mail.gmail.com>
In-Reply-To: 
 <CAOC+FBWzTatoqRna_tyiEkqcXnu1AvMUx=1hm0qR7=xi+uPJ7w@mail.gmail.com>
From: Ron Johnson <ronljohnsonjr@gmail.com>
Date: Thu, 12 Sep 2024 22:41:37 -0400
Message-ID: 
 <CANzqJaCAHbn4vvPq-uWA5PJW6Ti3iKfeePjuFR0ve0_f-ZLWhw@mail.gmail.com>
Subject: Re: Query plan getting less efficient over time with frequent updates
 and deletes..
To: pgsql-admin <pgsql-admin@postgresql.org>
Content-Type: multipart/alternative; boundary="0000000000002f7e810621f72a2e"
Archived-At: 
 <https://www.postgresql.org/message-id/CANzqJaCAHbn4vvPq-uWA5PJW6Ti3iKfeePjuFR0ve0_f-ZLWhw%40mail.gmail.com>
Precedence: bulk

--0000000000002f7e810621f72a2e
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Thu, Sep 12, 2024 at 7:56 PM Wells Oliver <wells.oliver@gmail.com>
wrote:

> Yes, I regularly look at pg_stat_user_tables and in particular
> last_autovacuum and last_autoanalyze and these are always the current
> date (or within two days) after our nightly processes soon finish.
>

"Or within two days".  I used to think that was adequate, but now I vacuum
and analyze some tables multiple times a day.

Some tables need autovacuum_X_scale_factor at 1.5% (0.015) and
autovacuum_X_threshold at 200, where X is vacuum or analyze.
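
To put numbers on that, here is a rough sketch of the documented autovacuum
trigger (threshold + scale_factor * row count). It assumes the "80m" quoted
further down is a row count; the function name and sample values are only
illustrative.

```python
# Sketch: how many dead tuples must accumulate before autovacuum
# considers a table due for VACUUM. PostgreSQL documents the trigger as
#   threshold + scale_factor * number of tuples
# The function name and the 80 million row figure are assumptions
# made for illustration.

def autovacuum_trigger(reltuples, scale_factor, threshold):
    """Return the dead-tuple count that must be exceeded."""
    return threshold + scale_factor * reltuples

ROWS = 80_000_000  # assumed row count, taken from "80m" in the thread

# PostgreSQL defaults: scale factor 0.2, threshold 50.
default_trigger = autovacuum_trigger(ROWS, 0.2, 50)

# The per-table values mentioned above: 1.5% and 200.
tuned_trigger = autovacuum_trigger(ROWS, 0.015, 200)

print(f"default: {default_trigger:,.0f} dead tuples")
print(f"tuned:   {tuned_trigger:,.0f} dead tuples")
```

With those inputs the default works out to 16,000,050 dead tuples and the
tuned values to 1,200,200, so the scale factor dominates on a table this
size.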

Because there are so many indices on that table, you might have to manually
vacuum it with a pretty high PARALLEL value.


> I wondered if the similar low planning time but the dissimilar longer
> execution time might indicate rows are spread out over disk, thereby
> negating a bitmap heap scan and the slower query taking longer due to
> having to read a lot more disk? Is that a possibility?
>

It was a possibility 30 years ago.  Modern filesystems (ext2 and newer)
purposefully spread files across the disk.


> On Thu, Sep 12, 2024 at 4:47 PM Ron Johnson <ronljohnsonjr@gmail.com>
> wrote:
>
>> On Thu, Sep 12, 2024 at 6:52 PM Wells Oliver <wells.oliver@gmail.com>
>> wrote:
>>
>>> Hi all: we have a table which receives frequent daily updates and
>>> deletes on the order of 100-600k. The overall row length is
>>> approximately 80m. This table has 50 indexes and 303 columns and is
>>> quite frequently queried by humans and applications.
>>>
>>> I've been in the habit of using pg_repack maybe once a month on this
>>> table because I can't quite figure out why querying gets bogged down.
>>> The vacuum and analyze thresholds are set such that the table is both
>>> auto vacuumed and analyzed every night.
>>>
>>
>> 1. You're absolutely positive that the VACUUM and ANALYZE complete every
>> night?
>> 2. Nightly may not be often enough.
>>
>>
>
> --
> Wells Oliver
> wells.oliver@gmail.com <wellsoliver@gmail.com>
>


--=20
Death to America, and butter sauce.
Iraq lobster!


--0000000000002f7e810621f72a2e--