MIME-Version: 1.0
References: 
 <CACG=ezZOrNsuLoETLD1gAswZMuH2nGGq7Ogcc0QOE5hhWaw=cw@mail.gmail.com>
 <CAD21AoCdx5ZNS_cO7bYz1Zfb+Kw1kuJV2wtewrz7T1pPpjcWGw@mail.gmail.com>
 <CAJDiXgi6ZQOoSEqj9RyZMEh+HHBtmW0+PHD85UNPtKch8ubvdg@mail.gmail.com>
 <CAD21AoBcoA-i-pJ_=y+jg14R8_QaJA1iwktCnu5i-C=yXDFPdA@mail.gmail.com>
 <CAJDiXgjnUdE6Sk4M0unmT+9dULyFAxcum2txQKpWTuo4uQ_oXQ@mail.gmail.com>
 <CAD21AoBTZdVR93JBo620B=MX-K8cdm3VRbjrBr_Vcpngk3AjVw@mail.gmail.com>
 <CAA5RZ0vfBg=c_0Sa1Tpxv8tueeBk8C5qTf9TrxKBbXUqPc99Ag@mail.gmail.com>
 <CAD21AoBgvUeWS8ZsXBahA1XdYayK6DJ6dx49d6Xpii-iH+Hrwg@mail.gmail.com>
 <CAA5RZ0vF+Lr-jU1LAZWTGUjboUETk8oLvaNBbA5ozX6dau+how@mail.gmail.com>
 <CAJDiXggueLSGMNRmLshbmFRfbo4jzks0W8bLDfUSRZ-61fPVEQ@mail.gmail.com>
 <CAFY6G8cJ=DRTX75pOGerH6sk39dRt+7MSH+y_qppDdhPs=qdQA@mail.gmail.com>
 <CAJDiXgg1t6wk9NjyMUTm1iKqM9GtdQ_wrEchBtz3xjWBZM8W8A@mail.gmail.com>
 <CAD21AoAC0=Xi38RQcAO4A+vdmoXToZMoHfbS=KLT49fAOTH_gA@mail.gmail.com>
 <CAJDiXgiD+AZKhJSn-FSRVQxtDLmJd95wDu4wtKniQF5==1JcjQ@mail.gmail.com>
 <CAD21AoAM8KsqNhrZYJuf7odvxcTC0TumXazJc-r_wC5KnDFDPg@mail.gmail.com>
 <CAJDiXghbcOC9OOj3ampxuyqXH0geggnosnrYUHGygkpss-RtxA@mail.gmail.com>
 <CAD21AoAPnq0vrcGgeN++r1GoL8Kza7jaGL=TNzuBn6+MkR=rUQ@mail.gmail.com>
 <CAJDiXghmsbTmnm--9B5bbuZXa1OL7SZ0HYppX3tx9XsdwfJBhA@mail.gmail.com>
 <DB3C67FCRLOO.1R5NLYCNEA6BF@gmail.com>
 <CAJDiXgiYiX+azuR76DcVx8fZn57m_4v6cB14-GW34mWa=qudFQ@mail.gmail.com>
 <CAD21AoDtPpkkQ_h1yf4oTx1qn4SRdTeVY3qs+9J07fYqa_4Gww@mail.gmail.com>
 <CAJDiXgi7KB7wSQ=Ux=ngdaCvJnJ5x-ehvTyiuZez+5uKHtV6iQ@mail.gmail.com>
 <CAD21AoCcHKKXsr9Oh736ejckqqS1i430xGEyJ=JP5OL0ExyP1A@mail.gmail.com>
 <CAJDiXghaFT_1sSv3q8mjyZ_RLZDgiogg0mWRvLxSWvkUi2CcLg@mail.gmail.com>
 <CAA5RZ0u63W41OmcEO+HLs4CSo-Sd3J+Q-4=04iud8V=xX4iUrA@mail.gmail.com>
 <CAJDiXgin1TXniVGJKzOTA=F9K342uVfm6O0EmubTVB=F+XSrbA@mail.gmail.com>
 <CAD21AoDadzAwibxf-+urjx=XL+eVu8=Ut-Lh2GxXUt32LbPG3Q@mail.gmail.com>
 <CAD21AoD6HhraqhOgkQJOrr0ixZkAZuqJRpzGv-B+_-ad6d5aPw@mail.gmail.com>
 <CAJDiXgiGSpqMQSOx-cVO_LtcB5GWHBy9ph7oOR4ebbX8A==kgw@mail.gmail.com>
 <CAD21AoBRRXbNJEvCjS-0XZgCEeRBzQPKmrSDjJ3wZ8TN28vaCQ@mail.gmail.com>
 <CAPpHfduBJfMcojvmYHUo8b_C=0cxRy1N+tNiNGoA3RAZq2ApaA@mail.gmail.com>
 <CAD21AoC82NeHKXc965pPUZO2eyo1U7P6cmfRJbrcPDcnd7_6hw@mail.gmail.com>
 <CAJDiXghP2kXnEz+cj3rAWNM3NdKSB_4WtnngFXpVz2omPhGr5A@mail.gmail.com>
 <CAD21AoA0bnRZC_OqKMnH-Ln+OZ9z9k56j2c_MXj8pw69O-wkBw@mail.gmail.com>
 <CAA5RZ0sSXDza7_nUUbhHL_Sws+M+HR1daKJPXHpdLuNCkwUgUg@mail.gmail.com>
 <CAJDiXggrBsbzOisf+Nu8pZkYGrpUZaFbosL1Wbm3kKxzTm4xgw@mail.gmail.com>
 <CAA5RZ0tbiPcgQEjnhdnjz6qSjfRsGrr8jGCaMcrMaoPpax3wig@mail.gmail.com>
 <CAJDiXgjt5ZmK2uvS0E8Ztt5ePYmq8Ze_dG05Zo2NUsKLHCEuYA@mail.gmail.com>
 <CAD21AoB7v5tLPXLK=qmtt6PaEC1f+Fb-gh+MwAbXfm6x4eZGNw@mail.gmail.com>
 <CAJDiXghwtUbiFnAh3nSaxTk8KFupQuMbp+g4z3wOLoQfMuqgDg@mail.gmail.com>
 <CAJDiXgjoNd4BF19HNY_FAcDUqiqsfw8cGhNOJwBxahB8P38E3Q@mail.gmail.com>
 <CAD21AoBT1LWqPZkcHpVMVh0ZOXUneO=p61t0i8cQ+kOP9qfODQ@mail.gmail.com>
 <CAJDiXggL=J0nV7PfBsMW9+UOU3KUp1jNBM9Gov1JvAX7aG_U1g@mail.gmail.com>
 <CAD21AoDz-1Zf9DOJJrdcB2=eNA4UdywthkowNp_dHmOGC-yV_g@mail.gmail.com>
 <CAJDiXgjzphJ313=aDwbvryHpmTi6AqE+-5crysTtzKv01-vkzA@mail.gmail.com>
 <CAD21AoD7_4gsQ2a82zO3SaRwjdw_3tyiYDHNFPUKQ5DAA5HOtA@mail.gmail.com>
 <CAJDiXggY1QzNde6_HhpzneLc9dYqmWZ+PY39cuBXYdcCTuoJBA@mail.gmail.com>
 <CAD21AoCFPiS2jcMA1JaV1kT8xrGz5BpN7iBP_gCgRuaANEbciA@mail.gmail.com>
 <CAJDiXgh6jmNGR3uOB_6YeGhNkR2=HdTdEYjmHXdumNzyY4MckQ@mail.gmail.com>
 <CAD21AoDs3SOXeAEoCRizfEKybpRkE7t7poX0+iZ6MM1MFWMsfA@mail.gmail.com>
 <CAJDiXgjTkuqSPerC_nasxDz6d2Komf1ipYKV6SupDRnc9yhO9w@mail.gmail.com>
 <CAD21AoAXMjX03h5K84u0heBLU+fqGgWBGBDwnBDGSs=DhyF9pQ@mail.gmail.com>
 <CAJDiXghjZEAYboGhujgGvY9=RiFD01ERHVVF+NQMuuAKVZDmDQ@mail.gmail.com>
 <CAD21AoAD+N5SxBr0qL7TeWnvq4iYmFT=DyWdNLQPB-XntYkwEg@mail.gmail.com>
 <CAJDiXgjgn87sH1-MmONPKkeYJG83C0ChrYkYn9UcRonLhOOfOw@mail.gmail.com>
 <CAD21AoCoJYauWO78M-CGdHpYfcqEZVV5a1Z-7wWB=-G-x8EVFg@mail.gmail.com>
 <CAJDiXghaazbrQMZZS08d9Ffh2y4w05TgH9dpBhqChv1qNTp+xA@mail.gmail.com>
 <CAD21AoDbaNtLrFRxG9OG5WrBd7DCs4q+CfJd8AJTBEqRri4WeQ@mail.gmail.com>
 <CAJDiXgjjd1jL86B--AyRo2tDM1Wiu+7Pduwh5d0u_UM8GRugvw@mail.gmail.com>
 <CAD21AoBo_wS7y0X7_7ajEFkptzo9ZrF8RFNRnu2Xe8XL74o0SQ@mail.gmail.com>
 <CAJDiXggH1bW=4n+55CGLvs_sRU4SYNXwYLZ37wvJ5H_3yURSPw@mail.gmail.com>
 <CAD21AoDxhN8Z6Lx1ZicBXKkbMsRQqEXiq4ALs4uaD648iSvXoA@mail.gmail.com>
 <CAJDiXgh3Dg2f5k3xRJnzoY39jQENUhh125ArYapXkSu5D7JJuw@mail.gmail.com>
In-Reply-To: 
 <CAJDiXgh3Dg2f5k3xRJnzoY39jQENUhh125ArYapXkSu5D7JJuw@mail.gmail.com>
From: Masahiko Sawada <sawada.mshk@gmail.com>
Date: Thu, 19 Mar 2026 16:58:05 -0700
Message-ID: 
 <CAD21AoBYc7L7W4dRdxeoJzOH5OgpiCAtKz-54iX4Ufn8PnQoww@mail.gmail.com>
Subject: Re: POC: Parallel processing of indexes in autovacuum
To: Daniil Davydov <3danissimo@gmail.com>
Cc: Sami Imseih <samimseih@gmail.com>,
 Alexander Korotkov <aekorotkov@gmail.com>,
	Matheus Alcantara <matheusssilv97@gmail.com>,
 Maxim Orlov <orlovmg@gmail.com>,
	Postgres hackers <pgsql-hackers@lists.postgresql.org>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Archived-At: 
 <https://www.postgresql.org/message-id/CAD21AoBYc7L7W4dRdxeoJzOH5OgpiCAtKz-54iX4Ufn8PnQoww%40mail.gmail.com>
Precedence: bulk

On Thu, Mar 19, 2026 at 7:29=E2=80=AFAM Daniil Davydov <3danissimo@gmail.co=
m> wrote:
>
> Hi,
>
> On Thu, Mar 19, 2026 at 2:49=E2=80=AFAM Masahiko Sawada <sawada.mshk@gmai=
l.com> wrote:
> >
> > Yes, we already have such a code for PARALLEL option for the VACUUM com=
mand.
> >
> > I guess it's better that autovacuum codes also somewhat follow this
> > code for better consistency.
> >
>
> I agree. You can find it in the v29-0002 patch.
>
> > > I'm afraid that I can't agree with you here. As I wrote above [1], th=
e
> > > parallel a/v feature will be useful when a user has a few huge tables=
 with
> > > a big amount of indexes. Only these tables require parallel processin=
g and a
> > > user knows about it.
> >
> > Isn't it a case where users need to increase
> > min_parallel_index_scan_size? Suppose that there are two tables that
> > are big enough and have enough indexes, it's more natural to me to use
> > parallel vacuum for both tables without user manual settings.
> >
>
> Do you mean that the user can increase this parameter so that smaller tab=
les
> are not considered for the parallel a/v? If so, I don't think it will alw=
ays
> be handy. When I say "smaller tables" I mean that they are small relative=
 to
> super huge tables. But actually these "smaller tables" can be pretty big =
and
> require a parallel index scan within parallel queries or VACUUM PARALLEL =
(not
> an autovacuum).

I think that if these small tables are actually big, these are also
eligible for using parallel autovacuums.

> > > If we implement the feature as you suggested, then after setting the
> > > av_max_parallel_workers to N > 0, the user will have to manually disa=
ble
> > > processing for all tables except the largest ones. This will need to =
be done
> > > to ensure that parallel workers are launched specifically to process =
the
> > > largest tables and not wasting on the processing of little ones.
> > >
> > > I.e. I'm proposing a design that will require manual actions to *enab=
le*
> > > parallel a/v for several large tables rather than *disable* it for al=
l of
> > > the rest tables in the cluster. I'm sure that's what users want.
> > >
> > > Allowing the system to decide which tables to process in parallel is =
a good
> > > way from a design perspective. But I'm thinking of the following exam=
ple :
> > > Imagine that we have a threshold, when exceeded, parallel a/v is used=
.
> > > Several a/v workers encounter tables which exceed this threshold by 1=
_000 and
> > > each of these workers decides to launch a few parallel workers. Anoth=
er a/v
> > > worker encounters a table which is beyond this threshold by 1_000_000=
 and
> > > tries to launch N parallel workers, but facing the max_parallel_worke=
rs
> > > shortage. Thus, processing of this table will take a very long time t=
o
> > > complete due to lack of resources. The only way for users to avoid it=
 is to
> > > disable parallel a/v for all tables, which exceeds the threshold and =
are not
> > > of particular interest.
> >
> > I think the same thing happens even with the current design as long as
> > users misconfigure max_parallel_workers, no? Setting
> > autovacuum_max_parallel_workers to >0 would mean that users want to
> > give additional resources for autovacuums in general, I think it makes
> > sense to use parallel vacuum even for tables which exceed the
> > threshold by 1000.
> >
> > Users who want to use parallel autovacuum would have to set
> > max_parallel_workers (and max_worker_processes) high enough so that
> > each autovacuum worker can use parallel workers. If resource
> > contention occurs, it's a sign that the limits are not configured
> > properly.
> >
>
> Yeah, currently user can misconfigure max_parallel_workers, so (for examp=
le)
> multiple VACUUM PARALLEL operations running at the same time will face wi=
th
> a shortage of parallel workers. But I guess that every system has some sa=
ne
> limit for this parameter's value. If we want to ensure that all a/v leade=
rs
> are guaranteed to launch as many parallel workers as required, we might n=
eed
> to increase the max_parallel_workers too much (and cross the sane limit).
> IMHO it may be unacceptable for many systems in production, because it wi=
ll
> undermine the stability.

I understand the concern that if max_parallel_workers (and/or
max_worker_processes) value are not high enough to ensure each
autovacuum workers can launch autovacuum_max_parallel_workers, an
autovacuum on the very large table might not be able to launch the
full workers in case where some parallel workers are already being
used by others (e.g., another autovacuum on a different
slightly-smaller table etc.). But I'm not sure that the opt-out style
can handle these cases. Even if there are two huge tables and users
set parallel_vacuum_workers to both tables, there is no guarantee that
autovacuums on these tables can use the full workers, as long as
max_parallel_workers value is not enough.

>
> > The 0001 patch looks good to me. I've updated the commit message and
> > attached it. I'm going to push the patch, barring any objections.
> >
>
> Great news!

Pushed the 0001 patch.

>
> BTW, do we need to mention that this parameter can be overridden by the
> per-table setting?

IIUC the per-table setting is not actually overwriting the GUC
parameter value, but it works as an additional cap. For instance, if
autovacuum_max_parallel_workers is 2 and autovacuum_parallel_workers
is 5, we cap the parallel degree by 2, which is a similar behavior to
other parallel operations such as the parallel_workers storage
parameter. BTW it actually works in a somewhat different way than
other autovacuum-related storage parameters; the per-table parameters
overwrite GUC values. I decided to use the former behavior because
autovacuum_max_parallel_workers can work as a global switch to disable
all parallel autovacuum behavior on the system.


> > Part 3 can briefly mention that autovacuum can perform parallel vacuum
> > with parallel workers capped by autovacuum_max_parallel_workers as
> > follow:
> >
> >   For tables with the <xref linkend=3D"reloption-autovacuum-parallel-wo=
rkers"/>
> >   storage parameter set, an autovacuum worker can perform index vacuumi=
ng and
> >   index cleanup with background workers. The number of workers launched=
 by
> >   a single autovacuum worker is limited by the
> >   <xref linkend=3D"guc-autovacuum-max-parallel-workers"/>.
>
> I suggest adding here also a description of the method for calculating th=
e
> number of parallel workers. If so, I feel that this part of documentation=
 will
> be completely the same as in VACUUM PARALLEL (except a few little details=
).
> Maybe we can create some dedicated subchapter in the "Routine vacuuming" =
where
> we describe how the number of parallel workers is decided. Lets call it
> something like "24.1.7 Parallel Vacuuming". Both VACUUM PARALLEL and para=
llel
> autovacuum can refer to this subchapter. I think it will be much easier t=
o
> maintain. What do you think?

Describing the parallel vacuum in a new chapter in section 24.1 sounds
like a good idea.

Regards,

--
Masahiko Sawada
Amazon Web Services: https://aws.amazon.com