MIME-Version: 1.0
References: <CAKna9VaJ_qHKBnw4O-VT3xGmzqThCuZ=LFXx-hPdw7E6RoqmeA@mail.gmail.com>
 <CAD=mzVUmXmkdvvMG30G1=D4Kq3WqnzGo=0ov9JnRCs1p=KJiTQ@mail.gmail.com>
 <CAKna9VZRc4+Vzbt6qPGMCauE84isPtz-wE_KX9AOt7WKfhwjiQ@mail.gmail.com>
 <CAD=mzVUX13ZM16kP4QhY+F5XiLr=ezCXftKOTKA4eUvhphgOJw@mail.gmail.com>
 <CAKna9Vb_mx+dX02XOV6mpr8RFC-5io38kM6=4xRHQj_MUvQ+aQ@mail.gmail.com>
 <CAKAnmmJ6fqyYafLB_im75oxxfTuCLUY0ftBPU57pUm0g+pm6FQ@mail.gmail.com>
 <CAKna9VZJ4fginFJZenGQxWs9eAw9Z8g-YkdnOFcie5RvuJ=5OQ@mail.gmail.com>
 <CAKAnmmLv72uk1p8+zmWpkC+BTatrdmRe_NpRbwRsi1LAU-cJFQ@mail.gmail.com>
 <CAKna9VZGwtNx9NAZ0QjdT-WhtFETAaFzpUsvM6R90mjaAoP3vA@mail.gmail.com>
 <CAD=mzVVS6HbV25M7EA+TJYk22G=GJvQK5Gbe9eeifYSmadYtNA@mail.gmail.com>
 <CAKna9Vb0ABStAWogCsXK+jTT9ZuLESJ0_-r3Wtvd=rZpYMYwxw@mail.gmail.com>
 <CAKAnmm+_avfVEFGgADebEyH=oQrEDuvviOcMYNa+myjJrds8Eg@mail.gmail.com>
 <CAKna9Vahx4ow0mtTEbVSeAU+f6U9v6G+Dkr-ymoyNhUZF_GRWw@mail.gmail.com> <D8602AF7-7EDE-46EC-B12F-5AE003969994@gmail.com>
In-Reply-To: <D8602AF7-7EDE-46EC-B12F-5AE003969994@gmail.com>
From: Lok P <loknath.73@gmail.com>
Date: Thu, 22 Aug 2024 12:55:48 +0530
Message-ID: <CAKna9VbRpcL4C60H11XM34zYMrDu97Q2DCKey2urdqr4Z0jH1A@mail.gmail.com>
Subject: Re: Column type modification in big tables
To: Alban Hertroys <haramrae@gmail.com>
Cc: Greg Sabino Mullane <htamfids@gmail.com>, sud <suds1434@gmail.com>, 
	pgsql-general <pgsql-general@lists.postgresql.org>
Content-Type: multipart/alternative; boundary="00000000000000ef0206204092b7"
Archived-At: <https://www.postgresql.org/message-id/CAKna9VbRpcL4C60H11XM34zYMrDu97Q2DCKey2urdqr4Z0jH1A%40mail.gmail.com>
Precedence: bulk

--00000000000000ef0206204092b7
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Thu, 15 Aug, 2024, 9:18 pm Alban Hertroys, <haramrae@gmail.com> wrote:

>
> > On 15 Aug 2024, at 14:15, Lok P <loknath.73@gmail.com> wrote:
>
> (=E2=80=A6)
>
> > Hello Greg,
> >
> > In terms of testing on sample data and extrapolating, as i picked the
> avg partition sizeof the table (which is ~20GB) and i created a non
> partitioned table with exactly same columns and populated with similar da=
ta
> and also created same set of indexes on it and the underlying hardware is
> exactly same as its on production. I am seeing it's taking ~5minutes to
> alter all the four columns on this table. So we have ~90 partitions in
> production with data in them and the other few are future partitions and
> are blank. (Note- I executed the alter with "work_mem=3D4GB,
> maintenance_work_mem=3D30gb, max_parallel_worker_per_gather=3D8,
> max_parallel_maintenance_worker =3D16" )
> >
> > So considering the above figures , can i safely assume it will take
> ~90*5minutes=3D ~7.5hours in production and thus that many hours of downt=
ime
> needed for this alter OR do we need to consider any other factors or
> activity here?
>
> Are all those partitions critical, or only a relative few?
>
> If that=E2=80=99s the case, you could:
>         1) detach the non-critical partitions
>         2) take the system down for maintenance
>         3) update the critical partitions
>         4) take the system up again
>         5) update the non-critical partitions
>         6) re-attach the non-critical partitions
>
> That could shave a significant amount of time off your down-time. I would
> script the detach and re-attach processes first, to save some extra.
>
> Admittedly, I haven=E2=80=99t actually tried that procedure, but I see no=
 reason
> why it wouldn=E2=80=99t work.
>
> Apart perhaps, from inserts happening that should have gone to some of
> those detached partitions. Maybe those could be sent to a =E2=80=98defaul=
t=E2=80=99
> partition that gets detached at step 7, after which you can insert+select
> those from the default into the appropriate partitions?
>
> But you were going to test that first anyway, obviously.
>

We were checking this strategy , but what we found is while attaching any
of the historical partition back to the child table , if there runs any
existing inserts on the other live partitions of the same child table that
attach keeps on hang state. Also during this period the parent table (which
is also partitioned) takes an exclusive lock on itself!!

Even detaching any partition  "concurrently" also waits for any inserts to
finish, even those are on other partitions. Is this behavior expected?

--00000000000000ef0206204092b7
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"auto"><div><br><br><div class=3D"gmail_quote"><div dir=3D"ltr" =
class=3D"gmail_attr">On Thu, 15 Aug, 2024, 9:18 pm Alban Hertroys, &lt;<a h=
ref=3D"mailto:haramrae@gmail.com" target=3D"_blank" rel=3D"noreferrer">hara=
mrae@gmail.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" st=
yle=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
&gt; On 15 Aug 2024, at 14:15, Lok P &lt;<a href=3D"mailto:loknath.73@gmail=
.com" rel=3D"noreferrer noreferrer" target=3D"_blank">loknath.73@gmail.com<=
/a>&gt; wrote:<br>
<br>
(=E2=80=A6)<br>
<br>
&gt; Hello Greg, <br>
&gt; <br>
&gt; In terms of testing on sample data and extrapolating, as i picked the =
avg partition sizeof the table (which is ~20GB) and i created a non partiti=
oned table with exactly same columns and populated with similar data and al=
so created same set of indexes on it and the underlying hardware is exactly=
 same as its on production. I am seeing it&#39;s taking ~5minutes to alter =
all the four columns on this table. So we have ~90 partitions in production=
 with data in them and the other few are future partitions and are blank. (=
Note- I executed the alter with &quot;work_mem=3D4GB, maintenance_work_mem=
=3D30gb, max_parallel_worker_per_gather=3D8, max_parallel_maintenance_worke=
r =3D16&quot; )<br>
&gt; <br>
&gt; So considering the above figures , can i safely assume it will take ~9=
0*5minutes=3D ~7.5hours in production and thus that many hours of downtime =
needed for this alter OR do we need to consider any other factors or activi=
ty here? <br>
<br>
Are all those partitions critical, or only a relative few?<br>
<br>
If that=E2=80=99s the case, you could:<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 1) detach the non-critical partitions<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 2) take the system down for maintenance<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 3) update the critical partitions<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 4) take the system up again<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 5) update the non-critical partitions<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 6) re-attach the non-critical partitions<br>
<br>
That could shave a significant amount of time off your down-time. I would s=
cript the detach and re-attach processes first, to save some extra.<br>
<br>
Admittedly, I haven=E2=80=99t actually tried that procedure, but I see no r=
eason why it wouldn=E2=80=99t work.<br>
<br>
Apart perhaps, from inserts happening that should have gone to some of thos=
e detached partitions. Maybe those could be sent to a =E2=80=98default=E2=
=80=99 partition that gets detached at step 7, after which you can insert+s=
elect those from the default into the appropriate partitions?<br>
<br>
But you were going to test that first anyway, obviously.<br></blockquote></=
div></div><div dir=3D"auto"><br></div><div dir=3D"auto"><span style=3D"font=
-size:12.8px">We were checking this strategy=C2=A0, but what we found is wh=
ile attaching any of the historical partition back to the child table , if =
there runs any existing inserts on the other live partitions of the same ch=
ild table that attach keeps on hang state. Also during this period the pare=
nt table (which is also partitioned) takes an exclusive lock on itself!!=C2=
=A0</span><br></div><div dir=3D"auto"><span style=3D"font-size:12.8px"><br>=
</span></div><div dir=3D"auto"><span style=3D"font-size:12.8px">Even detach=
ing any partition=C2=A0 &quot;concurrently&quot; also waits for any inserts=
 to finish, even those are on other partitions. Is this behavior expected?=
=C2=A0</span></div><div dir=3D"auto"><span style=3D"font-size:12.8px"><br><=
/span></div><div dir=3D"auto"><span style=3D"font-size:12.8px"><br></span><=
/div><div dir=3D"auto"><div class=3D"gmail_quote"><blockquote class=3D"gmai=
l_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left=
:1ex"></blockquote></div></div></div>

--00000000000000ef0206204092b7--