MIME-Version: 1.0
References: 
 <CAOC+FBVyTiuOMF92pQG3QOrE9j8qFegvZLoi-3UmdhFNg94e2A@mail.gmail.com>
 <14770c231bf27ea6d22376395ac8f02e41462ed5.camel@cybertec.at>
 <CAOC+FBX1L6hx4DHzH34eaWyhH6NK9ywYSJMMcEJH3rHZfW86+A@mail.gmail.com>
 <CANzqJaAh+p4Hys-aPTW2zAuYg_MLj7QJ7J6tQB3g8dA2K3DtWg@mail.gmail.com>
 <CAOC+FBUoykok2udm_3S8U4wRTDbKGdf=oY0Ga6QDCbLZ9L4wnw@mail.gmail.com>
 <CAOC+FBVHX5yPrawa6wQh6rC4F6GJYMHK9zjH-DhF7xV-g0PaZQ@mail.gmail.com>
In-Reply-To: 
 <CAOC+FBVHX5yPrawa6wQh6rC4F6GJYMHK9zjH-DhF7xV-g0PaZQ@mail.gmail.com>
From: Wells Oliver <wells.oliver@gmail.com>
Date: Sun, 17 Nov 2024 09:34:42 -0800
Message-ID: 
 <CAOC+FBWW4SYT4K0Hk8rL+xMwK7mGi6O8CqBeMjW66GdxCBruPQ@mail.gmail.com>
Subject: Re: RDS restore failed due to WAL log and disk space-- any tidy
 fixes?
To: Ron Johnson <ronljohnsonjr@gmail.com>
Cc: pgsql-admin <pgsql-admin@postgresql.org>
Content-Type: multipart/alternative; boundary="0000000000003e33cd06271f3908"
Archived-At: 
 <https://www.postgresql.org/message-id/CAOC%2BFBWW4SYT4K0Hk8rL%2BxMwK7mGi6O8CqBeMjW66GdxCBruPQ%40mail.gmail.com>
Precedence: bulk

--0000000000003e33cd06271f3908
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Would setting max_slot_wal_keep_size to something like 1GB ensure that WAL
logs don't cause runaway disk use during restore? It's currently -1...

On Sun, Nov 17, 2024 at 9:31=E2=80=AFAM Wells Oliver <wells.oliver@gmail.co=
m> wrote:

> Actually, in RDS it seems you cannot set archive_mode either.
>
> On Sun, Nov 17, 2024 at 9:23=E2=80=AFAM Wells Oliver <wells.oliver@gmail.=
com>
> wrote:
>
>> It does. I think it uses WAL behind the scenes. In RDS unfortunately
>> cannot set wal_level, but you can set archive_mode.
>>
>> On Sun, Nov 17, 2024 at 9:21=E2=80=AFAM Ron Johnson <ronljohnsonjr@gmail=
.com>
>> wrote:
>>
>>> Doesn't RDS have its own replication?
>>>
>>> Anyway, for pg_restore, I'd absolutely set archive_mode=3Doff
>>> and wal_level=3Dminimal, then set them to their production values when =
it's
>>> finished.
>>>
>>> On Sun, Nov 17, 2024 at 12:12=E2=80=AFPM Wells Oliver <wells.oliver@gma=
il.com>
>>> wrote:
>>>
>>>> Interesting. I am migrating a pg_dump archive to a new server, in a
>>>> single go. Does it make sense to disable (or speed up?) WAL archiving
>>>> during the restore, then reenable it after the restore so a future rep=
lica
>>>> could work? What would be the steps here? Would disabling or "speeding=
 up"
>>>> be faster?
>>>>
>>>> max_slot_wal_keep_size is -1 at the moment so I think that's why it
>>>> kept a ton of WAL and ran out of space.
>>>>
>>>> On Sun, Nov 17, 2024 at 7:41=E2=80=AFAM Laurenz Albe <laurenz.albe@cyb=
ertec.at>
>>>> wrote:
>>>>
>>>>> On Sat, 2024-11-16 at 16:33 -0800, Wells Oliver wrote:
>>>>> > I provisioned an RDS instance with 2500GB space and began the
>>>>> restore of a database I know to be about 1750 GB using 16 jobs.
>>>>> >
>>>>> > Unfortunately, it died very near the end when it ran out of disk
>>>>> space due to WAL log usage. Lots of:
>>>>> >
>>>>> > 2024-11-17 00:07:09 UTC::@:[19861]:PANIC:  could not write to file
>>>>> "pg_wal/xlogtemp.19861": No space left on device
>>>>> >
>>>>> >
>>>>> > And then kaboom.
>>>>> >
>>>>> > I'm wondering what my course of action should be. Can I
>>>>> disable/reduce WAL during a restore?
>>>>> > wal_level is set to replica, can this temporarily be set to minimal=
?
>>>>> Should I just eat the extra
>>>>> > costs to add headroom for the WAL? Would using fewer jobs during a
>>>>> restore reduce the amount of WAL
>>>>> > created?
>>>>>
>>>>> If you are using minimal WAL logging and you restore the dump in a
>>>>> single transaction, you
>>>>> should see way less WAL generated, because data inserted into the
>>>>> table in the same transaction
>>>>> as the CREATE TABLE statement need not be WAL logged.
>>>>>
>>>>> But you might more easily solve the problem by speeding up or
>>>>> disabling the WAL archiver,
>>>>> so that PostgreSQL removes old WAL after the next checkpoint.
>>>>>
>>>>> Yours,
>>>>> Laurenz Albe
>>>>>
>>>>
>>>>
>>>> --
>>>> Wells Oliver
>>>> wells.oliver@gmail.com <wellsoliver@gmail.com>
>>>>
>>>
>>>
>>> --
>>> Death to <Redacted>, and butter sauce.
>>> Don't boil me, I'm still alive.
>>> <Redacted> lobster!
>>>
>>
>>
>> --
>> Wells Oliver
>> wells.oliver@gmail.com <wellsoliver@gmail.com>
>>
>
>
> --
> Wells Oliver
> wells.oliver@gmail.com <wellsoliver@gmail.com>
>


--=20
Wells Oliver
wells.oliver@gmail.com <wellsoliver@gmail.com>

--0000000000003e33cd06271f3908
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-size:small">Wou=
ld setting=C2=A0max_slot_wal_keep_size to something like 1GB ensure that WA=
L logs don&#39;t cause runaway disk use during restore? It&#39;s currently =
-1...</div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"g=
mail_attr">On Sun, Nov 17, 2024 at 9:31=E2=80=AFAM Wells Oliver &lt;<a href=
=3D"mailto:wells.oliver@gmail.com">wells.oliver@gmail.com</a>&gt; wrote:<br=
></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;=
border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><=
div class=3D"gmail_default" style=3D"font-size:small">Actually, in RDS it s=
eems you cannot set=C2=A0archive_mode either.</div></div><br><div class=3D"=
gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Sun, Nov 17, 2024 at =
9:23=E2=80=AFAM Wells Oliver &lt;<a href=3D"mailto:wells.oliver@gmail.com" =
target=3D"_blank">wells.oliver@gmail.com</a>&gt; wrote:<br></div><blockquot=
e class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px s=
olid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmai=
l_default" style=3D"font-size:small">It does. I think it uses WAL behind th=
e scenes. In RDS unfortunately cannot set wal_level, but you can set archiv=
e_mode.</div></div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D=
"gmail_attr">On Sun, Nov 17, 2024 at 9:21=E2=80=AFAM Ron Johnson &lt;<a hre=
f=3D"mailto:ronljohnsonjr@gmail.com" target=3D"_blank">ronljohnsonjr@gmail.=
com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"marg=
in:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1e=
x"><div dir=3D"ltr"><div>Doesn&#39;t RDS have its own replication?</div><di=
v><br></div><div>Anyway, for pg_restore, I&#39;d absolutely set=C2=A0archiv=
e_mode=3Doff and=C2=A0wal_level=3Dminimal, then set them to their productio=
n values when it&#39;s finished.</div><div><br></div><div dir=3D"ltr">On Su=
n, Nov 17, 2024 at 12:12=E2=80=AFPM Wells Oliver &lt;<a href=3D"mailto:well=
s.oliver@gmail.com" target=3D"_blank">wells.oliver@gmail.com</a>&gt; wrote:=
<br></div><div class=3D"gmail_quote"><blockquote class=3D"gmail_quote" styl=
e=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);paddin=
g-left:1ex"><div dir=3D"ltr"><div style=3D"font-size:small">Interesting. I =
am migrating a pg_dump archive to a new server,=C2=A0in a single go. Does i=
t make=C2=A0sense to disable (or speed up?) WAL archiving during the restor=
e, then reenable it after the restore so a future replica could work? What =
would be the steps here? Would disabling or &quot;speeding up&quot; be fast=
er?</div><div style=3D"font-size:small"><br></div><div style=3D"font-size:s=
mall">max_slot_wal_keep_size is -1 at the moment so I think that&#39;s why =
it kept a ton of WAL and ran out of space.</div></div><br><div class=3D"gma=
il_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Sun, Nov 17, 2024 at 7:4=
1=E2=80=AFAM Laurenz Albe &lt;<a href=3D"mailto:laurenz.albe@cybertec.at" t=
arget=3D"_blank">laurenz.albe@cybertec.at</a>&gt; wrote:<br></div><blockquo=
te class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px =
solid rgb(204,204,204);padding-left:1ex">On Sat, 2024-11-16 at 16:33 -0800,=
 Wells Oliver wrote:<br>
&gt; I provisioned an RDS instance with 2500GB space and began the restore =
of a database I know to be about 1750 GB using 16 jobs.<br>
&gt; <br>
&gt; Unfortunately, it died very near the end when it ran out of disk space=
 due to WAL log usage. Lots of:<br>
&gt; <br>
&gt; 2024-11-17 00:07:09 UTC::@:[19861]:PANIC:=C2=A0 could not write to fil=
e &quot;pg_wal/xlogtemp.19861&quot;: No space left on device<br>
&gt; <br>
&gt; <br>
&gt; And then kaboom.<br>
&gt; <br>
&gt; I&#39;m wondering what my course of action should be. Can I disable/re=
duce WAL during a restore?<br>
&gt; wal_level is set to replica, can this temporarily be set to minimal? S=
hould I just eat the extra<br>
&gt; costs to add headroom for the WAL? Would using fewer jobs during a res=
tore reduce the amount of WAL<br>
&gt; created?<br>
<br>
If you are using minimal WAL logging and you restore the dump in a single t=
ransaction, you<br>
should see way less WAL generated, because data inserted into the table in =
the same transaction<br>
as the CREATE TABLE statement need not be WAL logged.<br>
<br>
But you might more easily solve the problem by speeding up or disabling the=
 WAL archiver,<br>
so that PostgreSQL removes old WAL after the next checkpoint.<br>
<br>
Yours,<br>
Laurenz Albe<br>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr"><div>Wells Oliver<br><a href=3D"mailto:wellsoliv=
er@gmail.com" target=3D"_blank">wells.oliver@gmail.com</a></div></div></div=
>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, and butter sauce.<div=
>Don&#39;t boil me, I&#39;m still alive.<br><div><div>&lt;Redacted&gt; lobs=
ter!</div></div></div></div></div></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr"><div>Wells Oliver<br><a href=3D"mailto:wellsoliv=
er@gmail.com" target=3D"_blank">wells.oliver@gmail.com</a></div></div></div=
>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr"><div>Wells Oliver<br><a href=3D"mailto:wellsoliv=
er@gmail.com" target=3D"_blank">wells.oliver@gmail.com</a></div></div></div=
>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr"><div>Wells Oliver<br><a href=3D"mailto:wellsoliv=
er@gmail.com" target=3D"_blank">wells.oliver@gmail.com</a></div></div></div=
>

--0000000000003e33cd06271f3908--