Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1u1ipx-0070Eu-FW for pgsql-general@arkaria.postgresql.org; Mon, 07 Apr 2025 09:32:13 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1u1ipu-00EH9k-5E for pgsql-general@arkaria.postgresql.org; Mon, 07 Apr 2025 09:32:10 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1u1ipt-00EH9c-P2 for pgsql-general@lists.postgresql.org; Mon, 07 Apr 2025 09:32:10 +0000 Received: from mail-yb1-xb2d.google.com ([2607:f8b0:4864:20::b2d]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from ) id 1u1ipr-003tYn-2X for pgsql-general@lists.postgresql.org; Mon, 07 Apr 2025 09:32:09 +0000 Received: by mail-yb1-xb2d.google.com with SMTP id 3f1490d57ef6-e694601f624so2488789276.1 for ; Mon, 07 Apr 2025 02:32:07 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1744018326; x=1744623126; darn=lists.postgresql.org; h=to:subject:message-id:date:from:mime-version:from:to:cc:subject :date:message-id:reply-to; bh=uZYpUJXyB4cNHEfUYPdsR7A0U7A1zanFLigKPhAlx3k=; b=KcUICLo6MGYcEkbf1R9DxsghCK9GRYhwQAdp6lofE5ankasZRiU6KHZ4Et84BLZ4If L3RqqJgrU6/XMWyNs9SHhfOu/3UWcPmvS8jOXYW5s5Q7ks9u/LAaAC61ihrtxk3oyxp+ YgWIMrtQpSR4ZsCZQHlHaPoeIyAkz2ugjoXHF9OHZBLtooTMpGIWfNhDrxFawg4jsa8K +5FM/ZlwKsFyu6UYt7q02Od/WY52idqiOVJv3IeVOZujQZEf6zyY9tvERdDC1gTSU+BZ Ef1xGquOiGEHZtnNXjBRwCsgEDUIXqK3deOYnrkDR4+wmoIXbAX0N88k3zczbZSnGLXI vrvw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1744018326; x=1744623126; h=to:subject:message-id:date:from:mime-version:x-gm-message-state :from:to:cc:subject:date:message-id:reply-to; bh=uZYpUJXyB4cNHEfUYPdsR7A0U7A1zanFLigKPhAlx3k=; b=j8OvKDXfkc9XMSMHYJbiwcl0svAUZvzkbRP5uyjPMofg34G10zZXT4OaSkQnRgwif3 w/6XmmYh6YhQ+qaUnD81+1hqyVUbaJcTpBbmzkQwpBNJMONdXoO6r3yvsZKhzxuLRh5Y gtvjyTiFI9AbEeKKnoS3rU4fk9QPkhZOKBzkg4I0utO9sGhsWNEva4F84SnoMlbaQE36 e9JIzsUrmsHwQeKnBkSXE/Tdx0KrYk1MBQ+SbxtupUqloi80/6pkqkPlp9j1+WOr5K4l /dR+hgVmwDAs9+qUOrcNZykXOSxHGkTlu9hpvruW7ki9NNh70IaXN7z4Oh4x9abb0qf5 /kdA== X-Gm-Message-State: AOJu0Yzd+Tv36or4iZwA8OFXRTaZLkiq4UmlURva5YhgHmKlFhqOcCma dk1TwEEA1V38zcylCMzRzAqNxVnI+5OGoDktwuHEsn23f4mmgfZj01jBUTSRuSqN/bJ05gz0fqD 6ruOJtZM+zYHPSqs010u3hyOHdIU2Ce50 X-Gm-Gg: ASbGncvr1W9xi+QX0M3kCE8sESbEmhsQqWVr2B1VXc4h+sX7LdcdwOpAPDcqZgvy0nP smQJVv8Yyk8xwxEIq1zNfgPCOMWVXH83ODJyjZphzaes12hlPAvyX9pJiuzWYl7MFAu5pRPLTBF 7FGsLg6t/0X5kvLuDhvZ9h2/LDKaQ= X-Google-Smtp-Source: AGHT+IHVS7XzMG+ONMKmPfwnsww6aIeeX8Q1DkBC9P79a2589Go46mJ/YlxTlpMQbfaJFM83baujhYa7KrsgH5TPp48= X-Received: by 2002:a05:6902:2383:b0:e6d:f066:4061 with SMTP id 3f1490d57ef6-e6e318b1783mr11859063276.46.1744018325698; Mon, 07 Apr 2025 02:32:05 -0700 (PDT) MIME-Version: 1.0 From: KK CHN Date: Mon, 7 Apr 2025 15:04:26 +0530 X-Gm-Features: ATxdqUEK-tre0fEWclwJkHdQUmzST-AJXC4OHQ0xQydxP4VLOjFfAzOWK_dWiEY Message-ID: Subject: PgBackRest fails due to filesystem full To: pgsql-general@lists.postgresql.org Content-Type: multipart/alternative; boundary="000000000000c903b706322ce891" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --000000000000c903b706322ce891 Content-Type: text/plain; charset="UTF-8" List, I am running PgbackRest-2.52.1 on RHEL9.3 and EDB16 to backup to a remote repo server . Everything was working fine and backups were regularly taken with cron scheduler daily. But due to a / partition full 100 % utilization, the pgbackrest backup failed the other day. I came to know the backup script is not working for the backup which is scheduled daily from a cron scheduler. I made space in / file system by removing few log files from /var/pgbackrest/DBCluster1 I tried to reschedule the backup script (after deleting some log files from /var and now / is having 50 % free space ) but after running for 2 or 3 minutes pgbackrest fails as follows. [root@dbtest log]# sudo -u postgres pgbackrest --stanza=DBCluster1_Repo --type=full backup 2025-04-07 14:29:36.171 P00 INFO: backup command begin 2.52.1: --delta --exec-id=4175219-0893aa9e --log-level-console=info --log-level-file=debug --pg1-host=10.x.0.y --pg1-host-user=enterprisedb --pg1-path=/data/edb/as16/data --pg-version-force=16 --process-max=5 --repo1-block --repo1-bundle --repo1-cipher-pass= --repo1-cipher-type=aes-256-cbc --repo1-path=/data/DB_BKUPS --repo1-retention-diff=6 --repo1-retention-full=3 --stanza=DBCluster1_Repo --start-fast --type=full 2025-04-07 14:29:40.007 P00 INFO: execute non-exclusive backup start: backup begins after the requested immediate checkpoint completes 2025-04-07 14:29:41.383 P00 INFO: backup start archive = 00000001000001EB0000004C, lsn = 1EB/4C0003D8 2025-04-07 14:29:41.383 P00 INFO: check archive for prior segment 00000001000001EB0000004B *ERROR: [082]: WAL segment 00000001000001EB000000*4B was not archived before the 60000ms timeout HINT: check the archive_command to ensure that all options are correct (especially --stanza). HINT: check the PostgreSQL server log for errors. HINT: run the 'start' command if the stanza was previously stopped. *Again I ran the backup script but each time it fails with error (each time the WAL segment error with a new WAL segment number ) * 2025-04-07 14:30:41.383 P00 INFO: backup command end: aborted with exception [082] 2025-04-07 14:33:03.382 P00 INFO: check archive for prior segment 00000001000001EB0000004D ERROR: [082]: WAL segment 00000001000001EB0000004D was not archived before the 60000ms timeout HINT: check the archive_command to ensure that all options are correct (especially --stanza). HINT: check the PostgreSQL server log for errors. HINT: run the 'start' command if the stanza was previously stopped. 2025-04-07 14:34:03.382 P00 INFO: backup command end: aborted with exception [082] This may be due to the WAL segment from the DB server being unable to sync that time when the file system was full at the Repo Server side which was observed by me after 2 days !! Any hints how can I rectify this issue and put pgbackrest working back ?? How can I enforce the consistency of the Backups and WAL files since there may be missing WAL files in between when the RepoServer file system is full ? Thanks in advance Krishane --000000000000c903b706322ce891 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
List,=C2=A0

=C2=A0I am runni= ng PgbackRest-2.52.1 on RHEL9.3=C2=A0 and=C2=A0 EDB16=C2=A0 to backup to a = remote repo server .=C2=A0 =C2=A0Everything was working fine and backups we= re regularly taken=C2=A0 with=C2=A0 cron scheduler daily.=C2=A0=C2=A0
=

=C2=A0But due to a=C2=A0 =C2=A0/=C2=A0 partition=C2=A0f= ull 100 % utilization, the pgbackrest=C2=A0 backup failed the other day.=C2= =A0 I came to know the backup script is not working for the backup which is= scheduled daily from a cron scheduler.=C2=A0 I made=C2=A0 space in=C2=A0 /= =C2=A0 file system by removing few=C2=A0=C2=A0log files=C2=A0 from /var/pgb= ackrest/DBCluster1=C2=A0

I tried to=C2=A0 reschedu= le the backup script (after deleting some log files from=C2=A0 /var and now= / is having 50 % free space ) but after running for 2 or 3 minutes pgbackr= est fails as follows.=C2=A0


[root@d= btest log]# sudo -u postgres pgbackrest --stanza=3DDBCluster1_Repo --type= =3Dfull backup
2025-04-07 14:29:36.171 P00 =C2=A0 INFO: backup command b= egin 2.52.1: --delta --exec-id=3D4175219-0893aa9e --log-level-console=3Dinf= o --log-level-file=3Ddebug --pg1-host=3D10.x.0.y --pg1-host-user=3Denterpri= sedb --pg1-path=3D/data/edb/as16/data --pg-version-force=3D16 --process-max= =3D5 --repo1-block --repo1-bundle --repo1-cipher-pass=3D<redacted> --= repo1-cipher-type=3Daes-256-cbc --repo1-path=3D/data/DB_BKUPS --repo1-reten= tion-diff=3D6 --repo1-retention-full=3D3 --stanza=3DDBCluster1_Repo=C2=A0= =C2=A0--start-fast --type=3Dfull
2025-04-07 14:29:40.007 P00 =C2= =A0 INFO: execute non-exclusive backup start: backup begins after the reque= sted immediate checkpoint completes
2025-04-07 14:29:41.383 P00 =C2=A0 I= NFO: backup start archive =3D 00000001000001EB0000004C, lsn =3D 1EB/4C0003D= 8
2025-04-07 14:29:41.383 P00 =C2=A0 INFO: check archive for prior segme= nt 00000001000001EB0000004B
ERROR: [082]: WAL segment 00000001000001E= B0000004B was not archived before the 60000ms timeout
=C2=A0 =C2=A0 = =C2=A0 =C2=A0HINT: check the archive_command to ensure that all options are= correct (especially --stanza).
=C2=A0 =C2=A0 =C2=A0 =C2=A0HINT: check t= he PostgreSQL server log for errors.
=C2=A0 =C2=A0 =C2=A0 =C2=A0HINT: ru= n the 'start' command if the stanza was previously stopped.

=
Again I ran the backup script=C2=A0but each time it fails wit= h error (each time the WAL segment error with a new WAL segment=C2=A0number= )=C2=A0

2025-04-07 14:30:41.383 P00 =C2=A0 IN= FO: backup command end: aborted with exception [082]

=C2=A0 =C2=A0 =C2=A02025-04-07 14:33:03.382 P00 =C2=A0 INFO: check archi= ve for prior segment 00000001000001EB0000004D
ERROR: [082]: WAL segmen= t 00000001000001EB0000004D was not archived before the 60000ms timeout
= =C2=A0 =C2=A0 =C2=A0 =C2=A0HINT: check the archive_command to ensure that a= ll options are correct (especially --stanza).
=C2=A0 =C2=A0 =C2=A0 =C2= =A0HINT: check the PostgreSQL server log for errors.
=C2=A0 =C2=A0 =C2= =A0 =C2=A0HINT: run the 'start' command if the stanza was previousl= y stopped.

2025-04-07 14:34:03.382 P00 =C2=A0 INFO: backup command = end: aborted with exception [082]



=C2=A0 This may be due to = the WAL segment from the DB server being unable to sync that time when the = file system was full at the Repo Server side which was observed by me after= 2 days !!

Any hints how can I rectify this issue = and put pgbackrest working back ??=C2=A0

How can I= =C2=A0 enforce=C2=A0 the consistency of the Backups and WAL files since the= re may be missing WAL files in between when the RepoServer file system is f= ull ?



Thanks in adva= nce
Krishane





--000000000000c903b706322ce891--