Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1vdOZ0-0001tp-2o for pgsql-hackers@arkaria.postgresql.org; Wed, 07 Jan 2026 08:06:43 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1vdOYz-00CZGi-2b for pgsql-hackers@arkaria.postgresql.org; Wed, 07 Jan 2026 08:06:42 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1vdOYz-00CZGZ-1V for pgsql-hackers@lists.postgresql.org; Wed, 07 Jan 2026 08:06:42 +0000 Received: from mail-oo1-xc34.google.com ([2607:f8b0:4864:20::c34]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from ) id 1vdOYx-0052qF-1y for pgsql-hackers@lists.postgresql.org; Wed, 07 Jan 2026 08:06:41 +0000 Received: by mail-oo1-xc34.google.com with SMTP id 006d021491bc7-65b6b69baf8so594275eaf.3 for ; Wed, 07 Jan 2026 00:06:39 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1767773197; x=1768377997; darn=lists.postgresql.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=Syu/Ueq0Xd+vlxoEWG6PB1QZtl+34riak2IW+0rWwsw=; b=VQqBa5O9wQCMrO7dd404xFO53c3aVbRHp8+i7zUGGYDDPxgc3XV1oaqqCXv4Q+R7Jc PinL1xJaSgt/ZeozPdoNTqfCiTpk5Xk1vpx2K6PA8GMNniL4kCMZc//Q+M/AGIyOldDa umhI0fsbzcVfC+6N1l4j1UTG0AwOuu19hzQziiQYjw0UT+7PwtS5EVRMqnsHmzdkzD4J 0jNFJkuz2uQbNZxspMXMOabnMGyUmm2Xp68CFteyu7VNfFcgN2B6S475MgFGsGxEDBoq uEzlQAJ+1R4zdqoBXb7rkgetD9rsJ/mzhKnSAqtQ+nclsZzrPExPfyOsObg19p4z6t+e yKZw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1767773197; x=1768377997; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-gg:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=Syu/Ueq0Xd+vlxoEWG6PB1QZtl+34riak2IW+0rWwsw=; b=NIa3pwKb0Ppyv9NngoLtX7w+ELfHIISprEUJ6KsUDQOsFJL4wFmRhommShBhnq1a+2 sgol+K3Gd61fbEAtHTSGYLOv5bg+00YlgjgKaLa+FLJfxML/ELr5z3OKz6vCGX+y2QK9 i3ewcHg2LMgiJ7k/SXAVcVFJYfHOMrfcVkQE1AIdtG7Eu8uGFsMxWxhZCo9vdJKrlksi ZJuxW82OBfaGhxPWKHDdxZUI502Va3b93Jqf0PVY+EsVNSyjxeFPlz0iS8PJZr4dGCrY V2zOjfR6N4MUrYHhzq36DeUFvgjeji0L1sGEGrmsMnc02h/jQvjQqZmG1GmYnIHYbY8Z oNZA== X-Forwarded-Encrypted: i=1; AJvYcCVfM94VITUMT25/6wv7YLY2h397RemQWaMjrQj8s2EFYovRSjtlXrv9GnZ8RIWNpAmrvJVRBB3p9fO0NmDh@lists.postgresql.org X-Gm-Message-State: AOJu0YxO6kk0gUHsVmjrhglQsUieiEn4mjmEL/8L5D9ODaA0OPObowcR AqZbff7iGpvPYWVX8MK2Brh6GQJmqE7kBsloJBSG0cmY0FLJAeu6slhyITyXgHG54JdoHowBsoQ ub1MsjDgr8JAzJOtY5MzzNLFquo9ACfI= X-Gm-Gg: AY/fxX5H5TxFnYXNW8TEh85kDCXMG6PCw0SNEN8WjZriw8w191Kbv8kr+QjNQGK6ARG elqw5tZNojwk+PBuZJtq7Gw4Tahds2zHtgR2Xcl1UcWbI6L6HAnt5SoBhVhQyVCEfwsnYVeiCVA G7/MZz/4IJBODOwQC0YbFp/ol1kzOVkQz/hisbvNb1JY8m0z0n7ipgyeIX2E3CG4kYfM6JfHHYY F9in815TjQXDHzH0NHqrhPEPGGHWsRt5ZHzUWlmaAff7YDnPIcmBHPE+pZk5xy6txS/FkR0EcJ7 F3e/6i3crAmC5f3nLk62XBsber1itDK3ATAJK4kn+Xn0/85wPzysvkh1mtqjA/MpjQLpHpkvpxq jBqdT33wFXya3i4qXF/kwAzAbId0b X-Google-Smtp-Source: AGHT+IFamf5gl0w5OPqsMZPNE2EEB9Z3aDVYFZSgfWypgXtyAeIkMWpK3n92S4Tys3KlkDlfM7ooHFnpQhn9haz4Zzw= X-Received: by 2002:a4a:d384:0:b0:65d:c57:70c6 with SMTP id 006d021491bc7-65f54f5ae19mr500751eaf.45.1767773197214; Wed, 07 Jan 2026 00:06:37 -0800 (PST) MIME-Version: 1.0 References: <202601011659.ikh4ku4p3ovb@alvherre.pgsql> In-Reply-To: From: Alexander Korotkov Date: Wed, 7 Jan 2026 10:06:23 +0200 X-Gm-Features: AQt7F2rw0cJ7hsxEGrXog5vF5v6mJyHNlGj1lvkjOw4NMpWiSR2EvxCL3N0LDDo Message-ID: Subject: Re: Implement waiting for wal lsn replay: reloaded To: Andres Freund Cc: Thomas Munro , Xuneng Zhou , =?UTF-8?Q?=C3=81lvaro_Herrera?= , Chao Li , pgsql-hackers , Michael Paquier , jian he , Tomas Vondra , Yura Sokolov Content-Type: multipart/alternative; boundary="00000000000076a9640647c7c51a" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --00000000000076a9640647c7c51a Content-Type: text/plain; charset="UTF-8" On Wed, Jan 7, 2026, 02:32 Andres Freund wrote: > Hi, > > On 2026-01-06 18:42:59 +1300, Thomas Munro wrote: > > Could this be causing the recent flapping failures on CI/macOS in > > recovery/031_recovery_conflict? I didn't have time to dig personally > > but f30848cb looks relevant: > > > > Waiting for replication conn standby's replay_lsn to pass 0/03467F58 on > primary > > error running SQL: 'psql::1: ERROR: canceling statement due to > > conflict with recovery > > DETAIL: User was or might have been using tablespace that must be > dropped.' > > while running 'psql --no-psqlrc --no-align --tuples-only --quiet > > --dbname port=25195 > > host=/var/folders/g9/7rkt8rt1241bwwhd3_s8ndp40000gn/T/LqcCJnsueI > > dbname='postgres' --file - --variable ON_ERROR_STOP=1' with sql 'WAIT > > FOR LSN '0/03467F58' WITH (MODE 'standby_replay', timeout '180s', > > no_throw);' at > /Users/admin/pgsql/src/test/perl/PostgreSQL/Test/Cluster.pm > > line 2300. > > > > https://cirrus-ci.com/task/5771274900733952 > > > > The master branch in time-descending order, macOS tasks only: > > > > task_id | substring | status > > ------------------+-----------+----------- > > 6460882231754752 | c970bdc0 | FAILED > > 5771274900733952 | 6ca8506e | FAILED > > 6217757068361728 | 63ed3bc7 | FAILED > > 5980650261446656 | ae283736 | FAILED > > 6585898394976256 | 5f13999a | COMPLETED > > 4527474786172928 | 7f9acc9b | COMPLETED > > 4826100842364928 | e8d4e94a | COMPLETED > > 4540563027918848 | b9ee5f2d | FAILED > > 6358528648019968 | c5af141c | FAILED > > 5998005284765696 | e212a0f8 | COMPLETED > > 6488580526178304 | b85d5dc0 | FAILED > > 5034091344560128 | 7dc95cc3 | ABORTED > > 5688692477526016 | bb048e31 | COMPLETED > > 5481187977723904 | d351063e | COMPLETED > > 5101831568752640 | f30848cb | COMPLETED <-- the change > > 6395317408497664 | 3f33b63d | COMPLETED > > 6741325208354816 | 877ae5db | COMPLETED > > 4594007789010944 | de746e0d | COMPLETED > > 6497208998035456 | 461b8cc9 | COMPLETED > > The failure rates of this are very high - the majority of the CI runs on > the > postgres/postgres repos failed since the change went in. Which then also > means > cfbot has a very high spurious failure rate. I think we need to revert this > change until the problem has been verified as fixed. > This is fair. I will revert the commit causing the failures in the next few hours. ------ Regards, Alexander Korotkov > --00000000000076a9640647c7c51a Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
On Wed, Jan 7, 2026, 02:32 Andres Freun= d <andres@anarazel.de> wrot= e:
Hi,

On 2026-01-06 18:42:59 +1300, Thomas Munro wrote:
> Could this be causing the recent flapping failures on CI/macOS in
> recovery/031_recovery_conflict?=C2=A0 I didn't have time to dig pe= rsonally
> but f30848cb looks relevant:
>
> Waiting for replication conn standby's replay_lsn to pass 0/03467F= 58 on primary
> error running SQL: 'psql:<stdin>:1: ERROR:=C2=A0 canceling s= tatement due to
> conflict with recovery
> DETAIL:=C2=A0 User was or might have been using tablespace that must b= e dropped.'
> while running 'psql --no-psqlrc --no-align --tuples-only --quiet > --dbname port=3D25195
> host=3D/var/folders/g9/7rkt8rt1241bwwhd3_s8ndp40000gn/T/LqcCJnsueI
> dbname=3D'postgres' --file - --variable ON_ERROR_STOP=3D1'= with sql 'WAIT
> FOR LSN '0/03467F58' WITH (MODE 'standby_replay', time= out '180s',
> no_throw);' at /Users/admin/pgsql/src/test/perl/PostgreSQL/Test/Cl= uster.pm
> line 2300.
>
> https://cirrus-ci.com/task/57712749007339= 52
>
> The master branch in time-descending order, macOS tasks only:
>
>=C2=A0 =C2=A0 =C2=A0 task_id=C2=A0 =C2=A0 =C2=A0 | substring |=C2=A0 st= atus
> ------------------+-----------+-----------
>=C2=A0 6460882231754752 | c970bdc0=C2=A0 | FAILED
>=C2=A0 5771274900733952 | 6ca8506e=C2=A0 | FAILED
>=C2=A0 6217757068361728 | 63ed3bc7=C2=A0 | FAILED
>=C2=A0 5980650261446656 | ae283736=C2=A0 | FAILED
>=C2=A0 6585898394976256 | 5f13999a=C2=A0 | COMPLETED
>=C2=A0 4527474786172928 | 7f9acc9b=C2=A0 | COMPLETED
>=C2=A0 4826100842364928 | e8d4e94a=C2=A0 | COMPLETED
>=C2=A0 4540563027918848 | b9ee5f2d=C2=A0 | FAILED
>=C2=A0 6358528648019968 | c5af141c=C2=A0 | FAILED
>=C2=A0 5998005284765696 | e212a0f8=C2=A0 | COMPLETED
>=C2=A0 6488580526178304 | b85d5dc0=C2=A0 | FAILED
>=C2=A0 5034091344560128 | 7dc95cc3=C2=A0 | ABORTED
>=C2=A0 5688692477526016 | bb048e31=C2=A0 | COMPLETED
>=C2=A0 5481187977723904 | d351063e=C2=A0 | COMPLETED
>=C2=A0 5101831568752640 | f30848cb=C2=A0 | COMPLETED <-- the change<= br> >=C2=A0 6395317408497664 | 3f33b63d=C2=A0 | COMPLETED
>=C2=A0 6741325208354816 | 877ae5db=C2=A0 | COMPLETED
>=C2=A0 4594007789010944 | de746e0d=C2=A0 | COMPLETED
>=C2=A0 6497208998035456 | 461b8cc9=C2=A0 | COMPLETED

The failure rates of this are very high - the majority of the CI runs on th= e
postgres/postgres repos failed since the change went in. Which then also me= ans
cfbot has a very high spurious failure rate. I think we need to revert this=
change until the problem has been verified as fixed.
=

This is fair. I will revert t= he commit causing the failures in the next few hours.

------
Regards,
<= div dir=3D"auto">Alexander Korotkov
--00000000000076a9640647c7c51a--