Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1iBR7p-0004q9-SU for pgsql-hackers@arkaria.postgresql.org; Fri, 20 Sep 2019 22:11:38 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.89) (envelope-from ) id 1iBR7m-0008EK-7y for pgsql-hackers@arkaria.postgresql.org; Fri, 20 Sep 2019 22:11:34 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1iBR7l-0008E9-Uw for pgsql-hackers@lists.postgresql.org; Fri, 20 Sep 2019 22:11:33 +0000 Received: from wout3-smtp.messagingengine.com ([64.147.123.19]) by magus.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1iBR7i-0003qa-Lc for pgsql-hackers@lists.postgresql.org; Fri, 20 Sep 2019 22:11:33 +0000 Received: from compute5.internal (compute5.nyi.internal [10.202.2.45]) by mailout.west.internal (Postfix) with ESMTP id 35CCD542; Fri, 20 Sep 2019 18:11:27 -0400 (EDT) Received: from mailfrontend2 ([10.202.2.163]) by compute5.internal (MEProxy); Fri, 20 Sep 2019 18:11:27 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=anarazel.de; h= date:in-reply-to:references:mime-version:content-type :content-transfer-encoding:subject:to:cc:from:message-id; s=fm2; bh=UguPoHaz2s9S+R03ZN3VS8F3DLha6fa8i89Shf1aIgc=; b=NNKLP8zJaYrS 0MNwIgzwkcqXJSFa10nd0Fxn3+0iH13t/nTaE1EbOuR35qhLAEG1itW+DzyR8kFG 1cOBZE3oZhnA1SKd+4rVK51vtnolsX5VGuB8FQ7HGAftiOR6eUdDA7e/33Xbm8v+ etVREwsU6IOUL4E+c8O47ovTW1xbTNZJmcdZddVjesRODDlqXO6l/L7zgSgwatEm GrPu6Hx/3uF/lnCEmmWt21B9L+gayGSWArzt8+mz5vuoPHgwsdcAkF1VREiPzSI0 9diInBfuDrqCiVx4TMoRFxPMnj2Xm7bXFSkDxTPaOxHDtAvP1ltUCVg22lI6U5MS U61ftNzvGA== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender :x-sasl-enc; s=fm3; bh=UguPoHaz2s9S+R03ZN3VS8F3DLha6fa8i89Shf1aI gc=; b=OGQWXJoe/dgVJ9VGqXlReroFP6HAEXV18Sb9SrYgU3tP8nlqeY15CqOIA hh7AFnnZuVEAtiG8Ug1kYPkTXy6oR1slCyWT+rPgOd5G+dRIFeADZODfckBvyHLO hNGi2NvHqWvGEWUu2lW85Mtn3bOkoqm+KM5iVyASexsyXFqgq4A0feW98di8iVo0 tnqTHojoeUwBeihIRvbxYyhManyLFIWz2lK/1qNbB65Zv6jnwKmx63HGrN7CbicK CWD/iN/xpoPur/cUxQk5PzNfek0j2HWQ5+r0xZ1I2eWUunSxKIjkbePDzwTsti/5 wQ/+QFIcS+35ChCt93e+Ss5qMwyVA== X-ME-Sender: X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedufedrvdefgddtjecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivghnthhsucdlqddutddtmdenuc fjughrpefffggjfhggtgfguffvhffksehtqhhmtddtreejnecuhfhrohhmpeetnhgurhgv shcuhfhrvghunhguuceorghnughrvghssegrnhgrrhgriigvlhdruggvqeenucfkphepie ejrdduiedtrddvudekrddvfeejnecurfgrrhgrmhepmhgrihhlfhhrohhmpegrnhgurhgv shesrghnrghrrgiivghlrdguvgenucevlhhushhtvghrufhiiigvpedt X-ME-Proxy: Received: from ahand.lan (c-67-160-218-237.hsd1.ca.comcast.net [67.160.218.237]) by mail.messagingengine.com (Postfix) with ESMTPA id 1BE3FD6005E; Fri, 20 Sep 2019 18:11:24 -0400 (EDT) Date: Fri, 20 Sep 2019 15:11:22 -0700 User-Agent: K-9 Mail for Android In-Reply-To: <20190920220620.GA5214@alvherre.pgsql> References: <20190920220620.GA5214@alvherre.pgsql> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Subject: Re: subscriptionCheck failures on nightjar To: Alvaro Herrera ,Tom Lane CC: Andrew Dunstan ,Kuntal Ghosh ,Michael Paquier ,Tomas Vondra ,Robert Haas ,Thomas Munro ,PostgreSQL Hackers From: Andres Freund Message-ID: List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Precedence: bulk Hi,=20 On September 20, 2019 3:06:20 PM PDT, Alvaro Herrera wrote: >On 2019-Sep-20, Tom Lane wrote: > >> Actually, what I did was as attached [1], and I am getting traces >like >> [2]=2E The problem seems to occur only when there are two or three >> processes concurrently creating the same snapshot file=2E It's not >> obvious from the debug trace, but the snapshot file *does* exist >> after the music stops=2E > >Uh =2E=2E I didn't think it was possible that we would build the same >snapshot file more than once=2E Isn't that a waste of time anyway?=20 >Maybe >we can fix the symptom by just not doing that in the first place? >I don't have a strategy to do that, but seems worth considering before >retiring the bf animals=2E We try to avoid it, but the check is racy=2E Check comments in SnapBuildSe= rialize=2E We could introduce locking etc to avoid that, but that seems ove= rkill, given that were really just dealing with a broken os=2E Andres --=20 Sent from my Android device with K-9 Mail=2E Please excuse my brevity=2E