Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1iAJQD-0004oY-FT for pgsql-hackers@arkaria.postgresql.org; Tue, 17 Sep 2019 19:45:57 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.89) (envelope-from ) id 1iAJQC-0000oM-7C for pgsql-hackers@arkaria.postgresql.org; Tue, 17 Sep 2019 19:45:56 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.89) (envelope-from ) id 1iAJQB-0000o6-OA for pgsql-hackers@lists.postgresql.org; Tue, 17 Sep 2019 19:45:55 +0000 Received: from mail-wr1-x442.google.com ([2a00:1450:4864:20::442]) by makus.postgresql.org with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1iAJPW-0001BX-HV for pgsql-hackers@lists.postgresql.org; Tue, 17 Sep 2019 19:45:54 +0000 Received: by mail-wr1-x442.google.com with SMTP id o18so4386060wrv.13 for ; Tue, 17 Sep 2019 12:45:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=2ndquadrant-com.20150623.gappssmtp.com; s=20150623; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=N9XBef8EV1bENws8/Kp2dnvTvPJfTeJwtHWNPm6Ycqs=; b=lmTrARVsa+BRffzWaeRMNc8oZFCrtujaZxJDFR0rsLcODWL+d8bHuzD/RRoPBS10DD 4IAhrZfUUb6LxyaWD590f5aNWqcb7j7X137dHFHfRLsakM0ewWBXbkVFTcuOjWVKX7Z6 ER89Lx+gmCs6MbBM27EayN/SjOAFIvb9GGQOJDZ1t0FlwN3xnexm2xP4xkjQ0MW9QHzo aiO4HkIWKpgrYtO2crPyGV0Lsut6pZ90vKrXBntKPBwu4L3QU2EphPbpBgVLpppOKt2U JgW98Kfb8a+E8hzAykCyDccWSddS5SYuG8eLeJbkamuvSF+8xBJE29t0fkdkWj180hgk Rvrw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=N9XBef8EV1bENws8/Kp2dnvTvPJfTeJwtHWNPm6Ycqs=; b=gnUch8mB6c6Ug4as/a2QeGN3F4C9wYRdb/o/pqn6sASNXdM1tdN+U79J0FjoCmjwgz bAhBIP6FYRvJKgY7DksBR+rAuLiq/Uy33yTQniEgt8ccmd0VggKIuCGvo4vsqI+s8y/c jweEBSd4rkt3CNAvx0k0IEUKlGgz/YKqHeT9XUkrixYG8cSl2ZdjkYhCPHqNpENd6gPD 9pmkL/Bcx0mD/2ofSRMoqH0gYcTNC8YNxb091wZQp4OurYVPyB+T/gC6MH7QcdMgIqGh 7gmMEf7f06UVS1XxQ3zvTx67hJkeHktSw/mKgCChuduRWnR82nymFvRwukluq7RgVF5R hByg== X-Gm-Message-State: APjAAAVtNP3tNnzAp4n6B6IxxIBLgZIh9j9Vx3qp2COFhnWfCsANbAEL kpMDFlMvTZmOSqnDULiaMhcdKw== X-Google-Smtp-Source: APXvYqwHFR2OwYv855nk8zVtM9iIc/MFDlDVLgppcdCV2wglm2RrendrU7D0KPuihIxyJTChR5SLdQ== X-Received: by 2002:a5d:430f:: with SMTP id h15mr202129wrq.177.1568749512230; Tue, 17 Sep 2019 12:45:12 -0700 (PDT) Received: from localhost (ip-86-49-253-183.net.upcbroadband.cz. [86.49.253.183]) by smtp.gmail.com with ESMTPSA id e20sm5505676wrc.34.2019.09.17.12.45.11 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 17 Sep 2019 12:45:11 -0700 (PDT) Date: Tue, 17 Sep 2019 21:45:10 +0200 From: Tomas Vondra To: Tom Lane Cc: Robert Haas , Michael Paquier , Andres Freund , Thomas Munro , PostgreSQL Hackers Subject: Re: subscriptionCheck failures on nightjar Message-ID: <20190917194510.iqwyl3be62pz7l27@development> References: <20190213181225.fathyapig4sm4exa@alap3.anarazel.de> <31663.1550082243@sss.pgh.pa.us> <20190213183303.ns54frt7cmvo6pgg@alap3.anarazel.de> <1466.1550085086@sss.pgh.pa.us> <20190213215147.cjbymfojf6xndr4t@alap3.anarazel.de> <20190813080435.GL2551@paquier.xyz> <20190826132904.3ayuw36qzl2c4ktr@development> <7361.1568738373@sss.pgh.pa.us> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <7361.1568738373@sss.pgh.pa.us> User-Agent: NeoMutt/20180716 List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Precedence: bulk On Tue, Sep 17, 2019 at 12:39:33PM -0400, Tom Lane wrote: >Robert Haas writes: >> On Mon, Aug 26, 2019 at 9:29 AM Tomas Vondra >> wrote: >>> This is one of the remaining open items, and we don't seem to be moving >>> forward with it :-( > >> Why exactly is this an open item, anyway? > >The reason it's still here is that Andres expressed a concern that >there might be more than meets the eye in this. What meets the eye >is that PANICing on file-not-found is not appropriate here, but Andres >seemed to think that the file not being present might reflect an >actual bug not just an expectable race condition [1]. > >Personally I'd be happy just to treat it as an expectable case and >fix the code to not PANIC on file-not-found. > FWIW I agree with Andres that there probably is an actual bug. The file should not just disappear like this, it's clearly unexpected so the PANIC does not seem entirely inappropriate. I've tried reproducing the issue on my local systems, with the extra sleeps between fsyncs and so on, but I haven't managed to trigger it so far :-( >In either case, it probably belongs in the "older bugs" section; >nightjar is showing the same failure on v11 from time to time. > Yes, it should be moved to the older section - it's clearly a v11 bug. regards -- Tomas Vondra http://www.2ndQuadrant.com PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services