Subject: Re: debug a lockup
From: Scott Ribe
Date: Tue, 10 Feb 2026 17:12:19 -0700
To: Aislan Luiz Wendling, Tom Lane
Cc: Pgsql-admin

OK, we figured it out--I think.

pgbench was stuck in restart_syscall(<...resuming interrupted read...
it was set to open 100 connections
there were ~20 pg sessions in idle, and the last one (highest pid) in auth
that one was in write to fd 2

So...

This is running in Kubernetes. I was doing some load testing against a
storage service (thus 100 connections). PG was launched manually in a
bash session connected to the pod, in k9s. There were ~20 total bash
sessions open in k9s across 15 nodes.

Theory: k9s glitched and stopped reading the piped file descriptor, the
buffer filled, and PG blocked on the write.
(I have seen prior evidence of less-than-perfect handling of output by
k9s.) In particular, I had logging of connections on, so at auth it
would have been writing to stderr.

This happened in one of probably over 100 runs of the same test, so it
is not readily reproducible, and I wanted to autopsy it before killing
off the hung processes. Unless someone pokes a hole in my theory, at
this point I think it is neither pgbench nor PG nor Pure/Portworx at
fault.

--
Scott Ribe
scott_ribe@elevated-dev.com
https://www.linkedin.com/in/scottribe/
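
For anyone who wants to see the mechanism in the theory above in isolation, here is a minimal Python sketch (not what was actually run on the pod). It fills a pipe that nobody is reading and records the point at which the kernel stops accepting data. The function name and chunk size are made up for illustration, and the descriptors are set non-blocking only so the demo can return; a process writing to a blocking stderr, as assumed for the stuck backend, would simply hang in write() at that point.

```python
import os


def pipe_backpressure_demo(chunk_size: int = 4096):
    """Fill an unread pipe, then drain it and show that writes resume.

    Returns (bytes accepted before the pipe was full,
             bytes accepted by one write after draining).
    """
    read_fd, write_fd = os.pipe()
    # Non-blocking mode is only for the demo: it turns "would hang forever"
    # into a BlockingIOError we can observe.
    os.set_blocking(write_fd, False)
    os.set_blocking(read_fd, False)
    chunk = b"x" * chunk_size
    try:
        accepted = 0
        while True:
            try:
                accepted += os.write(write_fd, chunk)
            except BlockingIOError:
                # Pipe buffer is full and the reader is not reading.
                # A blocking writer would be stuck in write() right here.
                break

        # Play the part of a reader that finally wakes up and drains the pipe.
        while True:
            try:
                os.read(read_fd, 65536)
            except BlockingIOError:
                break  # nothing left to read

        # With the buffer empty, the writer can make progress again.
        resumed = os.write(write_fd, chunk)
        return accepted, resumed
    finally:
        os.close(read_fd)
        os.close(write_fd)


if __name__ == "__main__":
    accepted, resumed = pipe_backpressure_demo()
    print(f"pipe accepted {accepted} bytes before filling")
    print(f"after draining, the next write accepted {resumed} bytes")
```

On Linux the default pipe capacity is usually 64 KiB, so with connection logging on and ~100 connections arriving, it would not take much output to reach the point where the next write to fd 2 blocks.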