Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1vVWM0-004geQ-0m for pgsql-hackers@arkaria.postgresql.org; Tue, 16 Dec 2025 14:48:45 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1vVWLy-0079gI-2G for pgsql-hackers@arkaria.postgresql.org; Tue, 16 Dec 2025 14:48:43 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1vVWLy-0079gA-1J for pgsql-hackers@lists.postgresql.org; Tue, 16 Dec 2025 14:48:43 +0000 Received: from goedel.df7cb.de ([49.13.90.212]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1vVWLw-0013qd-2N for pgsql-hackers@lists.postgresql.org; Tue, 16 Dec 2025 14:48:42 +0000 Received: from msg.df7cb.de (unknown [IPv6:2a02:908:1472:9340:f0ad:fc6e:9c86:f1dc]) by goedel.df7cb.de (Postfix) with ESMTPSA id DF1662B4D; Tue, 16 Dec 2025 14:48:38 +0000 (UTC) Date: Tue, 16 Dec 2025 15:48:38 +0100 From: Christoph Berg To: Tomas Vondra Cc: Jakub Wartak , pgsql-hackers@lists.postgresql.org Subject: Re: failed NUMA pages inquiry status: Operation not permitted Message-ID: References: <7bbc582b-cc70-4a6f-bbf2-b5fd9b13a867@vondra.me> <54329add-59b6-4c08-96f0-a025a7804174@vondra.me> <4ff9578d-1de2-45c1-98c4-29caf99334ff@vondra.me> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk Re: To Tomas Vondra > I've managed to reproduce it once, running this loop on > 18-as-of-today. It errored out after a few 100 iterations: > > while psql -c 'SELECT COUNT(*) >= 0 AS ok FROM pg_shmem_allocations_numa'; do :; done > > 2025-12-16 11:49:35.982 UTC [621807] myon@postgres ERROR: invalid NUMA node id outside of allowed range [0, 0]: -2 > 2025-12-16 11:49:35.982 UTC [621807] myon@postgres STATEMENT: SELECT COUNT(*) >= 0 AS ok FROM pg_shmem_allocations_numa > > That was on the apt.pg.o amd64 build machine while a few things were > just building. Maybe ENOENT "The page is not present" means something > was just swapped out because the machine was under heavy load. I played a bit more with it. * It seems to trigger only once for a running cluster. The next one needs a restart * If it doesn't trigger within the first 30s, it probably never will * It seems easier to trigger on a system that is under load (I started a few pgmodeler compile runs in parallel (C++)) But none of that answers the "why". Christoph