Date: Mon, 2 Dec 2024 14:51:29 +0100
From: Tomas Vondra
To: Noah Misch
Cc: buildfarm-members@lists.postgresql.org
Subject: Re: strange git problems on turaco
In-Reply-To: <20241202034623.39@rfd.leadboat.com>
References: <6a705172-5b28-4023-a40e-fb7805c717c4@vondra.me> <20241202034623.39@rfd.leadboat.com>

On 12/2/24 04:46, Noah Misch wrote:
> On Mon, Dec 02, 2024 at 02:20:35AM +0100, Tomas Vondra wrote:
>> turaco seems to be having some strange git issues - some of the
>> buildfarm runs fail like this:
>>
>>
>> turaco:REL_16_STABLE [22:41:11] OK
>> Sun Dec 1 22:41:27 2024: buildfarm run for turaco:REL_17_STABLE starting
>> turaco:REL_17_STABLE [22:41:27] checking out source ...
>> Missing checked out branch bf_REL_17_STABLE:
>> fatal: not a git repository (or any parent up to mount point /mnt)
>> Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).
>> turaco:REL_17_STABLE [22:41:32] failed at stage pgsql-Git
>> Sun Dec 1 22:41:33 2024: buildfarm run for turaco:HEAD starting
>> turaco:HEAD [22:41:33] checking out source ...
>>
>>
>> I initially suspected this might be due to aging storage (SD card on
>> rpi), but I replaced that, and there's nothing strange in dmesg. Also,
>> other branches seem to be working fine ...
>>
>> Any ideas what could be causing this?
>
> I had this happen ~9 times on the host of my AIX buildfarm members. Example:
> https://buildfarm.postgresql.org/cgi-bin/show_log.pl?nm=mandrill&dt=2024-07-10%2019%3A51%3A28
>
> I figured it was some system problem, so I didn't root-cause it. I carry the
> following workaround in my fork of the buildfarm client code. The unknown
> problem caused failure reports and work stoppage ~4 times before I installed
> this workaround, then logs show the workaround prevented damage 5 times. The
> last "removed intruder .git" log message appeared on 2024-07-23. There was no
> kernel reboot, and logs don't point to buildfarm client processes getting
> involuntary termination, either.
>

Thanks. I suspect some system issue too, but I didn't want to blame the
system without some kind of proof.

I applied your patch, let's see if that helped after a couple runs.


regards

-- 
Tomas Vondra
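
For anyone hitting the same "not a git repository" failure, here is a minimal diagnostic sketch for checking what the `.git` entry in a checkout directory looks like between runs. It is not the workaround mentioned above (that patch is not reproduced in this message) and it is not part of the buildfarm client. The function name `classify_git_entry` and the returned labels are hypothetical, chosen only for illustration. It relies on two general git facts: a repository directory contains a `HEAD` file, and a linked worktree uses a `.git` file holding a `gitdir: <path>` line.

```python
import os


def classify_git_entry(checkout_dir):
    """Describe the state of the .git entry directly inside checkout_dir.

    Returns one of the (hypothetical) labels: "missing", "directory",
    "incomplete-directory", "gitfile", "dangling-gitfile", "unrecognized".
    """
    git_path = os.path.join(checkout_dir, ".git")

    # lexists() is also true for a broken symlink, so "missing" really
    # means that nothing at all is present under that name.
    if not os.path.lexists(git_path):
        return "missing"

    if os.path.isdir(git_path):
        # A usable repository directory contains at least a HEAD file.
        has_head = os.path.isfile(os.path.join(git_path, "HEAD"))
        return "directory" if has_head else "incomplete-directory"

    if os.path.isfile(git_path):
        # Linked worktrees use a .git *file* holding "gitdir: <path>".
        with open(git_path, "r", encoding="utf-8", errors="replace") as fh:
            first_line = fh.readline().strip()
        if not first_line.startswith("gitdir:"):
            return "unrecognized"
        target = first_line[len("gitdir:"):].strip()
        if not os.path.isabs(target):
            # Relative gitdir paths are resolved against the checkout dir.
            target = os.path.join(checkout_dir, target)
        return "gitfile" if os.path.isdir(target) else "dangling-gitfile"

    return "unrecognized"
```

Calling it on the failing branch's checkout directory right after a failed run, and logging the result, might help tell a vanished `.git` apart from one that exists but is incomplete or points at a location that is gone.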