Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1tHxON-00B4w4-3e for buildfarm-members@arkaria.postgresql.org; Mon, 02 Dec 2024 03:46:35 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1tHxOJ-00DQYa-1o for buildfarm-members@arkaria.postgresql.org; Mon, 02 Dec 2024 03:46:32 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1tHxOI-00DQYS-Fr for buildfarm-members@lists.postgresql.org; Mon, 02 Dec 2024 03:46:31 +0000 Received: from mail-pg1-x52b.google.com ([2607:f8b0:4864:20::52b]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.94.2) (envelope-from ) id 1tHxOG-000aAQ-8t for buildfarm-members@lists.postgresql.org; Mon, 02 Dec 2024 03:46:29 +0000 Received: by mail-pg1-x52b.google.com with SMTP id 41be03b00d2f7-7fbc29b3145so3330401a12.0 for ; Sun, 01 Dec 2024 19:46:28 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=leadboat.com; s=google; t=1733111186; x=1733715986; darn=lists.postgresql.org; h=user-agent:in-reply-to:content-disposition:mime-version:references :message-id:subject:cc:to:from:date:from:to:cc:subject:date :message-id:reply-to; bh=TFhhr1zf/6IXw2fEpFESxwRcuNbv8UDy5Z6HRHRIIRI=; b=BuJOEwWYLvu3/MmuEsJGhTsisnRkCODIWc5G4qLk5MKG5eVPJIozvmBq9BfJPCzRQ5 QCzlj7kFtUlyNv4mzFyV8LYZW5B7Cnf06onssXVR1KsiuhIuNPn4JYVQTR4Ge20HzDKP udDSFrxWYu0RBAIKXOetHxHYOOFiULBVAyVJ4= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1733111186; x=1733715986; h=user-agent:in-reply-to:content-disposition:mime-version:references :message-id:subject:cc:to:from:date:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=TFhhr1zf/6IXw2fEpFESxwRcuNbv8UDy5Z6HRHRIIRI=; b=IBUVIkuyPsA0gV65NBtJWET2STTneiAoBMIU/bFxYOzdwv8t4xU/6SYbXOKKX+751u pwQrIzcOMA61i2/coRBWDAhWbIpJ5cxTbENG3dtxhrF9SckwQiZRR4HJXQJXg8GaVIkw fRRFH64Ifi82A1HNQ5JY9R2A2Z9bXu8on045DKJb/ipHejIBc0XzTuBhYYZ7lpKB5S+q jszTSGhfKWPY7o5bghOrLf3Bgp/yVVlte3uUHgOSIuLbtamzaidG2Olb2PvLUHzVSofM KGeRYZOSlrnVVcDu1WDDi3m2kgmYveuNRbVWAOnf8F6hw2F5negfNg6XXGS88oV+UocN P2Ug== X-Gm-Message-State: AOJu0YztbxeaJt5mNcOJTRzgaNmd7hH+pF3RR3A6KHKcsX6Ol00+Q8fS mrhD1BqNqC0HoEuemTzomu2+y20702W32WhL1jBsXgRPyPqXSnx+B2jChFStBw== X-Gm-Gg: ASbGncsn/fLrpydj8YPHK0GcVI0kySsGpn10DZAx44G/NY7smmgNplwhDWhVW7DTJWw 3GlWAVJ6+MWVTJ2eDcAg+Mjd+F/NLRBFdXk5TRQvkIJ9Ho+I2i0EbxiMrPHQnt5l4PJWNwhzt2A 31/K8m9xF8yCzO+4cNZJMGzah9tKn927ILTKu+Zael10130F1eN+aSJ7ItXNLqylkYxoGeMV6Mg H7ipn1km0w4HG2KBexXRYpWRujp4PDY/FLn4JSiuzwj/sDHlhEN X-Google-Smtp-Source: AGHT+IFukjp03naz6Cx/Xb/BL/thxtfF99cyTcexWeh9MKssXbOwOyqoPROtE9zIMp07rHEOESXsZg== X-Received: by 2002:a05:6a21:7893:b0:1d9:d5e:8297 with SMTP id adf61e73a8af0-1e0ec80bb10mr27870010637.6.1733111186409; Sun, 01 Dec 2024 19:46:26 -0800 (PST) Received: from rfd.leadboat.com ([2600:1702:a20:5750::48]) by smtp.gmail.com with ESMTPSA id d2e1a72fcca58-725417fbfc3sm7600588b3a.123.2024.12.01.19.46.25 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Sun, 01 Dec 2024 19:46:26 -0800 (PST) Date: Sun, 1 Dec 2024 19:46:23 -0800 From: Noah Misch To: Tomas Vondra Cc: buildfarm-members@lists.postgresql.org Subject: Re: strange git problems on turaco Message-ID: <20241202034623.39@rfd.leadboat.com> References: <6a705172-5b28-4023-a40e-fb7805c717c4@vondra.me> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <6a705172-5b28-4023-a40e-fb7805c717c4@vondra.me> User-Agent: Mutt/2.2.12 (2023-09-09) List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk On Mon, Dec 02, 2024 at 02:20:35AM +0100, Tomas Vondra wrote: > turaco seems to be having some strange git issues - some of the > buildfarm runs fail like this: > > > turaco:REL_16_STABLE [22:41:11] OK > Sun Dec 1 22:41:27 2024: buildfarm run for turaco:REL_17_STABLE starting > turaco:REL_17_STABLE [22:41:27] checking out source ... > Missing checked out branch bf_REL_17_STABLE: > fatal: not a git repository (or any parent up to mount point /mnt) > Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set). > turaco:REL_17_STABLE [22:41:32] failed at stage pgsql-Git > Sun Dec 1 22:41:33 2024: buildfarm run for turaco:HEAD starting > turaco:HEAD [22:41:33] checking out source ... > > > I initially suspected this might be due to aging storage (SD card on > rpi), but I replaced that, and there's nothing strange in dmesg. Also, > other branches seem to be working fine ... > > Any ideas what could be causing this? I had this happen ~9 times on the host of my AIX buildfarm members. Example: https://buildfarm.postgresql.org/cgi-bin/show_log.pl?nm=mandrill&dt=2024-07-10%2019%3A51%3A28 I figured it was some system problem, so I didn't root-cause it. I carry the following workaround in my fork of the buildfarm client code. The unknown problem caused failure reports and work stoppage ~4 times before I installed this workaround, then logs show the workaround prevented damage 5 times. The last "removed intruder .git" log message appeared on 2024-07-23. There was no kernel reboot, and logs don't point to buildfarm client processes getting involuntary termination, either. diff --git a/PGBuild/SCM.pm b/PGBuild/SCM.pm index dcfd180..2cd610a 100644 --- a/PGBuild/SCM.pm +++ b/PGBuild/SCM.pm @@ -1059,9 +1059,19 @@ sub _update_target my @gitlog; # If a run crashed during copy_source(), repair. - if (-d "./git-save" && !-d "$target/.git") + if (-d "./git-save") { + # As of 2024-07-13, the following has happened about four times in the + # last month, to different gcc111 animals. Despite no known crash, + # there's a git-save directory containing the proper git repo, and + # there's a bogus .git missing most content. Remove the bogus one. + # This is deeply hacky, but it beats buildfarm report noise and manual + # intervention. + if (rmtree("$target/.git") > 0) { + print "removed intruder .git\n" if $verbose; + } move "./git-save", "$target/.git"; + print "restored git-save\n" if $verbose; } chdir $target;