Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1rpPSa-006ZaC-W6 for pgsql-hackers@arkaria.postgresql.org; Wed, 27 Mar 2024 09:20:41 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1rpPSZ-00CNiH-KK for pgsql-hackers@arkaria.postgresql.org; Wed, 27 Mar 2024 09:20:39 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1rpPSZ-00CNi9-1I for pgsql-hackers@lists.postgresql.org; Wed, 27 Mar 2024 09:20:39 +0000 Received: from mail-ed1-x52f.google.com ([2a00:1450:4864:20::52f]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.94.2) (envelope-from ) id 1rpPST-006g9L-Ue for pgsql-hackers@postgresql.org; Wed, 27 Mar 2024 09:20:37 +0000 Received: by mail-ed1-x52f.google.com with SMTP id 4fb4d7f45d1cf-56845954ffeso9157166a12.2 for ; Wed, 27 Mar 2024 02:20:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1711531233; x=1712136033; darn=postgresql.org; h=user-agent:in-reply-to:content-disposition:mime-version:references :subject:cc:to:from:date:sender:message-id:from:to:cc:subject:date :message-id:reply-to; bh=T/oFh7B2l3q7t7hZOgYp4Gs/LtgAVIU8GyhUiIpo4Qc=; b=jw5k8mBxvB2HqlG92dCgRRuqcxxUaKI038e0WX2q2tJOgsLQ1XakRHvFOpfK5aiv9K fwrcMRUK5w2fLKyD5xOhqw5tnY8neG/BccY1bc1yJtdyzy9VfGQBBXnd7/vHzC2UW0J9 Jhp8tZgYVunwvt6QQyYLAeRVwZ8eiYBC00131xn8u3T8PZN/UrLrG8xOZ6wI7ocvOe/h lRfR7A+nAM2GNQhQTlk+xGf+e1eTWxpoinphp3etDcXCwPqvGjBdNhOTEhkd+TH5C1CT cynJdaGMT1iuHZM8AuJLdNjphB8PrdTOeUj9W0BEg5ORg1yA6rS5/oDVgK3bCaT1NL6F k+Bg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1711531233; x=1712136033; h=user-agent:in-reply-to:content-disposition:mime-version:references :subject:cc:to:from:date:sender:message-id:x-gm-message-state:from :to:cc:subject:date:message-id:reply-to; bh=T/oFh7B2l3q7t7hZOgYp4Gs/LtgAVIU8GyhUiIpo4Qc=; b=PlEp43MIK6aM4v2aDYLfJonXPnYrr+grvRyxGQ3FlYSYw+HzzCsVw8/KSJ1KkTJP8g WO+zZq0NsWrEuOaUEqFMnitktle65BRByJjOopGWCQmRbzrtmWG1Ulm5utDM2TMALiht EZbj9zsno7/PQqEgPS6fpjINAr+LDYLb8H1iaj+FiZIVTdrR+mKWfYnGRUMhgeOccHr2 pdNDpBMlqPge+A0C6oP4edYuVvnXrlLLYovrZzarb3pdO4quyDwjXV5rkT5mool2G7Tt AbyGq14QFJyimqjdbnrcqUbzsX0TYIl3eEyjuENJIbNSwjinbi5wRGYOo+EmxqBB1Xwl fDNA== X-Forwarded-Encrypted: i=1; AJvYcCXrZygxl8rNNaszCJlN7qdlukD06OjH1B5rfB4CMW2p4Zooxz5XQWBbr4JSI91o0o93kW9MDY8XMQnlxdl9l8z/kemYV0IrTPw2hRrb X-Gm-Message-State: AOJu0YztJ/MwRYHZnQELibODjQyCUoYgfFhlJQW5cqIRmzhSn7uS8p4c UllYJPBo5WF8qTD0P+KhP/C8Cq2vQRBHSAzO0X/G+JSa7Jumaw3n X-Google-Smtp-Source: AGHT+IH9JVrooG29kryMJb44fmT+fjdPOANq8U7lZCkE+ypyAmMQzncKE9AUQyC9cciVnU5+2yLZIA== X-Received: by 2002:a50:8e09:0:b0:568:bc48:5f27 with SMTP id 9-20020a508e09000000b00568bc485f27mr459991edw.39.1711531233085; Wed, 27 Mar 2024 02:20:33 -0700 (PDT) Received: from lightning.caipicrew.dd-dns.de ([2001:a61:a0e:9401:f815:e42b:c961:9a62]) by smtp.gmail.com with ESMTPSA id a9-20020a50ff09000000b0056c0a3d91easm3646148edu.12.2024.03.27.02.20.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 27 Mar 2024 02:20:32 -0700 (PDT) Message-ID: <6603e4e0.500a0220.a557f.4f39@mx.google.com> X-Google-Original-Message-ID: <20240327092031.GQ17618@caipicrew.dd-dns.de;lightning.caipicrew.dd-dns.de> Sender: Michael Banck Received: from mbanck by lightning.caipicrew.dd-dns.de with local (Exim 4.92) (envelope-from ) id 1rpPSR-0005rC-GG; Wed, 27 Mar 2024 10:20:31 +0100 Date: Wed, 27 Mar 2024 10:20:31 +0100 From: Michael Banck To: Tom Lane Cc: Laurenz Albe , Nathan Bossart , vignesh C , "Kumar, Sachin" , Robins Tharakan , Jan Wieck , Bruce Momjian , Andrew Dunstan , Magnus Hagander , Peter Eisentraut , "pgsql-hackers@postgresql.org" Subject: Re: pg_upgrade failing for 200+ million Large Objects References: <83D44BE5-0088-4D41-8AE6-20A05D026F46@amazon.com> <81D13E16-BA04-43CF-9B89-B8924300B211@amazon.com> <240D05EC-8B28-4112-BEAB-85ECBAF3F871@amazon.com> <2055911.1702258962@sss.pgh.pa.us> <557FD681-3929-44A1-87B2-6B5E10C4A66B@amazon.com> <842242.1706287466@sss.pgh.pa.us> <4a3ebf7d81bfc6dd4d545e5b27d6e8f6c32d8937.camel@cybertec.at> <3023817.1710629175@sss.pgh.pa.us> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <3023817.1710629175@sss.pgh.pa.us> User-Agent: Mutt/1.10.1 (2018-07-13) List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk Hi, On Sat, Mar 16, 2024 at 06:46:15PM -0400, Tom Lane wrote: > Laurenz Albe writes: > > On Fri, 2024-03-15 at 19:18 -0400, Tom Lane wrote: > >> This patch seems to have stalled out again. In hopes of getting it > >> over the finish line, I've done a bit more work to address the two > >> loose ends I felt were probably essential to deal with: > > > Applies and builds fine. > > I didn't scrutinize the code, but I gave it a spin on a database with > > 15 million (small) large objects. I tried pg_upgrade --link with and > > without the patch on a debug build with the default configuration. > > Thanks for looking at it! > > > Without the patch: > > Runtime: 74.5 minutes > > > With the patch: > > Runtime: 70 minutes > > Hm, I'd have hoped for a bit more runtime improvement. I also think that this is quite a large runtime for pg_upgrade, but the more important savings should be the memory usage. > But perhaps not --- most of the win we saw upthread was from > parallelism, and I don't think you'd get any parallelism in a > pg_upgrade with all the data in one database. (Perhaps there is more > to do there later, but I'm still not clear on how this should interact > with the existing cross-DB parallelism; so I'm content to leave that > question for another patch.) What is the status of this? In the commitfest, this patch is marked as "Needs Review" with Nathan as reviewer - Nathan, were you going to take another look at this or was your mail from January 12th a full review? My feeling is that this patch is "Ready for Committer" and it is Tom's call to commit it during the next days or not. I am +1 that this is an important feature/bug fix to have. Because we have customers stuck on older versions due to their pathological large objects usage, I did some benchmarks (jsut doing pg_dump, not pg_upgarde) a while ago which were also very promising; however, I lost the exact numbers/results. I am happy to do further tests if that is required for this patch to go forward. Also, is there a chance this is going to be back-patched? I guess it would be enough if the ugprade target is v17 so it is less of a concern, but it would be nice if people with millions of large objects are not stuck until they are ready to ugprade to v17. Michael