Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1tgWQO-00BPMR-0c for pgsql-general@arkaria.postgresql.org; Fri, 07 Feb 2025 22:02:12 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1tgWQL-0069C0-Uq for pgsql-general@arkaria.postgresql.org; Fri, 07 Feb 2025 22:02:09 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1tgWQL-0069Bs-Fo for pgsql-general@lists.postgresql.org; Fri, 07 Feb 2025 22:02:09 +0000 Received: from mail-qv1-xf2c.google.com ([2607:f8b0:4864:20::f2c]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from ) id 1tgWQI-004RS5-0I for pgsql-general@postgresql.org; Fri, 07 Feb 2025 22:02:09 +0000 Received: by mail-qv1-xf2c.google.com with SMTP id 6a1803df08f44-6e436c59113so22528656d6.3 for ; Fri, 07 Feb 2025 14:02:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=princeton.edu; s=google; t=1738965725; x=1739570525; darn=postgresql.org; h=to:references:message-id:content-transfer-encoding:cc:date :in-reply-to:from:subject:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=1oNW45ZoB7vARQks8dSz+5MjjUiBNH4808YQPazPzOI=; b=Ht7KE3TjAQor+YCbwPWcg+DiafMKvqNrcH5ejfsyV6uamq+Z0CaUJyqsQznoLRvame SSYpyi/Q8ptvtqBmX+IxqSRYecYPSqmGVRdMgqD9UgwfMfNqJCAJkJdeAuhjyIXpdOdZ O8R2RRM9XR72uIkm8s0cLbtVzd4w6IWnB60e0rg+jlYt52d0xn+eCU8wInBhbpitCgPR vg2ndbGmfD4ofP38PCq2vN1gaFjurq65f1JXNXFc6j0TcQSQiI7Lp4Nd9F9OG/RzuRx2 R6j2zCaTebFQtALpW8ifj/0IY63lGk+aEFz1ci//q1kn9wujOWgfQjbZ6ij/4YGSDv0x rxJA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1738965725; x=1739570525; h=to:references:message-id:content-transfer-encoding:cc:date :in-reply-to:from:subject:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=1oNW45ZoB7vARQks8dSz+5MjjUiBNH4808YQPazPzOI=; b=udwP2gGMDHCljh1ud/5ADkF7DAxU9wcfbLksCONhI6LCPxONz4iSLQZJOCF2Si5Uhf XklGdH1NrVwnEvVsYcehduAkGFSugCf8rcDAW/XGZKp9k1XqnO2y6XsaDn/5xra1wCwJ O0L5T17VicCTurqq8mDxIMGLbegTgoxlud9kBJSfnpbPiWhKSczEqlWW3vpSD3KFwy6B RTNyI5+fy0qQoPX8xikwTEvAKEujqV2k68ooQoC/nB3WcRGD2J52emC/DEG70t+Y0G6I aytzwWectdcn8mP0lDq8UPt5U1hkEHaPP+0uJVBeM9gupbHUCOTh+MiTfcUuoGqAf9Cv hpuQ== X-Gm-Message-State: AOJu0YzmApRdNV4yz9Q2QEl9pPgPpyIT0CK3N3o9CEgS7ZhfyXute7sk EQWH5cqHBqxIBfFNjAPrepBZKPuVOQlWoVDfx87Dt/diqcT3mE767gt3pS4oEZehlvpzlnJU1n8 y X-Gm-Gg: ASbGncvTwBZjMcayZ/zWvOiUW5hMlgj5zfXqgQwvVTxUClrd1RrFUm41VPCgN344Eti qdsSdeP7pjPeRe9xRmGWkkO5zH9SqhAQZKniQd8+SYqDPB+JbI0Qu4sbe/VohyXhrbzEChbFzKS 8+sknLEf3w5YKGGSmz4V3z7dY9ailZGuNqXjjk4y9qWlUuo5Y+jhMikBs3OThA/p+sFxkmYIruD y140Azt6/4SD3K57KbOdLjPat0bJYF+VbOzSBXTSMYaSXzLPbwVmEkuBu2OdaoW/ool1WnNeVD1 lFb9seX4HzjDoCku25Si4owl/u8BXXbOKQ5fKCCR11Um3OvQGaJuPKjN8A== X-Google-Smtp-Source: AGHT+IGYgepakZjOK9uinoMOhzNPPKe/RVf3IrCLnG+gVEcupgL2t6L7N0RqkRjWcJCE3ye/2PxOaA== X-Received: by 2002:a05:6214:cc9:b0:6e1:f40c:b558 with SMTP id 6a1803df08f44-6e445768bd6mr60693156d6.44.1738965724948; Fri, 07 Feb 2025 14:02:04 -0800 (PST) Received: from smtpclient.apple (gen-rl-imac.princeton.edu. [128.112.117.2]) by smtp.gmail.com with ESMTPSA id 6a1803df08f44-6e43ba36c3csm21146306d6.38.2025.02.07.14.02.04 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Fri, 07 Feb 2025 14:02:04 -0800 (PST) Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.120.41.1.1\)) Subject: Re: How to perform a long running dry run transaction without blocking From: Robert Leach In-Reply-To: Date: Fri, 7 Feb 2025 17:02:03 -0500 Cc: pgsql-general@postgresql.org Content-Transfer-Encoding: quoted-printable Message-Id: <15A999B4-9D53-4F35-84B1-7B8696256EE9@princeton.edu> References: <88d60ace-45e6-4d41-afc4-113df7219c4d@aklaver.com> <4000D0EE-B250-4E9E-831F-00C034D6D0B5@princeton.edu> <6d833658-f461-4ad4-a3e1-86d3c515bc18@aklaver.com> <0FE9C709-A108-4ED5-8132-B802B8D9908F@princeton.edu> <00DF52D1-5ED0-4599-8480-8C671434EE4E@princeton.edu> <90368E5C-6F0C-45AE-B73A-5A467EC97043@princeton.edu> <5b7b7573-3b75-4061-a6fe-1387d1e0e51c@aklaver.com> <67D44BCB-71FA-4EF6-8C41-53F852683CFA@princeton.edu> To: Adrian Klaver X-Mailer: Apple Mail (2.3696.120.41.1.1) List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk >> Anyway, thanks so much for your help. This discussion has been very = useful, and I think I will proceed at first, exactly how you suggested, = by queuing every validation job (using celery). Then I will explore = whether or not I can apply the "on timeout" strategy in a small patch. >> Incidentally, during our Wednesday meeting this week, we actually = opened our public instance to the world for the first time, in = preparation for the upcoming publication. This discussion is about the = data submission interface, but that interface is actually disabled on = the public-facing instance. The other part of the codebase that I was = primarily responsible for was the advanced search. Everything else was = primarily by other team members. If you would like to check it out, let = me know what you think: http://tracebase.princeton.edu = >=20 > I would have to hit the books again to understand all of what is going = on here. It's a mass spec tracing database. Animals are infused with radio = labeled compounds and mass spec is used to see what the animal's = biochemistry turns those compounds into. (My undergrad was biochem, so = I've been resurrecting my biochem knowledge, as needed for this project. = I've been mostly doing RNA and DNA sequence analysis since undergrad, = and most of that was prokaryotic. > One quibble with the Download tab, there is no indication of the size = of the datasets. I generally like to know what I am getting into before = I start a download. Also, is there explicit throttling going on? I am = seeing 10.2kb/sec, whereas from here = https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page I = downloaded a 47.65M file at 41.9MB/s Thank you! Not knowing the download size is exactly a complaint I had. = That download actually uses my advanced search interface (in browse = mode). There is the same issue with the download buttons on the = advanced search. With the streaming, we're not dealing with temp files, = which is nice, at least for the advanced search, but we can't know the = download size that way. So I had wanted a progress bar to at least show = progress (current record per total). I could even estimate the size (an = option I explored for a few days). Eventually, I proposed a celery = solution for that and I was overruled. As for the download in the nav bar, we have an issue to change that to a = listing of actual files broken down by study (3 files per study). = There's not much actual utility from a user perspective for downloading = everything anyway. We've just been focussed on other things. In fact, = we have a request from a user for that specific feature, done in a way = that's compatible with curl/scp. We just have to figure out how to not = have to CAS authenticate each command, something I don't have experience = with.=