Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1w5rxL-003i5z-0L for pgsql-hackers@arkaria.postgresql.org; Thu, 26 Mar 2026 21:09:31 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.96) (envelope-from ) id 1w5rxJ-005MpM-23 for pgsql-hackers@arkaria.postgresql.org; Thu, 26 Mar 2026 21:09:30 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from ) id 1w5rxJ-005MpD-18 for pgsql-hackers@lists.postgresql.org; Thu, 26 Mar 2026 21:09:29 +0000 Received: from mail-ot1-x333.google.com ([2607:f8b0:4864:20::333]) by magus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.98.2) (envelope-from ) id 1w5rxH-00000001M8N-1S79 for pgsql-hackers@postgresql.org; Thu, 26 Mar 2026 21:09:29 +0000 Received: by mail-ot1-x333.google.com with SMTP id 46e09a7af769-7d7e5e8c907so926880a34.0 for ; Thu, 26 Mar 2026 14:09:27 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20251104; t=1774559366; x=1775164166; darn=postgresql.org; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:from:to:cc:subject:date:message-id:reply-to; bh=gclNfuNZOwKgqS/JBfXwD+x4lF/Atj2fIeqBdnrLYzk=; b=ImIgjwXSomzImD8k/8gzHF0UvvWVddDEEIODLn1Qvh+IFDSKlWCVi+o0pZHDysQ0Wr AmZ01mbhZRwIqqcfhWKCr1mYc0iR+cMt1CEGWSQ53Tm5npJryAUKrmBK+/NEzyB2wlB+ KbJ/nxxuRFQiV1DXOtPcd8f35XCIRsU1o5qHEDnRPq5SgAHTd5wux5zHOjE7L9aownXi 2RwXILSRYERh17AAAVqkqKjNA/5jhDh7ujjCeSBrekWghrgm6jynBtgR+DIust4g3QfH 2AKfptCbfKsUNSrFAtHOCWDHpBx90zHCS11va9kTczbMf7aV6lB9UbCjMeByCA9Nj2dl OGGw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20251104; t=1774559366; x=1775164166; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-gg:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=gclNfuNZOwKgqS/JBfXwD+x4lF/Atj2fIeqBdnrLYzk=; b=dQT2QPdgifuLH6FgzarG+0AdYTN4kuWW09vgTcALNcO39tl++ZGj1yOG2rWx1QFpFm YC/Y2/iflAoT/i2p6G5sv/3Y50H6yTi4qTxSzcWK3QK51v24LMtl38uw1iQ1sP+lVOJ5 JLORIQyK7lw31U7746fvXQjph9NktwT01AavkvcUwiNj5+IsOH0BxDFyjoe8WRcZuWD4 MlFtVfZQG2VaUxTqKiyShcd9rAJzdkAv7EIqfXZqjrZbwY7M+uN98eyOpA5uFJozU5ag ZoScpcoTjFmfsrAP2hTs/BD65BpeTyVGREu2YZTapqkLUGmBJCbuvdbx3uNZq1nI0sV/ rJ9g== X-Forwarded-Encrypted: i=1; AJvYcCVuyHuYF6863/4i9HgCsJohALy1GgJp/P8Vbf9fTVOag7UPVVoE5SdBHwDazUI/fdKfKYOa7nv7IB57/PdE@postgresql.org X-Gm-Message-State: AOJu0YxzqD0Yluz82Ag9EEwF+9d8isy7bIq0jO7/yKTGJYiQykVmf7qk oFGPu8b0wFxIkBzpf6Yl6kLevlJJKpnWKTbPoZUKEhcMwSFs3pthWWxl X-Gm-Gg: ATEYQzyK1gJaQ/C1Rp67dQjeQHHlKUwa7v/QsXTPBj8x6VwqVo8Fs2cJ0p+xeSH2ULb rf/KgBOx2IrZwEoN4x5CuthDt2mZ2wyz8wYPnBjhGuO8eSCtT7gLOC8R0a1/lGuwDGhTREVFG6r 6X/14ZPOzsS/AdffryE/RUEeC8b3Rk8k7Rsqs7IoyEHAuoNLPtoJ/Q1e0Ya5KjFTJbmHqH3cZnR uAcZnpSQIw1FmUKUxnvbxerxU0GpmyG2H7lavx1QX0QDCbKQET69/m7iWWYX9hqVcrTZUKfSThx KI8a1KdZYhFsrtq1SpnKIFrXf4G/Kt5cP2NmtcwvO030sTzyY1MtiOmwQnfNeaY0vHftG0GVisg yw/4kUgS7hbfSVRVw3ggiZ6m8TRX85FVAyvuUigv9WGaWEtpO3BoxHm4YUP7ucWIwLYpEkr/3NB pmsEpT4A8NkxUo6YGWLYhej3UXI7wUa2BlzfFn39W1laFYB9ls7YCtPZn5Ng8RCoHEdOCDXjbel 8aQ6lRBITX//DEROKrmMw== X-Received: by 2002:a05:6830:43aa:b0:7d7:ea9f:c104 with SMTP id 46e09a7af769-7d9d63b3aeamr4452268a34.0.1774559366340; Thu, 26 Mar 2026 14:09:26 -0700 (PDT) Received: from nathan (162-195-168-172.lightspeed.stlsmo.sbcglobal.net. [162.195.168.172]) by smtp.gmail.com with ESMTPSA id 46e09a7af769-7d9e7272b57sm3121376a34.24.2026.03.26.14.09.25 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 26 Mar 2026 14:09:25 -0700 (PDT) Date: Thu, 26 Mar 2026 16:09:23 -0500 From: Nathan Bossart To: KAZAR Ayoub Cc: Andres Freund , Pg Hackers , Neil Conway , Manni Wood , Andrew Dunstan , Shinya Kato , Mark Wong , Nazir Bilal Yavuz Subject: Re: Speed up COPY TO text/CSV parsing using SIMD Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk On Wed, Mar 18, 2026 at 12:02:28AM +0100, KAZAR Ayoub wrote: > Test Master v3 v3_var v3_var_noinl > TEXT clean 1504ms -24.1% -23.0% -21.5% > CSV clean 1760ms -34.9% -32.7% -33.0% Nice! > TEXT 1/3 backslashes 3763ms +4.6% +6.9% +4.1% > CSV 1/3 quotes 3885ms +3.1% +2.7% -0.8% Hm. These seem a little bit beyond what we could ignore as noise. > Wide table TEXT (integer columns): > > Cols Master v3 v3_var v3_var_noinl > 50 2083ms -0.7% -0.6% +3.5% > 100 4094ms -0.1% -0.5% +4.5% > 200 1560ms +0.6% -2.3% +3.2% > 500 1905ms -1.0% -1.3% +4.7% > 1000 1455ms +1.8% +0.4% +4.3% These numbers look roughly within the noise range. > Wide table CSV: > > Cols Master v3 v3_var v3_var_noinl > 50 2421ms +4.0% +6.7% +5.8% Hm. Is this reproducible? A 4% regression is a bit worrisome. > 100 4980ms +0.1% +2.0% +0.1% > 200 1901ms +1.4% +3.5% +1.4% > 500 2328ms +1.8% +2.7% +2.2% > 1000 1815ms +2.0% +2.8% +2.5% These numbers don't bother me too much, but maybe there are some ways to minimize the regressions further. -- nathan