From: sud
Date: Sun, 7 Jul 2024 01:32:33 +0530
Subject: Re: Load a csv or a avro?
To: Kashif Zeeshan
Cc: pgsql-general

On Fri, Jul 5, 2024 at 3:27 PM Kashif Zeeshan wrote:

> Hi
>
> There are different data formats available; following are a few points on
> their performance implications:
>
> 1. CSV: It's easy to use and widely supported, but it can be slower due to
> parsing overhead.
> 2. Binary: It's faster to load but not human-readable.
>
> Hope this helps.
>
> Regards
> Kashif Zeeshan

My understanding was that it will be faster to load .csv, as it is already mapped to table rows and columns, whereas in the case of .avro the fields in the avro first have to be mapped to the appropriate columns in the table, and that will add overhead. Is my understanding correct?

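To make the mapping step in the question concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not a benchmark and not a statement about which format loads faster. The table columns (`id`, `name`, `amount`) and the helper names are hypothetical, and the Avro side is represented by plain dicts standing in for records that some Avro library has already decoded; no Avro file is actually read here. PostgreSQL's COPY accepts text, csv, and binary input, so Avro data would need to be decoded and turned into rows by the client before it can be loaded.

```python
import csv
import io

# Hypothetical target table columns, in table order (assumption for illustration).
TABLE_COLUMNS = ["id", "name", "amount"]


def rows_from_csv(text):
    """CSV rows are positional: each row already lines up with TABLE_COLUMNS."""
    return [tuple(row) for row in csv.reader(io.StringIO(text))]


def rows_from_records(records, columns=TABLE_COLUMNS):
    """Decoded Avro-style records are keyed by field name, so each record is
    reordered into the table's column order (missing fields become None)."""
    return [tuple(rec.get(col) for col in columns) for rec in records]


def to_copy_csv(rows):
    """Serialize rows as CSV text, the shape a COPY ... (FORMAT csv) load reads."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


csv_text = "1,alice,10.5\n2,bob,7\n"

# Stand-in for records decoded from an .avro file (assumption: decoding is
# done elsewhere); note the field order differs from the table's column order.
avro_like_records = [
    {"name": "alice", "amount": 10.5, "id": 1},
    {"amount": 7, "id": 2, "name": "bob"},
]

csv_rows = rows_from_csv(csv_text)
avro_rows = rows_from_records(avro_like_records)
```

The CSV path only splits text into positional values, while the record path does a per-row lookup by field name before the data can be written out in column order. Whether that extra step matters in practice depends on the data volume and the tooling, so it would need to be measured on the actual workload.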