Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1sPkKx-0068Yh-1K for pgsql-general@arkaria.postgresql.org; Fri, 05 Jul 2024 14:54:59 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1sPkKu-00Am1x-Si for pgsql-general@arkaria.postgresql.org; Fri, 05 Jul 2024 14:54:57 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1sPkKt-00Am1o-IF for pgsql-general@lists.postgresql.org; Fri, 05 Jul 2024 14:54:57 +0000 Received: from fout6-smtp.messagingengine.com ([103.168.172.149]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1sPkKq-000Y6K-QE for pgsql-general@lists.postgresql.org; Fri, 05 Jul 2024 14:54:55 +0000 Received: from compute2.internal (compute2.nyi.internal [10.202.2.46]) by mailfout.nyi.internal (Postfix) with ESMTP id 24116138026F; Fri, 5 Jul 2024 10:54:51 -0400 (EDT) Received: from mailfrontend1 ([10.202.2.162]) by compute2.internal (MEProxy); Fri, 05 Jul 2024 10:54:51 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=aklaver.com; h= cc:content-transfer-encoding:content-type:content-type:date:date :from:from:in-reply-to:in-reply-to:message-id:mime-version :references:reply-to:subject:subject:to:to; s=fm1; t=1720191291; x=1720277691; bh=ZxlP4jv6dj6ooa8dawaltADkmZOcBaUj78oawt8no50=; b= v1G6T8kBLB9wwQn85GZYiOWLzeCSGzzfTuktj9SyyNtixv9kKjtFWmiyhgxjR9o5 A/M+T44LQ8ntuH6yIAES1AJbax/MdGBenOQB1K4TFPsHC8OEEYB2aFWj9w2y7FtC 8EQlIsw/Ljs4sltPjee9i9r9cuBrn0K5KgizouuVAWQkDQ4gIG2BVFhUGDSAvg1I RKxWFS5rPrEmNPrwQNU/9rkr/7IYxgsFeNI4N0Y+I6Z4uQ6izKz9Wtl2Upi/ICwd iXtSYvjmTLfbaEVuy9sDpTrq1JnMvl0X75Xwv3Q21JPdG0l+e7pJ6IUUCgLinP3D g4W7R5NwlpJWomZlj5IEcQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :content-type:date:date:feedback-id:feedback-id:from:from :in-reply-to:in-reply-to:message-id:mime-version:references :reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy :x-me-sender:x-me-sender:x-sasl-enc; s=fm2; t=1720191291; x= 1720277691; bh=ZxlP4jv6dj6ooa8dawaltADkmZOcBaUj78oawt8no50=; b=D GvJx0+xDbkqhf5mXOImMO8EX8FlEdC2yr2P22H1sAXM4Hipu4lfRYEiz9pLl0Luu qzzGRjE1ZwqUulegGO3kWl25UFpX4vQJ6Yi3EUE5Gm0yilmpURHZpOxVMOnDVk9h 65TebTt9F8Hq6yNo6Z4X4deDivMlqEg+WBsQBTt1gx+uxdj4DvvUDaiJEvsPfF4F x45fBmuaQf/sLw1H5klGT0dKVdDOgSY7vUoL+9fzyJezy4KUFL/mSknkud1BQAwQ nUK/ZmZM4b5gLVZd80WAbfNu8YcrNdan08H82bj+LTLh/mCjXe7f4v/rcUDaLGqu t2Ged0JqQHpa7jjwRaeFQ== X-ME-Sender: X-ME-Received: X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddrvddugdekudcutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecunecujfgurhepkfffgggfuffvfhfhjggtgfesthejre dttddvjeenucfhrhhomheptegurhhirghnucfmlhgrvhgvrhcuoegrughrihgrnhdrkhhl rghvvghrsegrkhhlrghvvghrrdgtohhmqeenucggtffrrghtthgvrhhnpeeivdfhieehhe egueeileejieettdejhedugeefleekvdelkeehtdfgiefffeekudenucevlhhushhtvghr ufhiiigvpedtnecurfgrrhgrmhepmhgrihhlfhhrohhmpegrughrihgrnhdrkhhlrghvvg hrsegrkhhlrghvvghrrdgtohhm X-ME-Proxy: Feedback-ID: i76984098:Fastmail Received: by mail.messagingengine.com (Postfix) with ESMTPA; Fri, 5 Jul 2024 10:54:50 -0400 (EDT) Message-ID: <11d5753c-578e-41a2-af17-6de956f03058@aklaver.com> Date: Fri, 5 Jul 2024 07:54:49 -0700 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: Load a csv or a avro? To: sud , pgsql-general References: Content-Language: en-US From: Adrian Klaver In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk On 7/5/24 02:08, sud wrote: > Hello all, > > Its postgres database. We have option of getting files in csv and/or in > avro format messages from another system to load it into our postgres > database. The volume will be 300million messages per day across many > files in batches. Are dumping the entire contents of each file or are you pulling a portion of the data out? > > My question was, which format should we chose in regards to faster data > loading performance ? and if any other aspects to it also should be > considered apart from just loading performance? > -- Adrian Klaver adrian.klaver@aklaver.com