Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1sU8gu-000kBJ-21 for pgsql-general@arkaria.postgresql.org; Wed, 17 Jul 2024 17:43:48 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.94.2) (envelope-from ) id 1sU8gs-003kTn-5W for pgsql-general@arkaria.postgresql.org; Wed, 17 Jul 2024 17:43:46 +0000 Received: from makus.postgresql.org ([2001:4800:3e1:1::229]) by malur.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1sU8gr-003kS4-MR for pgsql-general@lists.postgresql.org; Wed, 17 Jul 2024 17:43:46 +0000 Received: from mail-vs1-xe29.google.com ([2607:f8b0:4864:20::e29]) by makus.postgresql.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.94.2) (envelope-from ) id 1sU8gp-0003fs-Ce for pgsql-general@lists.postgresql.org; Wed, 17 Jul 2024 17:43:44 +0000 Received: by mail-vs1-xe29.google.com with SMTP id ada2fe7eead31-48feaeaa705so431290137.2 for ; Wed, 17 Jul 2024 10:43:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1721238222; x=1721843022; darn=lists.postgresql.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=epOsSB5rqdCxKmOOyNlPQLOtMeiwpWsPsj/2BIRwLF8=; b=lC1fiI+D7ZPI7/eTFheqOgshRM+4XbvsyGGAe/evuL5g2uSiZi9XEpaZyH5abO3aMB WiOXH4feTr036pyFDqFbwpYRqe99VftJuI9bXaQiFRtyjTdxvDPK07XVnjcNLbyUI62T MoKX4uGSyiIGOhBzpMdrzxlnZTWGRqHPLGTqyWMtOprlMGz1ko7XjZlMpvHGb3dH8pB/ FV74vredZf/k1WcV5qi9n4aIkai9zFgM8MWHVuI4TTx+KuXaf9xVnFykU1wqfM+L4BPZ 6QbFO9/Ag6cK3imzbWMwWC1o91gyzGRS+M31C402JzqRYJUmSFdRpyx+jMkUjjGWICmV Lovw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1721238222; x=1721843022; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=epOsSB5rqdCxKmOOyNlPQLOtMeiwpWsPsj/2BIRwLF8=; b=MgS3qNcUf9m1KukKaBbPMMyScGDEWfPEeIQSJggyPGBtljFefKYNWM36nuX+HdNu7R gxycfzV2Q84vHLjpMdFHlAeT9B187V66MHxIn3r4PyxHuza1vq5gK0U1B2PZLN32s4tX Z+nRRP5UoXe3TVFq9gIxqEWh6um5MsAQ36AKvJGZg3Thy0w/XXhGD1nyQFwc3F2kOdE7 o8jQ2jADHttjNyqbDu6U6kYuEPqnwrIXDF2m0KtE6HWjJrMOH+XZcBTXJlo6t5ZoUsLs m1CS6t+lwwTI5/pjKTZKYiEfrQd7UALpoT3tu8PkEmZWXnszyO9CFP9Epxtv4Fg8GMmR +WlA== X-Gm-Message-State: AOJu0YyuG1YPe9AnhpakImGVdPVV3PLycksoJ31lJOudZUiN43Re9pLr 7vw82rvcbQG7qdT+VWPS0HBQttyACbBGP/nhWZRtEk5YXNpKSGoUo1m6KpgDPb2S/BTy9PnJ3wY mbekTxw+yJHOPNO6DxK9LgP9HHBNGN7iP X-Google-Smtp-Source: AGHT+IFJT2m6HTTHvKGLxdeM94Wq+BrURFlnCl4DGnqy+I3paJm75+CZKDablEpWsdgT2bJcmUBOawZik1hoVjKQ3t4= X-Received: by 2002:a05:6102:3348:b0:48f:39df:2d8e with SMTP id ada2fe7eead31-491599acbe9mr3058424137.19.1721238222349; Wed, 17 Jul 2024 10:43:42 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Kashif Zeeshan Date: Wed, 17 Jul 2024 22:43:33 +0500 Message-ID: Subject: Re: Planet Postgres and the curse of AI To: Greg Sabino Mullane Cc: pgsql-general Content-Type: multipart/alternative; boundary="000000000000d12c8d061d750016" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --000000000000d12c8d061d750016 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Hi Greg I agree with you on the misuse of AI based tools, as per my experience with Postgres the solutions suggested wont work at times. Its not bad to get help from these tools but put all the solutions from there is counter productive. I think People should take care while using these tools while suggesting solutions for real world problems. Regards Kashif Zeeshan On Wed, Jul 17, 2024 at 10:22=E2=80=AFPM Greg Sabino Mullane wrote: > I've been noticing a growing trend of blog posts written mostly, if not > entirely, with AI (aka LLMs, ChatGPT, etc.). I'm not sure where to raise > this issue. I considered a blog post, but this mailing list seemed a bett= er > forum to generate a discussion. > > The problem is two-fold as I see it. > > First, there is the issue of people trying to game the system by churning > out content that is not theirs, but was written by a LLM. I'm not going t= o > name specific posts, but after a while it gets easy to recognize things > that are written mostly by AI. > > These blog posts are usually generic, describing some part of Postgres > in an impersonal, mid-level way. Most of the time the facts are not > wrong, per se, but they lack nuances that a real DBA would bring to the > discussion, and often leave important things out. Code examples are often > wrong in subtle ways. Places where you might expect a deeper discussion a= re > glossed over. > > So this first problem is that it is polluting the Postgres blogs with > overly bland, moderately helpful posts that are not written by a human, a= nd > do not really bring anything interesting to the table. There is a place f= or > posts that describe basic Postgres features, but the ones written by huma= ns > are much better. (yeah, yeah, "for now" and all hail our AI overlords in > the future). > > The second problem is worse, in that LLMs are not merely gathering > information, but have the ability to synthesize new conclusions and facts= . > In short, they can lie. Or hallucinate. However you want to call it, it's= a > side effect of the way LLMs work. In a technical field like Postgres, thi= s > can be a very bad thing. I don't know how widespread this is, but I was > tipped off about this over a year ago when I came across a blog suggestin= g > using the "max_toast_size configuration parameter". For those not > familiar, I can assure you that Postgres does not have, nor will likely > ever have, a GUC with that name. > > As anyone who has spoken with ChatGPT knows, getting small important > details correct is not its forte. I love ChatGPT and actually use it dail= y. > It is amazing at doing certain tasks. But writing blog posts should not b= e > one of them. > > Do we need a policy or a guideline for Planet Postgres? I don't know. It > can be a gray line. Obviously spelling and grammar checking is quite > okay, and making up random GUCs is not, but the middle bit is very hazy. > (Human) thoughts welcome. > > Cheers, > Greg > > --000000000000d12c8d061d750016 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hi Greg

I agree with you on the misuse = of AI based tools, as per my experience with Postgres the solutions suggest= ed wont work at times.
Its not bad to get help from these tools b= ut put all the solutions from there is counter productive.
I thin= k People should take care while using these tools while suggesting solution= s for real world problems.

Regards
Kashi= f Zeeshan

On Wed, Jul 17, 2024 at 10:22=E2=80=AFPM Greg Sabino Mullan= e <htamfids@gmail.com> wrot= e:
I've been noticing a gr= owing trend of blog posts written mostly, if not entirely,=C2=A0with AI (ak= a LLMs, ChatGPT, etc.). I'm not sure where to raise this issue.=C2=A0I = considered a blog post, but this mailing list seemed a better forum to=C2= =A0generate a discussion.

The problem is two-fold as I see it.
First, there is the issue of people trying to game the system by churning= out=C2=A0content that is not theirs, but was written by a LLM. I'm not= going to name=C2=A0specific posts, but after a while it gets easy to recog= nize things that=C2=A0are written mostly by AI.

These blog posts are= usually generic, describing some part of Postgres in=C2=A0an impersonal, m= id-level way. Most of the time=C2=A0the facts are not wrong,=C2=A0per se, b= ut they lack nuances that a real DBA would bring to the discussion,=C2=A0an= d often leave important things out. Code examples are often wrong in subtle= ways. Places where you might expect a deeper discussion are glossed over.<= /div>

So this first problem is t= hat it is polluting the Postgres blogs with overly=C2=A0bland, moderately h= elpful posts that are not written by a human, and do not=C2=A0really bring = anything interesting to the table. There is a place for posts=C2=A0that des= cribe basic Postgres features, but the ones written by humans are=C2=A0much= better. (yeah, yeah, "for now" and all hail our AI overlords in = the future).

The second problem is worse, in that LLMs are not merel= y gathering information,=C2=A0but have the ability to synthesize new conclu= sions and facts. In short, they can lie.=C2=A0Or hallucinate. However you w= ant to call it, it's a side effect of the way LLMs work. In a technical= =C2=A0field like Postgres, this can be a very bad thing. I don't know h= ow widespread this=C2=A0is, but I was tipped off about this over a year ago= when I came across a blog=C2=A0suggesting using the "max_toast_size c= onfiguration parameter". For those not familiar,=C2=A0I can assure you= that Postgres does not have, nor will likely ever have, a GUC with that na= me.

As anyone who has spoken with ChatGPT knows, getting small impor= tant details=C2=A0correct is not its forte. I love ChatGPT and actually use= it daily. It is=C2=A0amazing at doing certain tasks. But writing blog post= s should not be one of them.

Do we need a policy or a guideline for = Planet Postgres? I don't know.=C2=A0It can be a gray line. Obviously sp= elling and grammar checking is quite okay,=C2=A0and making up random GUCs i= s not, but the middle bit is very hazy. (Human) thoughts welcome.
=

Cheers,
Greg

=
--000000000000d12c8d061d750016--