Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mijxQ-00030a-Bz for pgsql-www@arkaria.postgresql.org; Thu, 04 Nov 2021 21:07:36 +0000 Received: from localhost ([127.0.0.1] helo=malur.postgresql.org) by malur.postgresql.org with esmtp (Exim 4.92) (envelope-from ) id 1mijxN-0003Zf-RD for pgsql-www@arkaria.postgresql.org; Thu, 04 Nov 2021 21:07:33 +0000 Received: from magus.postgresql.org ([2a02:c0:301:0:ffff::29]) by malur.postgresql.org with esmtps (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mijxN-0003ZV-Fq for pgsql-www@lists.postgresql.org; Thu, 04 Nov 2021 21:07:33 +0000 Received: from mail-lf1-x12d.google.com ([2a00:1450:4864:20::12d]) by magus.postgresql.org with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1mijxG-0000N5-Hm for pgsql-www@lists.postgresql.org; Thu, 04 Nov 2021 21:07:33 +0000 Received: by mail-lf1-x12d.google.com with SMTP id o18so14512864lfu.13 for ; Thu, 04 Nov 2021 14:07:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=hagander-net.20210112.gappssmtp.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=56FG8xQLDS8l77PhwXj/xCARuuKridA3ZJ0E5psfq+M=; b=AgaMC5o/8lK2BGMBNjtSmi+Es/0Hd4SI8FHdEZr0C9aV/ug9ZTg9uEwpWezDgXqKeS Ve8po1eAAbsqXBhRs7daqIpY/7AWhDQH00Z6nxvnRK0tMe33g8UP4OPRRDBYR70iqFXN DFzWEMBphRZCPT0y+wFGc9VAjpDhGnEfz4YAtAlyGM9dGW6jztEbvgr9k3E5tcfbKbcD Sz3t/8Is1HejUqmFuYvq7ppdF8ebzIHg2KcbN5aKsKD2keY/wbSpNWg4vffny9tkRp2n C1rLxYGzigFt57MXhN/iBY1lVvsk/La9DtkXefT/8XBduxK2POLtqeJqYiDezI1E5Nna UAVQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=56FG8xQLDS8l77PhwXj/xCARuuKridA3ZJ0E5psfq+M=; b=j9Hkb/6TOqr/nZGRjyXPVFAM6X8Vl0WbdGalnCQV9AyiWRjrPlXIusGvCwf05wyrQE 4O+Ny7AW5NtCCVynezUZHskZCO3dAZDBVQedMhVqznUv/9YpNhdVUgfHcPBtToPnR9hm xdhdDSlRZ7WUe+37F2qPlJyxbT68rTerRSv1X4JQwC1zkQWeRHQj3KZ5FTC+C1TL/9j5 JtJCmBxOmsmzimIrpyGv47qx6oaU7ctOgC9kaBLoKkpeoOncxb5EBMthTRs2yClgfFLl 31qrlOqUBspjr0bd7Ki40z2uFsFwfPnqbPPHZy6FqndaIKJU5oirClmxbsWTKa0PL9Fs zfSQ== X-Gm-Message-State: AOAM530vOFkzlOjGLfd/+MNtfKkLCss+E5hpgoXPAWZGBb2chYTqIPkr Nhzr/wontLQuVsUTG4LXOKUExNl1lwKcgVqldyezaGZEDb0= X-Google-Smtp-Source: ABdhPJwTFqPwUhWx5zFkEnOsWjyyNuoyhruYXjeUdNB1Ll3Vfoh6eXTHArUezLA2I9ctdOnIjSAh6+zkHaQoZQCmMRs= X-Received: by 2002:a19:ef13:: with SMTP id n19mr31388725lfh.496.1636060045492; Thu, 04 Nov 2021 14:07:25 -0700 (PDT) MIME-Version: 1.0 References: <53316703-db69-7067-c82e-47a598711595@cmatte.me> <202111041947.x5su2sdjdx74@alvherre.pgsql> In-Reply-To: <202111041947.x5su2sdjdx74@alvherre.pgsql> From: Magnus Hagander Date: Thu, 4 Nov 2021 22:07:14 +0100 Message-ID: Subject: Re: [PATCH] pgarchives: parser: handle messages in which Message-ID is missing To: Alvaro Herrera Cc: =?UTF-8?Q?C=C3=A9lestin_Matte?= , pgsql-www@lists.postgresql.org Content-Type: multipart/alternative; boundary="000000000000d7dd4605cffce98d" List-Id: List-Help: List-Subscribe: List-Post: List-Owner: List-Archive: Archived-At: Precedence: bulk --000000000000d7dd4605cffce98d Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Thu, Nov 4, 2021 at 8:47 PM Alvaro Herrera wrote: > On 2021-Nov-04, C=C3=A9lestin Matte wrote: > > > > I don't think this should be the responsibility of pglister. As you > > > say, "most MTAs do add this field" -- and the solution is to > > > configure the MTA to do this. We already rely on the MTA to get a > > > lot of other important things right. > > > > But then these messages will get delivered by pglister but pgarchives > > will fail to archive them, although they do not actually break > > requirements. Shouldn't we follow the RFC here? > I agree that the scenario is a problem, per below. I don't agree that making up an id is a solution to that problem. Maybe pglister should refuse to deliver messages that don't contain > a Message-Id. > It should. I actually thought it did already, but apparently it does not. I guess we've only ever used it under properly configured MTAs :) Have you actually come across any case where a *proper* non-spam message is sent without a message-id and passes through actual mailservers on the way? Looking through the approximately 1.4 million mails in the postgres list archives, not a single one has a message-id generated by the archives server MTA (which is configured to generate it). Not a single one by our inbound relay servers. And exactly one by the pglister server -- which turns out to be a bounce that ended up in the archives because of a misconfiguration back in 2018 that's not visible in the public archives. --=20 Magnus Hagander Me: https://www.hagander.net/ Work: https://www.redpill-linpro.com/ --000000000000d7dd4605cffce98d Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable


=
On Thu, Nov 4, 2021 at 8:47 PM Alvaro= Herrera <a= lvherre@alvh.no-ip.org> wrote:
On 2021-Nov-04, C=C3=A9lestin Matte wrote:

> > I don't think this should be the responsibility of pglister. = As you
> > say, "most MTAs do add this field" -- and the solution = is to
> > configure the MTA to do this. We already rely on the MTA to get a=
> > lot of other important things right.
>
> But then these messages will get delivered by pglister but pgarchives<= br> > will fail to archive them, although they do not actually break
> requirements. Shouldn't we follow the RFC here?

I agree that the scenario is a problem, per below.=C2=A0 = I don't agree that making up an id is a solution to that problem.


Maybe pglister should refuse to deliver messages that don't contain
a Message-Id.

It should. I actually tho= ught it did already, but apparently it does not. I guess we've only eve= r used it under properly configured MTAs :)

Have y= ou actually come across any case where a *proper* non-spam message is sent = without a message-id and passes through actual mailservers on the way?=C2= =A0

Looking through the approximately 1.4 million = mails in the postgres list archives, not a single one has a message-id gene= rated by the archives server MTA (which is configured to generate it). Not = a single one by our inbound relay servers. And exactly one by the pglister = server -- which turns out to be a bounce that ended up in the archives beca= use of a misconfiguration back in 2018 that's not visible in the public= archives.

--
=C2=A0Magnus Hagander
=C2=A0Me: https://www.hagander.net/
=C2=A0Work: https://www.redpill-l= inpro.com/
--000000000000d7dd4605cffce98d--