Received: from localhost (unknown [200.46.204.183]) by postgresql.org (Postfix) with ESMTP id 1CD28650342 for ; Thu, 17 Jul 2008 09:45:57 -0300 (ADT) Received: from postgresql.org ([200.46.204.86]) by localhost (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024) with ESMTP id 95098-05 for ; Thu, 17 Jul 2008 09:45:48 -0300 (ADT) X-Greylist: from auto-whitelisted by SQLgrey-1.7.6 Received: from lists.commandprompt.com (host-159.commandprompt.net [207.173.203.159]) by postgresql.org (Postfix) with ESMTP id 9124C64FF8A for ; Thu, 17 Jul 2008 09:45:53 -0300 (ADT) Received: from commandprompt.com (CPE001b63afe888-CM001adea9c5a6.cpe.net.cable.rogers.com [99.236.211.160]) (authenticated bits=0) by lists.commandprompt.com (8.13.8/8.13.8) with ESMTP id m6HCm7fN028876 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) for ; Thu, 17 Jul 2008 05:48:11 -0700 Date: Thu, 17 Jul 2008 08:45:46 -0400 From: Andrew Sullivan To: pgsql-www@postgresql.org Subject: Re: Spam filtering on the mailing lists Message-ID: <20080717124546.GA6732@commandprompt.com> References: <200807161509.m6GF9RO14179@momjian.us> <1216227673.28961.139.camel@jd-laptop> <20080716180738.GF5758@commandprompt.com> <8C1E1C3655705FB9706E2E10@ganymede.hub.org> <361E31A33E6E19B7844FA853@ganymede.hub.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <361E31A33E6E19B7844FA853@ganymede.hub.org> User-Agent: Mutt/1.5.17 (2007-11-01) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0 (lists.commandprompt.com [207.173.203.159]); Thu, 17 Jul 2008 05:48:11 -0700 (PDT) X-Virus-Scanned: Maia Mailguard 1.0.1 X-Spam-Status: No, hits=0 tagged_above=0 required=5 tests=none X-Spam-Level: X-Archive-Number: 200807/103 X-Sequence-Number: 15533 On Thu, Jul 17, 2008 at 12:01:08AM -0300, Marc G. Fournier wrote: > Does anyone know if there is some 'rule' that return mail about 'Undeliverable' > has to be in English? Nope, there isn't. The bounce message is intended for human consumption, and therefore localisation is approproate. Note, however, that many of the messages should have a Content-Type: multipart/report; report-type=delivery-status; header. See RFC 3462, which obsoletes RFC 1892. RFC 3462 requires a machine-parsable delivery report, and one such report is defined in RFC 3464. Many mail systems (most?) conform to this these days, so just detecting thatt here is a delivery-status header, then parsing the second body-part for "^Action: failed" oughta work. The message/delivery-status content-type MUST be 7 bit, so you can be relatively certain it's not going to be anything but ASCII. (AFAIK, there are no machines left on the Internet that leak 7-bit charsets that are not ASCII). A -- Andrew Sullivan ajs@commandprompt.com +1 503 667 4564 x104 http://www.commandprompt.com/