public inbox for [email protected]  
help / color / mirror / Atom feed
Spam filtering on the mailing lists
18+ messages / 6 participants
[nested] [flat]

* Spam filtering on the mailing lists
@ 2008-07-16 14:14  Greg Sabino Mullane <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Greg Sabino Mullane @ 2008-07-16 14:14 UTC (permalink / raw)
  To: pgsql-www


-----BEGIN PGP SIGNED MESSAGE-----
Hash: RIPEMD160


I'd like to reiterate my request for some (or better?) spam filtering on
the lists. I just rejected over 50 messages to pgsql-bugs, all of them
spam and 95% of them would be easily caught by the worst spam filter
on the market. The chance of the moderators mistakenly rejecting a
legitimate message increases as the volume of spam does. If this is
something the project cannot handle for technical/political reasons,
there are plenty of third-party spam filtering solutions out there,
some of which are using Postgres and would probably be happy to provide
the service to us.

- --
Greg Sabino Mullane [email protected]
PGP Key: 0x14964AC8 200807161012
http://biglumber.com/x/web?pk=2529DF6AB8F79407E94445B4BC9B906714964AC8
-----BEGIN PGP SIGNATURE-----

iEYEAREDAAYFAkh+AdMACgkQvJuQZxSWSsjv5QCeIuHCak5NoPuYFCIRnUIttcje
gDcAoPM7BbVEELLudAGfU3tA0erx0TbT
=nywb
-----END PGP SIGNATURE-----





^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-16 15:09  Bruce Momjian <[email protected]>
  parent: Greg Sabino Mullane <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Bruce Momjian @ 2008-07-16 15:09 UTC (permalink / raw)
  To: Greg Sabino Mullane <[email protected]>; +Cc: pgsql-www

Greg Sabino Mullane wrote:
> I'd like to reiterate my request for some (or better?) spam filtering on
> the lists. I just rejected over 50 messages to pgsql-bugs, all of them
> spam and 95% of them would be easily caught by the worst spam filter
> on the market. The chance of the moderators mistakenly rejecting a
> legitimate message increases as the volume of spam does. If this is
> something the project cannot handle for technical/political reasons,
> there are plenty of third-party spam filtering solutions out there,
> some of which are using Postgres and would probably be happy to provide
> the service to us.

I am confused --- we have no spam filter for the bugs list?

-- 
  Bruce Momjian  <[email protected]>        http://momjian.us
  EnterpriseDB                             http://enterprisedb.com

  + If your life is a hard drive, Christ can be your backup. +



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-16 17:01  Joshua D. Drake <[email protected]>
  parent: Bruce Momjian <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Joshua D. Drake @ 2008-07-16 17:01 UTC (permalink / raw)
  To: Bruce Momjian <[email protected]>; +Cc: Greg Sabino Mullane <[email protected]>; pgsql-www

On Wed, 2008-07-16 at 11:09 -0400, Bruce Momjian wrote:
> Greg Sabino Mullane wrote:

> I am confused --- we have no spam filter for the bugs list?

We do I am sure, I just find that @postgresql.org spam filtering is
lacking. I get more spam at my @postgresql.org address than my
@commandprompt.com address by far.

I have just learned to overlook it.


Joshua D. Drake

-- 
The PostgreSQL Company since 1997: http://www.commandprompt.com/ 
PostgreSQL Community Conference: http://www.postgresqlconference.org/
United States PostgreSQL Association: http://www.postgresql.us/
Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate






^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-16 18:07  Andrew Sullivan <[email protected]>
  parent: Joshua D. Drake <[email protected]>
  0 siblings, 2 replies; 18+ messages in thread

From: Andrew Sullivan @ 2008-07-16 18:07 UTC (permalink / raw)
  To: pgsql-www

On Wed, Jul 16, 2008 at 10:01:13AM -0700, Joshua D. Drake wrote:
> I have just learned to overlook it.

But moderators can't overlook it.  We have to troll through piles of
it.  Much of it is in Cyrillic or CJK, and is just obviously not aimed
at the list.  We still have to process it.

(I say "we" even though I am still unable to send mail from the
account I moderate from, and therefore can't do any useful
moderating.)

A

-- 
Andrew Sullivan
[email protected]
+1 503 667 4564 x104
http://www.commandprompt.com/



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-16 22:35  Joshua D. Drake <[email protected]>
  parent: Andrew Sullivan <[email protected]>
  1 sibling, 0 replies; 18+ messages in thread

From: Joshua D. Drake @ 2008-07-16 22:35 UTC (permalink / raw)
  To: Andrew Sullivan <[email protected]>; +Cc: pgsql-www

On Wed, 2008-07-16 at 14:07 -0400, Andrew Sullivan wrote:
> On Wed, Jul 16, 2008 at 10:01:13AM -0700, Joshua D. Drake wrote:
> > I have just learned to overlook it.
> 
> But moderators can't overlook it.  We have to troll through piles of
> it.  Much of it is in Cyrillic or CJK, and is just obviously not aimed
> at the list.  We still have to process it.

Yeah I thought about it afterword, I moderate announce. I get maybe a
dozen a day.


Joshua D. Drake


-- 
The PostgreSQL Company since 1997: http://www.commandprompt.com/ 
PostgreSQL Community Conference: http://www.postgresqlconference.org/
United States PostgreSQL Association: http://www.postgresql.us/
Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate






^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 02:17  Marc G. Fournier <[email protected]>
  parent: Andrew Sullivan <[email protected]>
  1 sibling, 4 replies; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 02:17 UTC (permalink / raw)
  To: Andrew Sullivan <[email protected]>; pgsql-www

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


Its sad how this is such an ongoing problem, but this is the first that I hear 
that ppl are having problems ... looking at the message headers for a random 
few, I notice that they are scoring >4, but just below 5:

X-Spam-Status: No, hits=4.855 tagged_above=0 required=5 tests=AWL=-1.994,
 DCC_CHECK=1.37, DIGEST_MULTIPLE=0.001, HTML_MESSAGE=0.001,
 MIME_HTML_ONLY=1.672, RAZOR2_CHECK=0.5, RCVD_IN_BL_SPAMCOP_NET=2.188,
 RCVD_IN_SORBS_WEB=1.117
X-Spam-Level: ****

I can change the quarantining to be >4 if ppl want, which should greatly reduce 
the # of messages going through ...

Thoughts?

- --On Wednesday, July 16, 2008 14:07:38 -0400 Andrew Sullivan 
<[email protected]> wrote:

> On Wed, Jul 16, 2008 at 10:01:13AM -0700, Joshua D. Drake wrote:
>> I have just learned to overlook it.
>
> But moderators can't overlook it.  We have to troll through piles of
> it.  Much of it is in Cyrillic or CJK, and is just obviously not aimed
> at the list.  We still have to process it.
>
> (I say "we" even though I am still unable to send mail from the
> account I moderate from, and therefore can't do any useful
> moderating.)
>
> A
>
> --
> Andrew Sullivan
> [email protected]
> +1 503 667 4564 x104
> http://www.commandprompt.com/
>
> --
> Sent via pgsql-www mailing list ([email protected])
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-www



- -- 
Marc G. Fournier        Hub.Org Hosting Solutions S.A. (http://www.hub.org)
Email . [email protected]                              MSN . [email protected]
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (FreeBSD)

iEYEARECAAYFAkh+q7IACgkQ4QvfyHIvDvPYfQCfchdCwNpL+90T5maoAWZUZttL
97wAoLMRAkqfszOxXojM7YNbUIZBX5ug
=9v1h
-----END PGP SIGNATURE-----




^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 02:58  Joshua D. Drake <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  3 siblings, 0 replies; 18+ messages in thread

From: Joshua D. Drake @ 2008-07-17 02:58 UTC (permalink / raw)
  To: Marc G. Fournier <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www

On Wed, 2008-07-16 at 23:17 -0300, Marc G. Fournier wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> 
> Its sad how this is such an ongoing problem, but this is the first that I hear 
> that ppl are having problems ...

I can only assume you mean "this time".

Joshua D. Drake


-- 
The PostgreSQL Company since 1997: http://www.commandprompt.com/ 
PostgreSQL Community Conference: http://www.postgresqlconference.org/
United States PostgreSQL Association: http://www.postgresql.us/
Donate to the PostgreSQL Project: http://www.postgresql.org/about/donate






^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 03:01  Marc G. Fournier <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  3 siblings, 1 reply; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 03:01 UTC (permalink / raw)
  To: Andrew Sullivan <[email protected]>; pgsql-www

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


I just cleared out the AWL ... going through the stuff label'd as 'non-spam', 
am finding alot of stuff scoring a bit low due to the From being in AWL ...

Does anyone know if there is some 'rule' that return mail about 'Undeliverable' 
has to be in English?  I'm finding some that *look* like that sort of thing, 
but in other languages / character sets, and wonder if there is an RFC that is 
meant to direct such stuff 'in English'?  (ie. should I be marking that stuff 
as spam, since *I* can't confirm it isn't?)



- --On Wednesday, July 16, 2008 23:17:22 -0300 "Marc G. Fournier" 
<[email protected]> wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
>
> Its sad how this is such an ongoing problem, but this is the first that I
> hear  that ppl are having problems ... looking at the message headers for a
> random  few, I notice that they are scoring >4, but just below 5:
>
> X-Spam-Status: No, hits=4.855 tagged_above=0 required=5 tests=AWL=-1.994,
>  DCC_CHECK=1.37, DIGEST_MULTIPLE=0.001, HTML_MESSAGE=0.001,
>  MIME_HTML_ONLY=1.672, RAZOR2_CHECK=0.5, RCVD_IN_BL_SPAMCOP_NET=2.188,
>  RCVD_IN_SORBS_WEB=1.117
> X-Spam-Level: ****
>
> I can change the quarantining to be >4 if ppl want, which should greatly
> reduce  the # of messages going through ...
>
> Thoughts?
>
> - --On Wednesday, July 16, 2008 14:07:38 -0400 Andrew Sullivan
> <[email protected]> wrote:
>
>> On Wed, Jul 16, 2008 at 10:01:13AM -0700, Joshua D. Drake wrote:
>>> I have just learned to overlook it.
>>
>> But moderators can't overlook it.  We have to troll through piles of
>> it.  Much of it is in Cyrillic or CJK, and is just obviously not aimed
>> at the list.  We still have to process it.
>>
>> (I say "we" even though I am still unable to send mail from the
>> account I moderate from, and therefore can't do any useful
>> moderating.)
>>
>> A
>>
>> --
>> Andrew Sullivan
>> [email protected]
>> +1 503 667 4564 x104
>> http://www.commandprompt.com/
>>
>> --
>> Sent via pgsql-www mailing list ([email protected])
>> To make changes to your subscription:
>> http://www.postgresql.org/mailpref/pgsql-www
>
>
>
> - --
> Marc G. Fournier        Hub.Org Hosting Solutions S.A. (http://www.hub.org)
> Email . [email protected]                              MSN . [email protected]
> Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
> -----BEGIN PGP SIGNATURE-----
> Version: GnuPG v2.0.9 (FreeBSD)
>
> iEYEARECAAYFAkh+q7IACgkQ4QvfyHIvDvPYfQCfchdCwNpL+90T5maoAWZUZttL
> 97wAoLMRAkqfszOxXojM7YNbUIZBX5ug
> =9v1h
> -----END PGP SIGNATURE-----
>
>
> --
> Sent via pgsql-www mailing list ([email protected])
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-www



- -- 
Marc G. Fournier        Hub.Org Hosting Solutions S.A. (http://www.hub.org)
Email . [email protected]                              MSN . [email protected]
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (FreeBSD)

iEYEARECAAYFAkh+tfQACgkQ4QvfyHIvDvNU7wCg7XDKErTl3ebHhTeS2o6p6J5Y
CscAn1i0bEFAYnHqGTvWmE0XsqjImc+9
=KGUA
-----END PGP SIGNATURE-----




^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 12:45  Andrew Sullivan <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  0 siblings, 0 replies; 18+ messages in thread

From: Andrew Sullivan @ 2008-07-17 12:45 UTC (permalink / raw)
  To: pgsql-www

On Thu, Jul 17, 2008 at 12:01:08AM -0300, Marc G. Fournier wrote:

> Does anyone know if there is some 'rule' that return mail about 'Undeliverable' 
> has to be in English?  

Nope, there isn't.  The bounce message is intended for human
consumption, and therefore localisation is approproate. 

Note, however, that many of the messages should have a Content-Type:
multipart/report; report-type=delivery-status; header.  See RFC 3462,
which obsoletes RFC 1892.

RFC 3462 requires a machine-parsable delivery report, and one such
report is defined in RFC 3464.  Many mail systems (most?) conform to
this these days, so just detecting thatt here is a delivery-status
header, then parsing the second body-part for "^Action: failed" oughta
work.  The message/delivery-status content-type MUST be 7 bit, so you
can be relatively certain it's not going to be anything but ASCII.
(AFAIK, there are no machines left on the Internet that leak 7-bit
charsets that are not ASCII).

A

-- 
Andrew Sullivan
[email protected]
+1 503 667 4564 x104
http://www.commandprompt.com/



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 12:46  Andrew Sullivan <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  3 siblings, 1 reply; 18+ messages in thread

From: Andrew Sullivan @ 2008-07-17 12:46 UTC (permalink / raw)
  To: pgsql-www

On Wed, Jul 16, 2008 at 11:17:22PM -0300, Marc G. Fournier wrote:

> Its sad how this is such an ongoing problem, but this is the first that I hear 
> that ppl are having problems 

I believe I mentioned this before.  If you really want me to plough
through the archives, I will.

> I can change the quarantining to be >4 if ppl want, which should greatly reduce 
> the # of messages going through ...

Sounds like a good idea to me.

A

-- 
Andrew Sullivan
[email protected]
+1 503 667 4564 x104
http://www.commandprompt.com/



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 15:41  Marc G. Fournier <[email protected]>
  parent: Andrew Sullivan <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 15:41 UTC (permalink / raw)
  To: Andrew Sullivan <[email protected]>; +Cc: pgsql-www


On Thu, 17 Jul 2008, Andrew Sullivan wrote:

>> I can change the quarantining to be >4 if ppl want, which should 
>> greatly reduce the # of messages going through ...
>
> Sounds like a good idea to me.

'k, I'll put it down to 4 tonight, and increase the frequency I check for 
false-positives accordingly, to make sure that I don't get a big spike as 
a result ...




^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 15:54  Alvaro Herrera <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Alvaro Herrera @ 2008-07-17 15:54 UTC (permalink / raw)
  To: Marc G. Fournier <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www

Marc G. Fournier wrote:
>
> On Thu, 17 Jul 2008, Andrew Sullivan wrote:
>
>>> I can change the quarantining to be >4 if ppl want, which should  
>>> greatly reduce the # of messages going through ...
>>
>> Sounds like a good idea to me.
>
> 'k, I'll put it down to 4 tonight, and increase the frequency I check for 
> false-positives accordingly, to make sure that I don't get a big spike as 
> a result ...

Hmm, I think keeping it at 5 is a good idea, seeing how 2 of those were
being substracted by the AWL and you've just cleared that.

Maybe what we need is a way to remove a particular domain from the AWL
instead?

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 15:54  Greg Sabino Mullane <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  3 siblings, 1 reply; 18+ messages in thread

From: Greg Sabino Mullane @ 2008-07-17 15:54 UTC (permalink / raw)
  To: pgsql-www


-----BEGIN PGP SIGNED MESSAGE-----
Hash: RIPEMD160


> Its sad how this is such an ongoing problem, but this is the first that I hear
> that ppl are having problems ... looking at the message headers for a random
> few, I notice that they are scoring >4, but just below 5:
...
> I can change the quarantining to be >4 if ppl want, which should greatly reduce
> the # of messages going through ...

I think that would be a good start, but there are definitely some other problems.
First, the example you gave:

> X-Spam-Status: No, hits=4.855 tagged_above=0 required=5 tests=AWL=-1.994,
> DCC_CHECK=1.37, DIGEST_MULTIPLE=0.001, HTML_MESSAGE=0.001,
> MIME_HTML_ONLY=1.672, RAZOR2_CHECK=0.5, RCVD_IN_BL_SPAMCOP_NET=2.188,
> RCVD_IN_SORBS_WEB=1.117

A score of 0.001 for HTML_MESSAGE? Might as well not have the check at all. Same
with things like DIGEST_MULTIPLE. I think we need more checks, and much higher
scores for many of them.

I grabbed a few random messages from the bugs list last night. Most interesting
was that some had no X-Spam-Status headers at all - does this mean they slipped
through the spam filtering entirely? Here's one of them:

===
Return-Path: <[email protected]>
Delivered-To: [email protected]
Received: from localhost (unknown [200.46.204.183])
        by postgresql.org (Postfix) with ESMTP id C3148650275
        for <[email protected]>; Wed, 16 Jul 2008 15:40:45 -0300 (ADT)
Received: from postgresql.org ([200.46.204.86])
 by localhost (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024)
 with ESMTP id 48600-04-3 for <[email protected]>;
 Wed, 16 Jul 2008 15:40:43 -0300 (ADT)
X-Greylist: from auto-whitelisted by SQLgrey-1.7.6
Received: from wwwmaster.postgresql.org (wwwmaster.postgresql.org [217.196.146.204])
        by postgresql.org (Postfix) with ESMTP id AB1D565026D
        for <[email protected]>; Wed, 16 Jul 2008 15:40:44 -0300 (ADT)
Received: from wwwmaster.postgresql.org (wwwmaster.postgresql.org [217.196.146.204])
        by wwwmaster.postgresql.org (8.13.8/8.13.8) with ESMTP id m6GIehuA007983
        for <[email protected]>; Wed, 16 Jul 2008 18:40:43 GMT
        (envelope-from [email protected])
Received: (from www@localhost)
        by wwwmaster.postgresql.org (8.13.8/8.13.8/Submit) id m6GIehIP007982;
        Wed, 16 Jul 2008 18:40:43 GMT
        (envelope-from www)
Date: Wed, 16 Jul 2008 18:40:43 GMT
Message-Id: <[email protected]>
To: [email protected]
Subject: BUG #4310: PkMERMInZQ
From: "make money on line" <[email protected]>
Content-Type: text/plain; charset=utf-8
X-Virus-Scanned: Maia Mailguard 1.0.1


The following bug has been logged online:

Bug reference:      4310
Logged by:          make money on line
Email address:      [email protected]
PostgreSQL version: IUrjkiPgQkQXNgo
Operating system:   aJzBuaSGetA
Description:        PkMERMInZQ
Details:

<a href=" http://www.divinecaroline.com/public/user/profile?user_id=83997
">work at home jobs 101waystoincome.com</a>

====

Did it get whitelisted because it came from our form? I still think we
should scan it  - the "make money on line" is a dead giveaway, and
when I ran a local spamassassin on it, I even found:

 2.0 URIBL_BLACK            Contains an URL listed in the URIBL blacklist
                            [URIs: 101waystoincome.com]


Here's another one from last night that did have a spam header. I apologize
for how long this post is getting, but I'm trying to provide some hard data:
===

Return-Path: <[email protected]>
Delivered-To: [email protected]
Received: from localhost (unknown [200.46.204.183])
        by postgresql.org (Postfix) with ESMTP id AFB3A64FD01
        for <[email protected]>; Wed, 16 Jul 2008 23:15:20 -0300 (ADT)
Received: from postgresql.org ([200.46.204.86])
 by localhost (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024)
 with ESMTP id 35883-07 for <[email protected]>;
 Wed, 16 Jul 2008 23:15:11 -0300 (ADT)
X-Greylist: domain auto-whitelisted by SQLgrey-1.7.6
Received: from smtp1web.tin.it (smtp1web.tin.it [212.216.176.195])
        by postgresql.org (Postfix) with ESMTP id 8ECBB64FCE4
        for <[email protected]>; Wed, 16 Jul 2008 23:15:17 -0300 (ADT)
Received: from pswm6.cp.tin.it (192.168.70.26) by smtp1web.tin.it (8.0.016.5)
        id 48623AD8015C5727; Thu, 17 Jul 2008 03:59:43 +0200
Message-ID: <[email protected]>
Date: Thu, 17 Jul 2008 02:59:41 +0100 (GMT+01:00)
From: "Tajana for(Mrs. Lucy Berg)" <[email protected]>
Reply-To: [email protected]
Subject: REMINDER NOTIFICATION
Mime-Version: 1.0
Content-Type: text/plain;charset="UTF-8"
Content-Transfer-Encoding: 7bit
X-Originating-IP: 62.163.243.54
To: undisclosed-recipients:;
X-Virus-Scanned: Maia Mailguard 1.0.1
X-Spam-Status: No, hits=1.806 tagged_above=0 required=5
 tests=SUBJ_ALL_CAPS=1.806
X-Spam-Level: *

REMINDER NOTIFICATION

This email is to notify you that your Email
Address attached to a
Ticket Number(140408) has won an Award Sum of
($500,000.00)(Five
Hundred Thousand Dollars)In an Email Sweepstakes
program held in
The Netherlands these year 2008.Please contact the
claim officer
through the below given contact information.

MR.HANSON
CHRIS.
TEL. +31-643-502-787.
FAX: +31-847-290-539.
E-mail:cpinans@aol.
nl

WINNING INFORMATIONS
Ref Number:Nl50286
lucky Numbers:
07,12,24,36,45
Batch Number:EU-175508
Ticket Number:360208

Please
forward the above stated winning information to your Claim
Agent and do
include the following,

Your Name:
Telephone Number:

Congratulations!!!

Yours Sincerely,
Mrs. Lucy Berg.
Public Relation
Officer.

===

The only spam trigger found by postgresql.org was:

X-Spam-Status: No, hits=1.806 tagged_above=0 required=5
 tests=SUBJ_ALL_CAPS=1.806

There are numerous triggers in the body of the email that should
have boosted the score up. Personally, I'd also like to see
SUBJ_ALL_CAPS raised to 3 or 4.

So, to reiterate, I'd like to request the following:

1) Spam filtering is run on all messages
2) The default to reject is lowered to at least 4
3) The values get raised significantly for some tests
4) More SA tests get added (are we at least cronning sa-update?)
5) If 3 and 4 are too much trouble to maintain, outsource the
filtering to someone who does have the time, or who specializes
in it (economies of scale)

I did #5 myself years ago, after getting tired of updating SA rules,
messing with DNS lookups, blacklists, etc. and now just let
maillaunder.com handle it all.

- --
Greg Sabino Mullane [email protected]
PGP Key: 0x14964AC8 200807171149
http://biglumber.com/x/web?pk=2529DF6AB8F79407E94445B4BC9B906714964AC8

-----BEGIN PGP SIGNATURE-----

iEYEAREDAAYFAkh/atsACgkQvJuQZxSWSsjKKwCg4Pc0SNrYjfUZuJRQZjU6jDHR
oc0An0vTdKzfIJ3+CxQXpw7TZyWu0Tb6
=a3/E
-----END PGP SIGNATURE-----





^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 18:11  Marc G. Fournier <[email protected]>
  parent: Alvaro Herrera <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 18:11 UTC (permalink / raw)
  To: Alvaro Herrera <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www


On Thu, 17 Jul 2008, Alvaro Herrera wrote:

> Marc G. Fournier wrote:
>>
>> On Thu, 17 Jul 2008, Andrew Sullivan wrote:
>>
>>>> I can change the quarantining to be >4 if ppl want, which should
>>>> greatly reduce the # of messages going through ...
>>>
>>> Sounds like a good idea to me.
>>
>> 'k, I'll put it down to 4 tonight, and increase the frequency I check for
>> false-positives accordingly, to make sure that I don't get a big spike as
>> a result ...
>
> Hmm, I think keeping it at 5 is a good idea, seeing how 2 of those were
> being substracted by the AWL and you've just cleared that.
>
> Maybe what we need is a way to remove a particular domain from the AWL
> instead?

Actually, I was thinking of disabling the AWL feature altogether ... it 
might make a few extra false-positives to need to release, but I doubt it 
will make a huge difference ...

Maybe disable AWL and leave score threshold at 5 ... ?  I can adjust 
after, based on # of false positives ... if false positives don't go up, 
then without AWL causes no adverse problems ... then I can look at 
adjusting score threshold down while watching for false-positives ...




^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 18:15  Alvaro Herrera <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Alvaro Herrera @ 2008-07-17 18:15 UTC (permalink / raw)
  To: Marc G. Fournier <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www

Marc G. Fournier wrote:
>
> On Thu, 17 Jul 2008, Alvaro Herrera wrote:

>> Hmm, I think keeping it at 5 is a good idea, seeing how 2 of those were
>> being substracted by the AWL and you've just cleared that.
>>
>> Maybe what we need is a way to remove a particular domain from the AWL
>> instead?
>
> Actually, I was thinking of disabling the AWL feature altogether ... it  
> might make a few extra false-positives to need to release, but I doubt it 
> will make a huge difference ...

I think that's a bad idea because it'll delay all email.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
The PostgreSQL Company - Command Prompt, Inc.



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 19:58  Marc G. Fournier <[email protected]>
  parent: Alvaro Herrera <[email protected]>
  0 siblings, 1 reply; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 19:58 UTC (permalink / raw)
  To: Alvaro Herrera <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www


On Thu, 17 Jul 2008, Alvaro Herrera wrote:

> Marc G. Fournier wrote:
>>
>> On Thu, 17 Jul 2008, Alvaro Herrera wrote:
>
>>> Hmm, I think keeping it at 5 is a good idea, seeing how 2 of those were
>>> being substracted by the AWL and you've just cleared that.
>>>
>>> Maybe what we need is a way to remove a particular domain from the AWL
>>> instead?
>>
>> Actually, I was thinking of disabling the AWL feature altogether ... it
>> might make a few extra false-positives to need to release, but I doubt it
>> will make a huge difference ...
>
> I think that's a bad idea because it'll delay all email.

Wait, i think we are talking about two different things here ... I'm not 
talking about Greylisting AWL, I'm talking abotu the AWL that modifies the 
spamscore ... totally different databases ... I'm not touching 
GreyListing, only spamscore ...




^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 20:41  Alvaro Herrera <[email protected]>
  parent: Marc G. Fournier <[email protected]>
  0 siblings, 0 replies; 18+ messages in thread

From: Alvaro Herrera @ 2008-07-17 20:41 UTC (permalink / raw)
  To: Marc G. Fournier <[email protected]>; +Cc: Andrew Sullivan <[email protected]>; pgsql-www

Marc G. Fournier wrote:
>
> On Thu, 17 Jul 2008, Alvaro Herrera wrote:
>
>> Marc G. Fournier wrote:

>>> Actually, I was thinking of disabling the AWL feature altogether ... it
>>> might make a few extra false-positives to need to release, but I doubt it
>>> will make a huge difference ...
>>
>> I think that's a bad idea because it'll delay all email.
>
> Wait, i think we are talking about two different things here ... I'm not  
> talking about Greylisting AWL, I'm talking abotu the AWL that modifies 
> the spamscore ... totally different databases ... I'm not touching  
> GreyListing, only spamscore ...

Ah, sorry for the noise then -- I see no problem with that.

-- 
Alvaro Herrera                                http://www.CommandPrompt.com/
PostgreSQL Replication, Consulting, Custom Development, 24x7 support



^ permalink  raw  reply  [nested|flat] 18+ messages in thread

* Re: Spam filtering on the mailing lists
@ 2008-07-17 20:48  Marc G. Fournier <[email protected]>
  parent: Greg Sabino Mullane <[email protected]>
  0 siblings, 0 replies; 18+ messages in thread

From: Marc G. Fournier @ 2008-07-17 20:48 UTC (permalink / raw)
  To: Greg Sabino Mullane <[email protected]>; pgsql-www

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1



- --On Thursday, July 17, 2008 15:54:41 +0000 Greg Sabino Mullane 
<[email protected]> wrote:

> I grabbed a few random messages from the bugs list last night. Most
> interesting was that some had no X-Spam-Status headers at all - does this
> mean they slipped through the spam filtering entirely? Here's one of them:
>
> ===
> Return-Path: <[email protected]>
> Delivered-To: [email protected]
> Received: from localhost (unknown [200.46.204.183])
>         by postgresql.org (Postfix) with ESMTP id C3148650275
>         for <[email protected]>; Wed, 16 Jul 2008
> 15:40:45 -0300 (ADT) Received: from postgresql.org ([200.46.204.86])
>  by localhost (mx1.hub.org [200.46.204.183]) (amavisd-maia, port 10024)
>  with ESMTP id 48600-04-3 for <[email protected]>;
>  Wed, 16 Jul 2008 15:40:43 -0300 (ADT)
> X-Greylist: from auto-whitelisted by SQLgrey-1.7.6
> Received: from wwwmaster.postgresql.org (wwwmaster.postgresql.org
> [217.196.146.204])         by postgresql.org (Postfix) with ESMTP id
> AB1D565026D
>         for <[email protected]>; Wed, 16 Jul 2008 15:40:44 -0300 (ADT)
> Received: from wwwmaster.postgresql.org (wwwmaster.postgresql.org
> [217.196.146.204])         by wwwmaster.postgresql.org (8.13.8/8.13.8) with
> ESMTP id m6GIehuA007983         for <[email protected]>; Wed, 16 Jul
> 2008 18:40:43 GMT         (envelope-from [email protected])
> Received: (from www@localhost)
>         by wwwmaster.postgresql.org (8.13.8/8.13.8/Submit) id m6GIehIP007982;
>         Wed, 16 Jul 2008 18:40:43 GMT
>         (envelope-from www)
> Date: Wed, 16 Jul 2008 18:40:43 GMT
> Message-Id: <[email protected]>
> To: [email protected]
> Subject: BUG #4310: PkMERMInZQ
> From: "make money on line" <[email protected]>
> Content-Type: text/plain; charset=utf-8
> X-Virus-Scanned: Maia Mailguard 1.0.1

The X-Virus-Scanned indicates it went through Amavis/Maia, not sure why it 
would have bypassed the spam checker though ... let me check into it to make 
sure, as I thought both "outbound" and "inbound" was scanned, but maybe just 
for virus, with the expectation that if you are using a spam scanner on 
incoming, you aren't sending *out* spam ...



- -- 
Marc G. Fournier        Hub.Org Hosting Solutions S.A. (http://www.hub.org)
Email . [email protected]                              MSN . [email protected]
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (FreeBSD)

iEYEARECAAYFAkh/sBYACgkQ4QvfyHIvDvMDowCfXL1i6nLlR8u8zsWmJVr4z8Ue
r1EAniwfru0hTeNWPliF+nBvANpxOFaf
=3zw9
-----END PGP SIGNATURE-----





^ permalink  raw  reply  [nested|flat] 18+ messages in thread


end of thread, other threads:[~2008-07-17 20:48 UTC | newest]

Thread overview: 18+ messages (download: mbox mbox.gz follow: Atom feed)
-- links below jump to the message on this page --
2008-07-16 14:14 Spam filtering on the mailing lists Greg Sabino Mullane <[email protected]>
2008-07-16 15:09 ` Bruce Momjian <[email protected]>
2008-07-16 17:01   ` Joshua D. Drake <[email protected]>
2008-07-16 18:07     ` Andrew Sullivan <[email protected]>
2008-07-16 22:35       ` Joshua D. Drake <[email protected]>
2008-07-17 02:17       ` Marc G. Fournier <[email protected]>
2008-07-17 02:58         ` Joshua D. Drake <[email protected]>
2008-07-17 03:01         ` Marc G. Fournier <[email protected]>
2008-07-17 12:45           ` Andrew Sullivan <[email protected]>
2008-07-17 12:46         ` Andrew Sullivan <[email protected]>
2008-07-17 15:41           ` Marc G. Fournier <[email protected]>
2008-07-17 15:54             ` Alvaro Herrera <[email protected]>
2008-07-17 18:11               ` Marc G. Fournier <[email protected]>
2008-07-17 18:15                 ` Alvaro Herrera <[email protected]>
2008-07-17 19:58                   ` Marc G. Fournier <[email protected]>
2008-07-17 20:41                     ` Alvaro Herrera <[email protected]>
2008-07-17 15:54         ` Greg Sabino Mullane <[email protected]>
2008-07-17 20:48           ` Marc G. Fournier <[email protected]>

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox