What's with bogofilter and spam
Anne Wilson
cannewilson at googlemail.com
Thu Jun 4 11:13:30 UTC 2009
On Thursday 04 June 2009 11:26:11 Rodd Clarkson wrote:
> On Thu, 2009-06-04 at 05:05 -0500, Mike Chambers wrote:
> > On Thu, 2009-06-04 at 20:01 +1000, Rodd Clarkson wrote:
> > > Recently bogofilter's spam sensing abilities seem to have gone all wrong
> > > in evolution.
> > >
> > > I was getting way too much span in the inbox (maybe 10% of my spam
> > > wasn't getting detected) and even though I was highlighting it and
> > > marking it as spam the same sorts of messages kept appearing.
> > >
>
> <snip>
>
> > > Are other noticing the same issues?
> > >
> > > I prefer bogofilter over spamassassin as the latter takes forever to
> > > filter through email, especially when you've been on holidays for a week
> > > and have to pull a couple of 1000 messages.
> >
> > Experienced everything you did, to include the marking my Fedora
> > messages as spam as well. Just doesn't seem bogofilter and/or evo is
> > not working together like they did in F10.
> >
> > I thought I was the only one experiencing this.
>
> Filed as: https://bugzilla.redhat.com/show_bug.cgi?id=504112
>
Spammers are getting a lot more clever/careful these days, using words that
won't be detected. I've found that I have to collect spam and 'unsure' ham
into folders until I get a reasonable number, then every few days I run
bash /usr/share/bogofilter/contrib/contrib/trainbogo.sh -c -H /home/anne/Maildir/.INBOX.bogotrain_ham/cur/ -S /home/anne/Maildir/.INBOX.bogotrain_spam/cur/
(watch for line-wrap - it's all one line), repeating until the missed spam is
down to about 3. I then delete all the tested messages and collect the next
batch. I'm still seeing a number of unsures, but bogofilter is definitely
learning the new stuff.
If you are seeing ham messages being detected as spam, copy a large number of
similar messages, for instance mailing-list messages, into your ham testing
folder before the run. Doing this a few times should sort out any
mis-training already there. HTH
Anne
--
New to KDE4? - get help from http://userbase.kde.org
Just found a cool new feature? Add it to UserBase
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
URL: <http://listman.redhat.com/archives/fedora-test-list/attachments/20090604/e928cbbf/attachment.sig>
More information about the fedora-test-list
mailing list