Re: bayesian filter training question

From: Roberto C. Sanchez (roberto_at_familiasanchez.net)
Date: 09/30/05

  • Next message: Roberto C. Sanchez: "Re: subversion 1.2.3a for sarge?"
    Date: Fri, 30 Sep 2005 07:36:05 -0400
    To: debian-user@lists.debian.org
    
    
    

    On Fri, Sep 30, 2005 at 09:14:53AM +0200, Kjetil Kjernsmo wrote:
    > On torsdag 29 september 2005, 21:51, Roberto C. Sanchez wrote:
    > > So, I finally decided to get with the 20th century and install
    > > spamassassin (acutally spampd hooked through postfix) to do site-wide
    > > spam filtering for my server.
    >
    > Yiiihaaa!
    >
    > > My question is this.  As I am training
    > > it with sa-learn, is it (good|bad|indifferent) to train it on spam
    > > that has already been flagged as spam.  That is, will this reinforce
    > > spamassassin's notion of spam or ruin it?
    >
    > No, that's fine. In fact, SA has this autowhitelist concept that does
    > exactly that (it's not really a whitelist, though, more an "evening out
    > weird things that may happen", I'm not using it).
    >
    > You should have a good look at bayes_ignore_header, so that it won't
    > train on things that are obviously in spam. SA is pretty good it this
    > itself, but if you see spam that has been filtered elsewhere a lot, be
    > sure to use it.
    >
    > I'm guessing that you, like me, are doing this for your family. In that
    > case, I have found that it is quite sufficient to train a single
    > database with the spam and ham of the entire family. If you have more
    > diverse users, you would probably need to have a per-user
    > configuration. For example, a friend of mine has an uncle who is a
    > psychiatrist working with people with gambling obsessions, and SA was
    > pretty catastrophic for him until he got a per-user config.
    >
    > Finally, I found that SA, in it's default 3.0-form was much too
    > conservative about the assigned scores, so I have a bunch of rules that
    > I have adjusted the score of. You'll get some experience about that in
    > time, I guess. Also note that SA 3.1 has been released upstream.
    >
    Cool. Thanks for the quick informative reply.

    -Roberto

    -- 
    Roberto C. Sanchez
    http://familiasanchez.net/~roberto
    
    

    -- 
    To UNSUBSCRIBE, email to debian-user-REQUEST@lists.debian.org 
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
    


  • Next message: Roberto C. Sanchez: "Re: subversion 1.2.3a for sarge?"

    Relevant Pages

    • Re: newbie asking for help with outlook express
      ... First, the spammers ... Second, by using an invalid domain, any spam generator will end up sending ... If they don't provide spam filtering then check ... >> Hotmail accounts then get used to some spam. ...
      (microsoft.public.windowsxp.basics)
    • Re: Antivirus
      ... The came out as the best out of 30 AV-programs on virus scanning, ... Soon to come anti spam for Exchange. ... Easy to install, maintain and update, even for remote workers. ... >> It has some nice features like Spam filtering. ...
      (microsoft.public.backoffice.smallbiz2000)
    • Re: OT Spam filters?
      ... but just tapping into the wide experience base in here. ... I need to set up a spam filtering system, but with the plethora of various products ... optionally be tagged either in subject or header fields, ...
      (uk.comp.homebuilt)
    • Re: Challenge-response mail filters considered harmful
      ... effective blocking of unwanted emails (spam, ... I am sure that for some your Challenge Response program ... approach) spam filtering is the preferred approach. ...
      (Debian-User)
    • Re: need spam filter for Outlook Express on XP64
      ... I just assumed that it would need to be 64-bit or at least compatible with whatever Outlook Express runs on XP64. ... The way Cloudmark plugs in to Outlook Express is the way I want the next spam-filter to work. ... I run the mail server and I don't want to run spam filtering software on that box. ...
      (microsoft.public.windows.inetexplorer.ie6_outlookexpress)