bogofilter - 0.15.12 - new current release

From: David Relson (relson_at_osagesoftware.com)
Date: 12/24/03

    Date: Wed, 24 Dec 2003 13:17:45 CST
    
    

    Bogofilter is a mail filter that classifies mail as spam or ham
    (non-spam) by a statistical analysis of the message's header and content
    (body). The program is able to learn from the user's classifications
    and corrections.

    The statistical technique is known as the Bayesian technique and its use
    for spam was described by Paul Graham in his article "A Plan For Spam".
    Gary Robinson, in his weblog Rants, suggests some refinements for
    improved discrimination between spam and ham. Bogofilter's primary
    algorithm uses the f(w) parameter and the Fisher inverse chi-square
    technique that he describes.
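
    As a rough illustration of the two ideas named above, the following
    Python sketch follows Robinson's published description. It is not taken
    from bogofilter's C source. The function names and the default values
    s=1.0 and x=0.5 are assumptions chosen for the example.

```python
import math

def f_w(p_w, n, s=1.0, x=0.5):
    """Robinson's f(w): pull the raw spam estimate p_w toward the prior x.

    n is how many training messages contained the word; s is the weight
    given to the prior. With n == 0 the result is simply x.
    """
    return (s * x + n * p_w) / (s + n)

def chi2q(chi, dof):
    """Upper-tail probability of a chi-square value (dof must be even)."""
    m = chi / 2.0
    term = math.exp(-m)
    total = term
    for i in range(1, dof // 2):
        term *= m / i
        total += term
    return min(total, 1.0)

def fisher_score(probs):
    """Combine per-word f(w) values; results near 1 suggest spam, near 0 ham.

    Every value in probs must lie strictly between 0 and 1.
    """
    n = len(probs)
    h = chi2q(-2.0 * sum(math.log(p) for p in probs), 2 * n)
    s = chi2q(-2.0 * sum(math.log(1.0 - p) for p in probs), 2 * n)
    return (1.0 + h - s) / 2.0
```

    A message whose words all score 0.5 comes out at 0.5, which is the
    "unsure" middle of the range. This sketch does not guard against
    floating-point underflow on very long messages.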

    Bogofilter is run by an MDA script to classify an incoming message as
    spam or ham (using wordlists stored by BerkeleyDB). Bogofilter provides
    processing for plain text and HTML, supports multi-part MIME messages
    with decoding of base64, quoted-printable, and uuencoded text, and
    ignores attachments such as images.
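
    The kind of MIME handling described above can be sketched with Python's
    standard email package. This is only a stand-in to show the idea;
    bogofilter does this work in its own C lexer. The sample message and the
    text_parts helper are made up for the example.

```python
import base64
from email import message_from_string

# A made-up three-part message: base64 text, quoted-printable HTML, an image.
RAW = "\n".join([
    "MIME-Version: 1.0",
    'Content-Type: multipart/mixed; boundary="XX"',
    "",
    "--XX",
    "Content-Type: text/plain; charset=us-ascii",
    "Content-Transfer-Encoding: base64",
    "",
    base64.b64encode(b"cheap pills now").decode("ascii"),
    "--XX",
    "Content-Type: text/html; charset=us-ascii",
    "Content-Transfer-Encoding: quoted-printable",
    "",
    "<p>click=20here</p>",
    "--XX",
    "Content-Type: image/png",
    "Content-Transfer-Encoding: base64",
    "",
    "iVBORw0KGgo=",
    "--XX--",
    "",
])

def text_parts(raw):
    """Yield the decoded text of each text/* part, skipping everything else."""
    msg = message_from_string(raw)
    for part in msg.walk():
        if part.get_content_maintype() != "text":
            continue  # multipart containers, images, applications, ...
        payload = part.get_payload(decode=True)  # undoes base64 / quoted-printable
        charset = part.get_content_charset() or "ascii"
        yield payload.decode(charset, errors="replace")

parts = [p.strip() for p in text_parts(RAW)]
```

    Only the two text parts survive; the image part never reaches the
    tokenizer.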

    Bogofilter is written in C. Supported platforms: Linux, FreeBSD,
    Solaris, OS X, HP-UX, AIX, RISC-OS, OS/2, ...

    ******* ******* ******* ******* *******

    Bogofilter 0.15.12 is available on SourceForge. The download URL is
    http://sourceforge.net/project/showfiles.php?group_id=62265

    This release has a variety of minor fixes and changes and documentation
    updates. See the excerpt from NEWS-0.15 (below).

    If you are upgrading to 0.15.12 from a version older than 0.15.4, you
    should rebuild your token database to take advantage of header line
    tagging, which was expanded in 0.15.4.

                                   =================
                                    BOGOFILTER NEWS
                                   =================

    Be sure to read RELEASE.NOTES-0.15, with attention to the section
    about retraining and the '-H' flag...

    0.15.12 2003-12-24

    * Fix decoding of escaped urls.
    * Fix sigsegv in bogotune when '-D' option is used.
    * Fix compilation problems with datastore_tdb.c and datastore_qdbm.c
    * Force line buffered output when not in passthrough mode.
    * Minor fixes for parsing msg-count files.
    * Clarify list of supported mail formats in FAQ.
    * Enhance memdebug capabilities.

    * Distinguish between pipe and stdin as input sources.
      (contributed by Henning Makholm)

    0.15.11 2003-12-17

    * Fixed CRLF problem in bogoreader.c
    * Exclude ~ (tilde) at the end of tokens.
    * Updates to configure scripts, makefiles and contrib files to support
      DGUX.
    * bogotune-faq.html - new file
    * bogofilter - man page and FAQ updated
    * bogoupgrade - revised help message and man page
    * Improved configuration of BerkeleyDB and GSL.
    * Removed unused '-F' (force) option from bogofilter.
    * Removed unused '-q' (quiet) option from bogofilter.
    * Bugfix for SIGFPE (division by zero) crash on start-up on systems
      with BerkeleyDB 3.2 or older.
    * bogotune now understands degeneration options and can use
      them when creating message-count files.

    0.15.10 2003-12-08

    * Multiple fixes, revisions, and changes to bogotune.
    * Added -M option to bogotune for creating message count files.
    * Fixed bug in header degeneration.
    * Added degeneration options to config file.
    * Added subject line tagging for Unsures.
    * Formatting and portability fixes for DGUX.
    * Fixed "configure --enable-static" for building statically linked
      executables.
    * The test suite now uses static executables when "configure
      --enable-static" has been used.
    * The test suite no longer depends on procmail for t.MH and t.maildir
      or formail for t.bulkmode.
    * Moved robx calculation code to new file for sharing by bogoutil and
      bogotune.
    * Fix segfault when using '-H' (header_degen) option.

    0.15.9 2003-11-23

    * Configure now finds a POSIX compliant shell for running version.sh
    * Remove --disable-* options for algorithms. They have never been
      well supported and serve no useful purpose; the algorithm code is
      insignificant compared to the lexer and other code.
    * Fixed a memory leak in bogoutil.
    * Cleaned up help message in bogoutil.
    * Bogotune now checks for incorrectly classified messages in the test
      data and exits if it finds any.
    * Bogotune's memory needs lessened.
    * Fixed timestamp config option.
    * Exclude apostrophes and backticks at the end of a token.
    * Lexer changes reduce size of bogofilter executable by approx 90%.
    * Lexer.c no longer discards X-Bogosity lines in rfc822 attachments.
    * Removed repetition counts in lexer for TOKEN and MIME_BOUNDARY
      patterns to reduce executable size.
    * "<!DOCTYPE HTML PUBLIC...>" is now recognized as starting html text.
    * Several minor lexer bugs fixed.
    * Updated bogominitrain.pl to v1.4.2
    * TDB passes all checks again.
    * QDBM support fixed.
    * Minor documentation fixes.
    * Minor error message cleanups.
    * Refactored passthrough.c.
    * Test suite bugfixes for TDB/QDBM.
    * BerkeleyDB support warns if database size approaches file size
      resource limit.

    0.15.8 2003-10-29

    * Modified handling of mime attachments to decode rfc822 and to
      ignore applications and images.
    * Added decoding of percent escaped characters in URLs.
    * Script tuning/bogotune rewritten as C program.
    * Added man page for bogotune.
    * Print "X-Bogosity" line when "-t" is used alone.
    * Change bogoupgrade back to using 2 arg open for perl-5.6
      compatibility.
    * Initialize wordhash storage.
    * Fix initialization problem that prevents reading more than
      one msg-count file.
    * Configure script modified to better detect BerkeleyDB libs.
    * Makefile modified to build bogolexer and bogoutil with fewer
      shared libs.
    * Fix build problem in doc directory.
    * English and French versions of bogofilter-faq.html revised.

    0.15.7 2003-10-13 - Stable Release

    * Added decoding of escaped characters in html.
    * Disable header line tagging when processing msg-count files.
    * Revised mailbox processing so type recognition is now table driven.
    * Include all tokens in bogoutil dump output (unless in maintenance
      mode).
    * Added support for ANT mailboxes.
    * Made portability changes for OS/2 and RISC-OS

    ##########################################################################
    # Send submissions for comp.os.linux.announce to: cola@stump.algebra.com #
    # PLEASE remember a short description of the software and the LOCATION. #
    # This group is archived at http://stump.algebra.com/~cola/ #
    ##########################################################################

