EnglishFrenchSpanish

OnWorks favicon

bogotune-sqlite - Online in the Cloud

Run bogotune-sqlite in OnWorks free hosting provider over Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

This is the command bogotune-sqlite that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

PROGRAM:

NAME


bogotune - find optimum parameter settings for bogofilter

SYNOPSIS


bogotune [-v] [-c config] [-C] [-d dir] [-D] [-r value] [-T value]
-n okfile [[-n] okfile [...]] -s spamfile [[-s] spamfile [...]] [-M file]

bogotune [-h]

DESCRIPTION


Bogotune tries to find optimum parameter settings for bogofilter. It needs at least one
set each of spam and non-spam messages. The production wordlist is normally used, but it
can be directed to read a different wordlist, or to build its own from half the supplied
messages.

In order to produce useful results, bogotune has minimum message count requirements. The
wordlist it uses must have at least 2,000 spam and 2,000 non-spam in it and the message
files must contain at least 500 spam and 500 non-spam messages. Also, the ratio of spam to
non-spam should be in the range 0.2 to 5. If you direct bogotune to build its own
wordlist, it will use the half the input or 2000 messages (whichever is larger) for the
wordlist.

Message files may be in mbox, maildir, or MH folder or any combination. Msg-count files
can also be used, but not mixed with other formats.

OPTIONS


The -h option prints the help message and exits.

The -v option increases the verbosity level. Level 1 displays the scan output in detail
instead of using a progress meter.

The -c filename option tells bogofilter to read the config file named.

The -C option prevents bogotune from reading a configuration file.

The -d dir option specifies the directory for the database. See the ENVIRONMENT section
for other directory setting options.

The -D option tells bogotune to build a wordlist in memory using the input messages. The
messages are read and divided into two groups. The first group is used to build a wordlist
(in ram) and the second is used for tuning. To meet the minimum requirements of 2000
messages in the wordlist and 500 messages for testing, when -D is used, there must be 2500
non-spam and 2500 spam in the input files. If there are enough messages (more than 4000),
they will be split evenly between wordlist and testing. Otherwise, they will be split
proportionately.

The -n option tells bogotune that the following argument is a file (or folder) containing
non-spam. Since version 1.0.3, multiple arguments to the -n option can be given. All
non-option arguments until the next -s option will be treated as though they had been
preceded by -n

The -s option tells bogotune that the following argument is a file (or folder) containing
spam. It can be repeated as often as necessary. Since version 1.0.3, multiple arguments to
the -s can be given. All non-option arguments until the next -n option will be treated as
though they had been preceded by -s.

The -r value option tells bogotune to use the following parameter as the robx value.

The -T value option tells bogotune to use the following parameter as fp target value.

The -M file option tells bogotune to convert the file to message count format. This format
provides a sorted list of each message's unique tokens, along with their ham and spam
counts. Sorting hides the sense of the messages quite effectively, thus protecting
privacy. The message-count format allows bogotune and bogofilter to score messages quickly
without needing the original token database.

ENVIRONMENT


Bogofilter uses a database directory, which can be set in the config file. If not set
there, bogofilter will use the value of BOGOFILTER_DIR. Both can be overridden by the
-ddir option. If none of that is available, bogofilter will use directory
$HOME/.bogofilter.

Use bogotune-sqlite online using onworks.net services


Free Servers & Workstations

Download Windows & Linux apps

  • 1
    GenX
    GenX
    GenX is a scientific program to refine
    x-ray refelcetivity, neutron
    reflectivity and surface x-ray
    diffraction data using the differential
    evolution algorithm....
    Download GenX
  • 2
    pspp4windows
    pspp4windows
    PSPP is a program for statistical
    analysis of sampled data. It is a free
    replacement for the proprietary program
    SPSS. PSPP has both text-based and
    graphical us...
    Download pspp4windows
  • 3
    Git Extensions
    Git Extensions
    Git Extensions is a standalone UI tool
    for managing Git repositories. It also
    integrates with Windows Explorer and
    Microsoft Visual Studio
    (2015/2017/2019). Th...
    Download Git Extensions
  • 4
    eSpeak: speech synthesis
    eSpeak: speech synthesis
    Text to Speech engine for English and
    many other languages. Compact size with
    clear but artificial pronunciation.
    Available as a command-line program with
    many ...
    Download eSpeak: speech synthesis
  • 5
    Sky Chart / Cartes du Ciel
    Sky Chart / Cartes du Ciel
    SkyChart is a software to draw chart of
    the night sky for the amateur astronomer
    from a bunch of stars and nebulae
    catalogs. See main web page for full
    download...
    Download Sky Chart / Cartes du Ciel
  • 6
    GSmartControl
    GSmartControl
    GSmartControl is a graphical user
    interface for smartctl. It allows you to
    inspect the hard disk and solid-state
    drive SMART data to determine its
    health, as w...
    Download GSmartControl
  • More »

Linux commands

  • 1
    abc2abc
    abc2abc
    abc2abc - a simple abc
    checker/re-formatter/transposer ...
    Run abc2abc
  • 2
    abc2ly
    abc2ly
    abc2ly - manual page for abc2ly
    (LilyPond) 2.18.2 ...
    Run abc2ly
  • 3
    coqmktop
    coqmktop
    coqmktop - The Coq Proof Assistant
    user-tactics linker ...
    Run coqmktop
  • 4
    coqtop
    coqtop
    coqtop - The Coq Proof Assistant
    toplevel system ...
    Run coqtop
  • 5
    g.copygrass
    g.copygrass
    g.copy - Copies available data files in
    the current mapset search path to the
    user�s current mapset. KEYWORDS:
    general, map management ...
    Run g.copygrass
  • 6
    g.dirsepsgrass
    g.dirsepsgrass
    g.dirseps - Internal GRASS utility for
    converting directory separator
    characters. Converts any directory
    separator characters in the input string
    to or from na...
    Run g.dirsepsgrass
  • More »

Ad