NAME


rabema_evaluate - RABEMA Evaluation

SYNOPSIS

rabema_evaluate [OPTIONS] --reference REF.fa --in-gsi IN.gsi --in-sam MAPPING.sam
rabema_evaluate [OPTIONS] --reference REF.fa --in-gsi IN.gsi --in-bam MAPPING.bam

DESCRIPTION

Compare the SAM/BAM output MAPPING.sam/MAPPING.bam of any read mapper against the
RABEMA gold standard previously built with rabema_build_gold_standard. The input is
a reference FASTA file, a gold standard interval (GSI) file and the SAM/BAM input
to evaluate.

The input SAM/BAM file must be sorted by queryname. The program will create a FASTA
index file REF.fa.fai for fast random access to the reference.
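
Because the input must be sorted by queryname, it can help to check a SAM file before
evaluating it. The following Python sketch is not part of RABEMA; the function name
is hypothetical. It only tests a necessary condition, namely that all records of one
read name are adjacent, and does not verify any particular sort order.

```python
import io

def is_grouped_by_queryname(sam_lines):
    """Return True if every read name forms one contiguous run of records."""
    seen = set()
    previous = None
    for line in sam_lines:
        if not line.strip() or line.startswith("@"):
            continue  # skip blank lines and SAM header lines
        qname = line.split("\t", 1)[0]  # QNAME is the first SAM column
        if qname != previous:
            if qname in seen:
                return False  # name reappears after another read intervened
            seen.add(qname)
            previous = qname
    return True

# Tiny in-memory examples (made-up records, truncated to four columns).
grouped = io.StringIO("@HD\tVN:1.0\nr1\t0\tchr1\t10\nr1\t16\tchr1\t90\nr2\t0\tchr1\t5\n")
scattered = io.StringIO("r1\t0\tchr1\t10\nr2\t0\tchr1\t5\nr1\t16\tchr1\t90\n")
print(is_grouped_by_queryname(grouped))    # True
print(is_grouped_by_queryname(scattered))  # False
```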

-h, --help

Displays this help message.

--version

Display version information.

-v, --verbose

Enable verbose output.

-vv, --very-verbose

Enable even more verbose output.

Input / Output:

-r, --reference FASTA

Path to load reference FASTA from. Valid filetypes are: fa and fasta.

-g, --in-gsi GSI

Path to load gold standard intervals from. If compressed using gzip, the file will
be decompressed on the fly. Valid filetypes are: gsi and gsi.gz.

-s, --in-sam SAM

Path to load the read mapper SAM output from. Valid filetype is: sam.

-b, --in-bam BAM

Path to load the read mapper BAM output from. Valid filetype is: bam.

--out-tsv TSV

Path to write the statistics to as TSV. Valid filetype is: tsv.
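
The input/output options above can be combined into one invocation. The Python sketch
below only assembles an argument list from flags documented on this page; the helper
name and the output file name stats.tsv are hypothetical, and nothing is executed.

```python
import shlex

def build_rabema_evaluate_command(reference, gsi, mapping, out_tsv=None):
    """Assemble an argv list using only flags documented on this page."""
    argv = ["rabema_evaluate", "--reference", reference, "--in-gsi", gsi]
    lower = mapping.lower()
    if lower.endswith(".sam"):
        argv += ["--in-sam", mapping]
    elif lower.endswith(".bam"):
        argv += ["--in-bam", mapping]
    else:
        raise ValueError("mapping file must end in .sam or .bam: " + mapping)
    if out_tsv is not None:
        argv += ["--out-tsv", out_tsv]
    return argv

cmd = build_rabema_evaluate_command("REF.fa", "IN.gsi.gz", "MAPPING.bam", "stats.tsv")
print(shlex.join(cmd))
```

Choosing --in-sam or --in-bam from the file extension mirrors the valid filetypes
listed for each option.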

Benchmark Parameters:

--oracle-mode

Enable oracle mode. This is used for simulated data when the input GSI file gives
exactly one position that is considered as the true sample position.

--only-unique-reads

Consider only reads that have a single alignment in the mapping result file. Useful
for precision computation.

--match-N

When set, N matches all characters without penalty.

--distance-metric METRIC

Set distance metric. One of hamming and edit. Default: edit.
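
To illustrate the two metrics and the effect of --match-N, here is a minimal Python
sketch. It is a textbook formulation, not RABEMA's implementation: the function names
and the match_n parameter are hypothetical, and the edit distance shown is the plain
global (Levenshtein) variant on two whole strings.

```python
def _same(x, y, match_n):
    """Characters match if equal, or if match_n is set and either one is N."""
    return x == y or (match_n and "N" in (x, y))

def hamming_distance(a, b, match_n=False):
    """Count mismatching positions; only substitutions are allowed."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs equal-length sequences")
    return sum(0 if _same(x, y, match_n) else 1 for x, y in zip(a, b))

def edit_distance(a, b, match_n=False):
    """Levenshtein distance: substitutions, insertions and deletions cost 1."""
    previous = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        current = [i]
        for j, y in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if _same(x, y, match_n) else 1)
            current.append(min(previous[j] + 1, current[j - 1] + 1, substitution))
        previous = current
    return previous[-1]

print(hamming_distance("ACGT", "ACCT"))                # 1
print(edit_distance("ACGT", "AGT"))                    # 1
print(hamming_distance("ACNT", "ACGT"))                # 1
print(hamming_distance("ACNT", "ACGT", match_n=True))  # 0
```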

-e, --max-error RATE

Maximal error rate, in percent, that the gold standard is built for. This parameter
is an integer and relative to the read length. The error rate is ignored in oracle
mode, where the distance of the read at the sample position is taken instead,
individually for each read. Default: 0.
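
As a worked example of an error rate relative to the read length, the Python sketch
below converts a percentage into an absolute number of errors per read. The function
name is hypothetical, and this page does not state how rabema_evaluate rounds
fractional values, so the rounding rule is an explicit parameter and floor is only an
assumed default.

```python
import math

def error_budget(read_length, max_error_percent, rounding=math.floor):
    """Absolute number of errors allowed for one read (rounding is assumed)."""
    return int(rounding(read_length * max_error_percent / 100))

print(error_budget(100, 4))                     # 4
print(error_budget(36, 4))                      # 1  (floor of 1.44)
print(error_budget(36, 4, rounding=math.ceil))  # 2  (ceil of 1.44)
```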

-c, --benchmark-category CAT

Set benchmark category. One of all, all-best, and any-best. Default: all.

--trust-NM

When set, we trust the alignment and distance from SAM/BAM file and no realignment
is performed. Off by default.

--ignore-paired-flags

When set, we ignore all SAM/BAM flags related to pairing. This is necessary when
analyzing SAM from SOAP's soap2sam.pl script.

--DONT-PANIC

Do not stop program execution if an additional hit was found that indicates that
the gold standard is incorrect.

Logging:

--show-missed-intervals

Show details for each missed interval from the GSI.

--show-invalid-hits

Show details for invalid hits (with too high error rate).

--show-additional-hits

Show details for additional hits (low enough error rate but not in gold standard).

--show-hits

Show details for hit intervals.

--show-try-hit

Show details for each alignment in SAM/BAM input.

The occurrence of "invalid" hits in the read mapper's output is not an error. If
there are additional hits, however, this shows an error in the gold standard.

RETURN VALUES

A return value of 0 indicates success, any other value indicates an error.
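
A calling script can rely on this convention. The Python sketch below shows one way
to test the exit status; the helper name is hypothetical, and the Python interpreter
is used as a stand-in command so the example runs without RABEMA installed.

```python
import subprocess
import sys

def run_and_check(argv):
    """Run a command and return True only if it exits with status 0."""
    completed = subprocess.run(argv)
    return completed.returncode == 0

# Stand-in commands; in practice argv would start with "rabema_evaluate".
ok = run_and_check([sys.executable, "-c", "raise SystemExit(0)"])
failed = run_and_check([sys.executable, "-c", "raise SystemExit(3)"])
print(ok, failed)  # True False
```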

MEMORY REQUIREMENTS

From version 1.1, great care has been taken to keep the memory requirements as low
as possible.

The evaluation step needs to store the whole reference sequence in memory but little
beyond that. So, for the human genome, the memory requirements are below 4 GB,
regardless of the size of the GSI or SAM/BAM file.
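
A rough back-of-the-envelope check of this figure, as a Python sketch. It assumes one
byte per reference base and a human genome of about 3.1 billion bases; both numbers
are assumptions for illustration and not taken from RABEMA.

```python
def reference_memory_gib(total_bases, bytes_per_base=1):
    """Estimate memory for holding a reference in RAM (assumed 1 byte per base)."""
    return total_bases * bytes_per_base / 2**30

human_genome_bases = 3_100_000_000  # approximate, assumed value
print(round(reference_memory_gib(human_genome_bases), 2))  # 2.89
```

Under these assumptions the reference alone stays under the 4 GB bound, leaving
headroom for the remaining data structures.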

REFERENCES

M. Holtgrewe, A.-K. Emde, D. Weese and K. Reinert. A Novel And Well-Defined
Benchmarking Method For Second Generation Read Mapping, BMC Bioinformatics 2011,
12:210.

http://www.seqan.de/rabema

RABEMA Homepage

http://www.seqan.de/mason

Mason Homepage

VERSION

rabema_evaluate version: 1.2.0
Last update March 14, 2013
