sam-stats - Online in the Cloud

This is the command sam-stats that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

PROGRAM:

NAME


sam-stats - ea-utils: produce digested statistics

SYNOPSIS


sam-stats [options] [file1] [file2...filen]

DESCRIPTION


Version: 1.38.681

Produces lots of easily digested statistics for the files listed

Options (default in parens):

-D Keep track of multiple alignments -O PREFIX Output prefix enabling
extended output (see below) -R FIL Coverage/RNA output (coverage, 3' bias, etc,
implies -A) -A Report all chr sigs, even if there are more than 1000 -b INT
Number of reads to sample for per-base stats (1M) -S INT Size of ascii-signature
(30) -x FIL File extension for handling multiple files (stats) -M Only
overwrite if newer (requires -x, or multiple files) -B Input is bam, don't
bother looking at magic -z Don't fail when zero entries in sam

OUTPUT:

If one file is specified, then the output is to standard out. If multiple files are
specified, or if the -x option is supplied, the output file is <filename>.<ext>. Default
extension is 'stats'.

Complete Stats:

<STATS>
: mean, max, stdev, median, Q1 (25 percentile), Q3

reads : # of entries in the sam file, might not be # reads

phred : phred scale used

bsize : # reads used for qual stats

mapped reads
: number of aligned reads (unique probe id sequences)

mapped bases
: total of the lengths of the aligned reads

forward
: number of forward-aligned reads

reverse
: number of reverse-aligned reads

snp rate
: mismatched bases / total bases (snv rate)

ins rate
: insert bases / total bases

del rate
: deleted bases / total bases

pct mismatch
: percent of reads that have mismatches

pct align
: percent of reads that aligned

len <STATS>
: read length stats, ignored if fixed-length

mapq <STATS>
: stats for mapping qualities

insert <STATS>
: stats for insert sizes

%<CHR> : percentage of mapped bases per chr, followed by a signature

Subsampled stats (1M reads max):
base qual <STATS> : stats for base qualities %A,%T,%C,%G : base percentages

Meaning of the per-chromosome signature:
A ascii-histogram of mapped reads by chromosome position. It is only output if the
original SAM/BAM has a header. The values are the log2 of the # of mapped reads at
each position + ascii '0'.

Extended output mode produces a set of files:
.stats : primary output

.fastx : fastx-toolkit compatible output

.rcov : per-reference counts & coverage

.xdist : mismatch distribution

.ldist : length distribution (if applicable)

.mqdist
: mapping quality distribution

Use sam-stats online using onworks.net services



Latest Linux & Windows online programs