pegasus-statistics - Online in the Cloud

This is the command pegasus-statistics that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

PROGRAM:

NAME


pegasus-statistics - A tool to generate statistics about the workflow run.

SYNOPSIS


pegasus-statistics [-h|--help]
[-o|--output dir]
[-c|--conf propfile]
[-p|--statistics-level level]
[-t|--time-filter filter]
[-i|--ignore-db-inconsistency]
[-v|--verbose]
[-q|--quiet]
[-m|--multiple-wf]
[-p|--ispmc]
[-u|--isuuid]
[[submitdir ..] | [workflow_uuid ..]]

DESCRIPTION


pegasus-statistics generates statistics about the workflow run like total jobs/tasks/sub
workflows ran, how many succeeded/failed etc. It generates job instance statistics like
run time, condor queue delay etc. It generates invocation statistics information grouped
by transformation name. It also generates job instance and invocation statistics
information grouped by time and host.

OPTIONS


-h, --help
Prints a usage summary with all the available command-line options.

-o dir, --output dir
Writes the output to the given directory.

-c propfile, --conf propfile
The properties file to use. This option overrides all other property files.

-s level, --statistics-level level
Specifies the statistics information to generate. Valid levels are: all, summary,
wf_stats, jb_stats, tf_stats, and ti_stats. Default is summary. The output generated
by pegasus-statistics is based on the the level set:

· all: generates all the statistics information.

· summary: generates the workflow statistics summary. In the case of a hierarchical
workflow the summary is across all sub workflows.

· wf_stats: generates the workflow statistics information of each individual
workflow. In case of a hierarchical workflow the workflow statistics are created
for each sub workflow.

· jb_stats: generates the job statistics information of each individual workflow. In
case of hierarchical workflow the job statistics is created for each sub
workflows. Note: Not supported when generating statistics over multiple workflows.

· tf_stats: generates the invocation statistics information of each individual
workflow grouped by transformation name .In case of hierarchical workflow the
transformation statistics is created for each sub workflows.

· ti_stats: generates the job instance and invocation statistics like total count
and runtime grouped by time and host.

-t filter, --time-filter filter
Specifies the time filter to group the time statistics. Valid filter values are:
month, week, day, hour. Default is day.

-i, --ignore-db-inconsistency
Turn off the the check for database consistency.

-v, --verbose
Increases the log level. If omitted, the default level will be set to WARNING. When
this option is given, the log level is changed to INFO. If this option is repeated,
the log level will be changed to DEBUG.

-q, --quiet
Decreases the log level. If omitted, the default level will be set to WARNING. When
this option is given, the log level is changed to ERROR.

-m, --multiple-wf
Set this option when generating statistics over more than one workflow. The tool
automatically sets this flag if multiple submit directories or multiple workflow UUIDs
are provided. This option would need to be set explicitly only to generate statistics
over all workflows in a single STAMPEDE database. NOTE: When workflows are specified
as UUIDs the --conf options needs to be set for the tool to determine the STAMPEDE
database URL.

-p, --ispmc
Set this flag to generate statistics for workflows which are run with PMC clustering
enabled. It is recommended that this option be used when calculating statistics over
multiple workflow runs.

-u, --isuuid
Set this option if the positional argument are workflow UUIDs. NOTE: When workflows
are specified as UUIDs the --conf options needs to be set for the tool to determine
the STAMPEDE database URL.

EXAMPLE


Runs pegasus-statistics and writes the output to the given directory:

$ pegasus-statistics -o /scratch/statistics /scratch/grid-setup/run0001

Runs pegasus-statistics over a workflow run identified by a single workflow UUID:

$ pegasus-statistics --conf pegasusrc --isuuid 316f2986-7754-44ec-8b38-fcd0cb602ce0

Runs pegasus-statistics over a workflow run identified by a multiple workflow UUID:

$ pegasus-statistics --conf pegasusrc --isuuid 316f2986-7754-44ec-8b38-fcd0cb602ce0
7ef77af8-4eb2-45ca-b37d-c5a02186133a

Runs pegasus-statistics over all workflows in the STAMPEDE database:

$ pegasus-statistics --conf pegasusrc --multiple-wf

AUTHORS


Prasanth Thomas Rajiv Mayani

Pegasus Team http://pegasus.isi.edu

Use pegasus-statistics online using onworks.net services



Latest Linux & Windows online programs