About HPC Systems Software User Guides Education Partners

  / gears / hpc / software / bioinf / seg


Bioinformatics

Compilers and Programming Tools

Computational Chemistry

File System

Finite Element Solvers

Graphics

Mathematics

Numerical Libraries

Optimization

Parallel Programming Libraries and Tools

Queuing and Scheduling Systems

Solid Modeling

Statistics

seg

Overview

seg divides sequences into contrasting segments of low-complexity and high-complexity. Low-complexity segments defined by the algorithm represent "simple sequences" or "compositionally-biased regions".

Locally-optimized low-complexity segments are produced at defined levels of stringency, based on formal definitions of local compositional complexity. The segment lengths and the number of segments per sequence are determined automatically by the algorithm.

The input is a FASTA-formatted sequence file, or a database file containing many FASTA-formatted sequences. seg is tuned for amino acid sequences.

Setup

To use seg it is necessary to set your seg environment by running a special command sequence once per login session. You may optionally place these commands in your .cshrc (C Shell users) or .profile (Bourne Shell users) to avoid having to manually run these commands on login.

For csh and tcsh:

source /usr/local/setup/seg.setup.csh

For sh and bash:

source /usr/local/setup/seg.setup.sh
Usage

seg is invoked with the command seg.

Examples

The following is an example PBS script to run a seg job on LION-XE for a maximum of 2 hours. The FASTA formatted input file is called input.fa and is found in the scope of this example in /home/foo/seg. Since seg prints its output to STDOUT (standard output), the command output will be found in the normal PBS output file.

#PBS -l nodes=1:ppn=1
#PBS -l walltime=2:00:00
#PBS -j oe
#PBS -q lionxe-serial

# setup the seq environment
. /usr/local/setup/seg.setup.sh

# change the current working directory to the directory where
# the input file can be found
cd /home/foo/seg

# run the seg command
seg input.fa

Further information on PBS scripts and submitting jobs on the LION-XE cluster can be found in the User Guides section of the HPC website.

Documentation

Information on seg can be found on LION-XE in the file /usr/global/seg/seg.doc.


Please send questions or suggestions about this web page to beatnic@aset.psu.edu

ASET | ITS | Penn State