Giter VIP home page Giter VIP logo

kobas's Introduction

		 
		 KOBAS - KEGG Orthology Based Annotation System

1. What's KOBAS?

KEGG Orthology Based Annotation System (KOBAS) is a standalone Python
application in Bioinformatics.  KOBAS can assign appropriate KO terms for
queried sequences based on similarity search, and it can further discover
enriched KO terms among the annotation results by frequency of pathways or
statistical significance of pathways.

2. Installation

Please refer to install.txt under docs directory in KOBAS source

3. Usage

KOBAS composes of two major scripts, which are similar to usual command line
tools: the behavor can be adjusted by swiches and configuration file.  The whole
work flow can be divided to two steps: sequence annotation and pathway discovery.

Sequence Annotation
-----------------

Given a FASTA file of protein, you can annotate them by this command:

      blast2ko.py <FASTA FILE> 

All the result are printed default, you can redirect into a file.  If the input
are nuclear acid sequences, you can adjust blast2ko behavor by the command
option "-p" as follows:

      blast2ko.py -p blastx <FASTA FILE>


Pathway Discovery
-----------------

Once the annotation is complete, pathfind can be used to discover enriched
pathways by two ways: frequency prior and statistally significance prior.
You can find the most frequent pathways among the annotation result:

      pathfind.py <ANNOATION FILE>

Or, you can also find the most statistically enriched pathways:

      pathfind.py [-k b/c/h] <ANNOATION FILE> <BACKGROUND ORGANISM>

'-k' option specifies the statistical method to rank, 'b' represents binomial
distribution, 'c' represents chisquare distribution and 'h' represents
hypergeometry distrubtion. <BACKGROUND Organism> is three-charater abbreviation
of the organism in KEGG GENES as backgroud distribution.  Also, you can specifiy
an data file as backround like this:

      pathfind.py [-k b/c/h] <ANNOATION FILE> <BACKGROUND FILE>

Please refer other details to online manpage by the command "<CMD> -h".

4. Contact

Please, report bugs, comments and suggestions to the email below.  Also, if you
succeed in compiling this module under a platform other than Linux, please, send
us a note.

Authors: Xizeng Mao, Jianmin Wu, Tao Cai, Chen Xie, Liping Wei

Email:  [email protected]

Web: 	http://kobas.cbi.pku.edu.cn

Enjoy :)

kobas's People

Contributors

xmao avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.