CAAT-Box, contigs-Assembly and Annotation Tool-Box for genome sequencing projects - Archive ouverte HAL Access content directly
Journal Articles Bioinformatics Year : 2004

CAAT-Box, contigs-Assembly and Annotation Tool-Box for genome sequencing projects

(1) , (2) , (2) , (2) , (2) , (1) , (2)
1
2

Abstract

Contigs-Assembly and Annotation Tool-Box (CAAT-Box) is a software package developed for the computational part of a genome project where the sequence is obtained by a shotgun strategy. CAAT-Box contains new tools to predict links between contigs by using similarity searches with other whole genome sequences. Most importantly, it allows annotation of a genome to commence during the finishing phase using a gene-oriented strategy. For this purpose, CAAT-Box creates an Individual Protein file (IPF) for each ORF of an assembly. The nucleotide sequence reported in an IPF corresponds to the sequence of the ORF with 500 additional bases before the ORF and 200 bases after. For annotation, additional information like Blast results can be added or linked to the IPFs as well as automatic and/or manual annotations. When a new assembly is performed, CAAT-Box creates new IPFs according to the old IPF panel. CAAT-Box recognizes the modified IPFs which are the only ones used for a new automatic analysis after each assembly. Using this strategy, the user works with a group of IPFs independently of the closure phase progression. The IPFs are accessible by a web server and can therefore be modified and commented by different groups.

Dates and versions

pasteur-03252411 , version 1 (07-06-2021)

Identifiers

Cite

L. Frangeul, Philippe Glaser, Christophe Rusniok, C. Buchrieser, E. Duchaud, et al.. CAAT-Box, contigs-Assembly and Annotation Tool-Box for genome sequencing projects. Bioinformatics, 2004, 20 (5), pp.790-797. ⟨10.1093/bioinformatics/btg490⟩. ⟨pasteur-03252411⟩

Collections

PASTEUR CNRS
20 View
0 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More