To deal with the vast quantities of genome sequence data being
generated by the various genome projects, we have developed an
automated, iterative system for comprehensive sequence analysis
and gene annotation for comparative genomics. Our approach is
can be used for high-throughput analyses of genome sequence data
as they are generated and as an application emphasizing regions
of interest to researchers to identify genes responsible for disease.
Our "annotation pipeline" consists of modules for sequence extension,
analysis, annotation, and visualization, and integrates disparate
bioinformatics applications such as Phred/Phrap, RepeatMasker,
MetaGene, and the Virtual Comparative Map (VCMap) with Internet
bioinformatics services such as Blast and LocusLink, and presents
the annotation in a web-based interface.