Input File
OR paste Genome Sequence in FASTA format
Additional Parameters
Threshold Values :
Start Codon :
ATG
CTG
GTG
TTG
Method :
DNA
Protein
Swissprot
E-mail ID :
(Optional)
Threshold Value: If you have small genome you can specify lower threshold value to find smaller genes. If you have large genomes you can specify higher threshold value to weed out false positives
Start Codon: You can specify what should be the start codon with which you want to find genes.
Method : DNA Space: The method takes complete or part of genome sequence of prokaryotic species in FASTA format as input file. It searches for genes based on physico-chemical properties of double-helical deoxyribonucleic acid (DNA).
Protein Space: The method takes the result generated from DNA space as input file and works as a filter based on stereochemical properties of protein sequences to reduce false positives.
Swissprot Space :The method takes the result generated from protein space as input file and calculates the standard deviation of a query nucleotide sequence (predicted gene sequence) with the swissprot proteins based on the frequency of occurrence of aminoacids. A threshold standard deviation is chosen to keep the false positives at minimum and precision at maximum.
There is no file size limitation for the genomes. We have tested on more than 5 MB genome file size available with us. If the program crashes on large genome size, more than 5 MB, please intimate us.
The computation may take 5-10 minutes depending upon the load on the web server and the size of the genome in the input file.
[Mirror] (http://chemgenome.wesleyan.edu)
|