What is MiST?
The Microbial Signal Transduction database contains the signal transduction proteins for bacterial and archaeal genomes (2,756 complete and 5,181 draft). These are identified using various domain profiles that directly or indirectly implicate a particular protein in participating in signal transduction.
3 September 2013
- In v2.2, the protein ids have changed. We are working on mapping the old ids to the new ids. Until this is complete, please use the previous version, http://v2-1.mistdb.com if you need to work with the old ids.
- An alternative, more stable approach is to only store references to sequences using the gene locus.
26 August 2013
- New version 2.2 - badly needed genome update; majority of new genomes are incomplete. Many genomes available for various strains of the same species.
- Minor usability improvements (e.g. the list of genomes and the taxonomy browser is much more interactive and dynamic.)
- No more genome metadata (phenotype, gram-stain, disease, etc.); NCBI no longer provides this information in an easily accessible format.
If you use information from the MiST database in your research, please cite:
- The MiST2 database: a comprehensive genomics resource on microbial signal transduction.
Luke E. Ulrich and Igor B. Zhulin
Nucleic Acids Research, 2010, doi:10.1093/nar/gkp940.
Definitions of microbial signal transduction implemented in MiST are described in:
- One-component systems dominate signal transduction in prokaryotes.
Luke E. Ulrich, Eugene V. Koonin, and Igor B. Zhulin
Trends in Microbiology, 2005, 13:52-56.
Genomic distribution of signal transduction proteins
- 3,321 signal transduction proteins: Vibrio parahaemolyticus O1:Kuk str. FDA_R31
- 2,624 one-component proteins: Vibrio parahaemolyticus O1:Kuk str. FDA_R31
- 497 two-component proteins: Vibrio parahaemolyticus O1:Kuk str. FDA_R31
- 149 chemotaxis proteins: Vibrio parahaemolyticus O1:Kuk str. FDA_R31
- 118 ECF protiens: Plesiocystis pacifica SIR-1
- Largest genome (39.3 Mbp): Vibrio parahaemolyticus O1:Kuk str. FDA_R31
- Most genes (39,184 genes): Vibrio parahaemolyticus O1:Kuk str. FDA_R31
* Note: Chemotaxis is a specialized form of two-component systems (details)