PMBL - DATABASES - SEQUENCIES - NUCLEIC ACIDS



GENERAL DATABASES

 EMBL-EBI Nucleotide Sequence Database (EMBL)
NCBI National Center for Biotechnology Information (NIH)
DDBJ DNA Data Bank of Japan

DNA SEQUENCE DATABASES

EST

dbEST dbEST (Expressed Sequence Tags) db (NCBBI)
UniGene Non redundat genes at NCBI
TIGR The Gene Index Project (Computational Biol. & Functional Genomics Lab.)
 EGAD The Expressed Gene Anatomy Database (EGAD)

SPLICING

 
Introns.com  
ASD Alternative Splicing Database Project (EMBL-EBI)
Hollywood.mit.edu Hollywood Alternative spliced RNA database
Aberrant SD Aberrant Splicing Database
MAASE The Manually Annotated Alternative Splicing Events Database
RatSDB Database of rat splicing variants
ECgene Genome Annotation for Alternative Splicing
ASAP Alternative Splicing Annotation Project Database
 UNTRANSLATED RNAs
tRNA­Compilation  
RDP Ribosomal Database Project 9.0
soRNA soRNA Database
Comparative RNA Comparative RNA Web Site and Project (Univ. Texas, Austin)
Non-coding RNA Data Bank  
fRNAdb Functional RNA database
Rfam Database of noncoding RNA families
miRBase MicroRNA database
ASRP Arabidopsis thaliana small RNA project
met snoRNA Methylation guide snoRNA database
plant snoRNA Plant snoRNA database
tRNA database tRNA database
tmRDB Database of tmRNA sequences
RNase P RNase P RNA and protein database
SRPDB Signal recognition particle database
RNA World Website The RNA World Website (Fritz Lipmann Institute)
Scottish RNA Club  
5S rRNA 5S Ribosomal RNA Database
tmRNA Website  
SHORT TAGGED SEQUENCES (STS)
dbSTS dbSTS: database of Sequence Tagged Sites  (NCBI)
SINGLE NUCLEOTIDE POLYMORPHISM (SNP)
NCBI-SNPs  
Barley SNPs Barley SNP Database
Rice SNPs Database Rice SNPs Database
PlantMarkers Plant Markers, a database of predicted molecular markers
Maritime Pine SNPs Maritime Pine SNPs database
PROMOTERS
DOOP A database of Orthologous Promoters
PlantProm DB A Database of Plant Promoter Sequences
VECTORS
VectorDB  
UniVec Vector Screen Database
OLIGOS
VirOligo Virus Oligos Compilation Lab
PROBES
MPDB Molecular probe database (Ist. Naz. Ricerca sul Cancro)
TELOMERES
TelDB Telomere Database
TelLit Telomere Literature Database
ACUTS Ancient conserved untranslated DNA sequuences db
   

MOTIFS & REGULATORY SIGNALS

PlantCARE A database of plant promoters and their cis-acting regulatory elements
PLACE A Database of Plant Cis-acting Regulatory DNA elements
EPD The Eukaryotic promoter database
TRRD Transcription regulatory regions database
MethDB DNA Methylation Database
AtProbe Arabidopsis thaliana Promoter Binding Element Database
TRED Transcriptional Regulatory Element Database
Plant-TRRD Plant Genes Knowledge Base
ORegAnno Gene regulatory elements database
   

DNA REPEATS

PRDP Plant Repeat Database Project (TIGR)
PlantSat Plant Satellite Database
Rice Repeats TIGR rice repeat database
   

CODON USAGE

 
Codon Usage Db Codon Usage Database
RCPdb Evolutionary classification and codon usage of Repeat Containing Proteins
NRCUD Non redundant Codon Usage Db
CODEHOP Codon Usage Tables
   

DNA ALIGNMENTS

EMBL-ALIGN  
   

ORGANELLES

MitoDat Mendelian Inheritance and the Mitochonddrion db
Plant Organellome The Plant Organelle Database
GOBASE The Organelle Genome Database
MitoRes Mitochondria related-genes
   
GENE COLLECTIONS
SPECIES
GrainGenes  Small grain genes database
Gramene  
   
TYPES OF GENES
MAIZEWALL Maize cell wall database
PlantTFDB Plant Transcription Factor Database
RiceTFDB Rice Transcription Factor Database
RARTF RIKEN Arabidopsis Transcription Factor DB
DATF Arabidopsis Transcription Factor DB
TOBFAC Tobacco Transcription Factor DB
Medicago MADS Medicago Truncatula Transcription Factor db
DPTF Poplar Transcription factors db
   

 

 
















TRANSFAC

Transcription factor database

http://transfac.gbf.de/TRANSFAC/

TFD

Transcription factor database

http://www.tfdg.com/Pages/tfddata.html

TRRD

Transcription regulatory region database

http://www.mgs.bionet.nsc.ru/mgs/dbases/trrd4/

PlantCARE

Plant cis-Acting regulatory elements

http://sphinx.rug.ac.be:8080/PlantCARE/

PLACE

Plant cis-acting regulatory elements

http://www.dna.affrc.go.jp/htdocs/PLACE/

RegulonDB

Database on transcriptional regulation in Escherichia coli

http://www.cifn.unam.mx/Computational_Genomics/regulondb/

SCPD

Promoter database of Saccharomyces cerevisiae

http://cgsigma.cshl.org/jian

EPD


Eukaryotic promoter database


http://www.epd.isb-sib.ch/


 

TATA Signaling, Promoter & Trans-Factor Bind Site Prediction

Gibbs Motif Sampler Homepage (E.C. Rouchka and B. Thompson, Bioinformatics Laboratory of  Wadsworth Center, U.S.A.) - I have linked to the prokaryotic DNA default setting page. On the next page I have presented data the IHF-binding site (consensus: WWWTCAA[N4]TTR).

RSA-tools - Gibbs (A. Neuwald & Jacques van Helden, Service de Conformation des Macromolécules Biologiques et de Bioinformatique, Université Libre de Bruxelles, Belgium) - type in the matrix size desired and deselect "add reverse complement strand."  After running the program once I would delete those sequences from the discovery set which align imperfectly.

 BindGene (C. Lockwood, University of Manchester, United Kingdom) - I have found this site particularly useful.  If your sequence is less than 2kb use the default settings.  If you paste 10kb of data, I suggest changing the "shuffled matrices" to "0."

 TESS - String Search Page (Center for Bioinformatics, University of Pennsylvania, U.S.A.) - this site requires that one enter the motif consensus sequence (Search My Site Strings), and is limited to 2000 nt per search. N.B. This site also permits searching TRANSFAC Strings. Choose "small Javal applets" to view the results of the search.

Create Matrix File (J. Zheng, Queen's University, Canada) - creates a matrix from a DNA Clustal alignment and also presents the consensus:


DNA Motifs Gibbs Sampler - SeSiMCMC - the Sequence Similarities by Markov Chain Monte-Carlo algorithm finds DNA motifs of unknown length and complicated structure in a set of unaligned DNA sequences. It uses an improved motif length estimator and careful Bayesian analysis of the possibility of a site absence in a sequence. Reference: A.V. Favorov et al.. 2005.  Bioinformatics 21: 2240-2245.

PromScan (D.J. Studholme & R. Dixon. 2003. Domain architectures of sigma54-dependent transcriptional activators. J. Bacteriol. 185:1757-67; as modified by S. Richards, Queen's University, Canada). Scans small genomes for potential factor-binding sites including IHF-binding sites. If a *.ptt file is included the results will indicate the position of the promoter relative to the nearest gene.

FindTerm (Softberry Inc.) - currently the only tool on the internet for mapping rho-independent terminators. You might consider using the advanced feature options and minimally increase the default energy threshold to -12.0.

TransTerm (Michael Nuhn, Nano+Bio-Center) - TransTerm searches for rho-independent terminators in the vicinity of annotated genes. This TIGR program can be accessed online in two ways. If you have the genome in GenBank format to use this program since it will only look for terminators in the vicinity of the annotated genes. If the genome has not been annotated use this site. The latter site combines Glimmer and RBSfinder with TransTerm.

Tools to find motif clusters in DNA sequences - one should probably start at ZLAB (Dr. Zhiping Weng, Boston University, U.S.A) which has developed a  wide range of tools to interaction between regulatory proteins and their DNA/RNA target sites including:

 Cluster-Buster
 Comet
 Cister

Find short split motifs in DNA sequences with YMF (Reference: Sinha, S. & Tompa, M. 2002. Nucleic Acids Research
Motif Sampler - tries to find over-represented motifs (cis-acting regulatory elements) in the upstream region of a set of co- regulated genes. This motif finding algorithm uses Gibbs sampling to find the position probability matrix that represents the motif. Be sure to "uncheck" the appropriate box if you don't want the complementary strand included in the analysis. (Reference: G. Thijs et al. 2002. J. Comput. Biol. 9: 447-464.)
Melina - Motif Elucidator in Nucleotide Sequence Assembly (Human Genome Center, University of Tokyo, Japan) - helps one extract a set of common motifs shared by functionally-related DNA sequences. It  utilizes CONSENSUS, GIBBS DNA, MEME and Coresearch  which are considered to be the most progressive motif search algorithms. Each algorithms is supplied with an impressive set of selection parameters. 

BioProspector  Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes (Stanford University AI Lab, U.S.A.) - uses a Gibbs sampling strategy to examine the upstream region of genes in the same gene expression pattern group and looks for regulatory sequence motifs. BioProspector uses Markov background to model the base dependencies of non-motif bases, which greatly improved the specificity of the reported motifs.
BioOptimizer - is an algorithm designed to clean up the motifs found by BioProspector, Consensus, AlignACE & MEME by finding the configuration of motif start sites that maximizes a scoring function (Reference: S.T. Jensen & J.S. Liu 2004 Bioinformatics 20:1557-1564).

 

 

PROMOTERS & TERMINATORS


B. Eukaryotic

Not being a eukaryotic molecular biologist I cannot comment on utility and accuracy of the following promoter- prediction programs. You might want to visit The Eukaryotic Promoter Database and download their organism-specific promoter databases to test the following.  

red_bullet.gif (914 bytes)Neural Network Promoter Prediction (Berkeley Drosophila Genome Project, U.S.A.) (Reference: M.G. Reese 2001. Comput. Chem. 26: 51-6).
red_bullet.gif (914 bytes)Promoter 2.0 Prediction Server (S. Knudsen,Center for Biological Sequence Analysis, Technical University of Denmark) - predicts transcription start sites of vertebrate Pol II promoters in DNA sequences
red_bullet.gif (914 bytes)PROMOSER - Human, Mouse and Rat promoter extraction service (Boston University, U.S.A.) - maps promoter sequences and transcription start sites in mammalian genomes. (Reference: S. Anason et al. 2003. Nucl. Acids. Res. 2003 31: 3554-59).
red_bullet.gif (914 bytes)Dragon Gene Start Finder (Institute for Infocomm Research, Singapore)
red_bullet.gif (914 bytes)Promoter and gene expression regulatory motifs search (Softberry, U.S.A.) - offers a variety of promoter- scanning prrograms  

C. Transcriptional terminators

TransTerm (Michael Nuhn, Nano+Bio-Center) - TransTerm searches for rho-independent terminators in the vicinity of annotated genes. This TIGR program can be accessed online in two ways. If you have the genome in GenBank format to use this program since it will only look for terminators in the vicinity of the annotated genes. If the genome has not been annotated use this site. The latter site combines Glimmer and RBSfinder with TransTerm.

FindTerm (Softberry Inc.) - can also be used for mapping rho-independent terminators. You might consider using the advanced feature options and minimally increase the default energy threshold to -12.0. Please note that the online version of this program will only find one terminator at a time.  If you are dealing with a long sequence, once you have located a terminator, delete it from the DNA sequence and resubmit.

RibEx: Riboswitch Explorer - scans <40kb DNA for potential genes (which are linked to BLASTP) and several hundred regulatory elements, including riboswitches. If you click on the "search for attenuators" it finds terminators and antiterminators. (Reference: C. Abreu-Goodger & E. Merino. 2005. Nucl. Acids Res. 33: W690-W692).


ABS Blanco E.1,2, Farre D.1,2, Alba M.1, Messeguer X.2 and Guigo R.1
Experimentally verified orthologous transcription factor binding sites
database       summary

 

ACTIVITY
Functional DNA/RNA site activity
database       summary

 

AGRIS - Arabidopsis Gene Regulatory Informmation Server Davuluri RV1, Sun H1, Palaniswamy SK1, Matthews N2, Molina C2, Grotewold E2
Arabidopsis promoters, transcription factors and their target genes
database       summary

 

TRANSFAC® Matys, V., Kel-Margoulis, O.V., Fricke, E., Liebich, I., Land, S., Barre-Dirrie, A., Reuter, I., Chekmenev, D., Krull, M., Hornischer, K., Voss, N., Stegmaier, P., Lewicki-Potapov, B., Saxel, H., Kel, A.E., and Wingender, E.
Transcription factors, gene regulation, positional weight matrices
database       summary

 

ASPD Valuev, V.P., Afonnikov D.A., Ponomarenko M.P., Kolchanov N.A.
Artificial Selected Proteins/Peptides Database
database       summary

 

cisRED Robertson A.G.1, Bilenky M.1, Lin K.1, He A.1, Yuen, W.1, Dagpinar M.1, Varhol R.1, Teague K.1, Griffith O.L.1, Zhang X.1, Pan, Y.1, Hassel M.1, Sleumer M.C.1, Pan, W.1, Pleasance E.D.1, Chuang, M.1, Hao H.1, Li Y.Y.1, Robertson N.1, Fjell C.1, Li B.1, Montgomery S.B.1, Astakhova T.1, Zhou J.2, Sander J.2, Siddiqui A.S.1 and Jones S.J.M.1
Phylogenetically conserved regulatory DNA sequence motifs
database       summary

 

 DBD Kummerfeld S.K. and Teichmann S.A.
Transcription factor prediction database
database       summary

 

 

 DBTSS Yamashita R.1, Suzuki, Y.2, Wakakguri, H.2, Nakai, K.1, Sugano, S.2
Database of transcriptional start sites
database       summary

 

 DoOP - Databases of Orthologous Promoters Barta E.1, Sebestyén E.1, Pálfy T.B.1, Tóth G.1, Ortutay C.P.1 and Patthy L.2
Clusters of orthologous upstream sequences from chordates and plants
 

 EPD Cavin Périer, R.1, Praz, V.2, Bucher, P.2
Eukaryotic POL II promoters with experimentally-determined transcription start sites
database       summary

 

 

GeneNet Ananko, E.A., Podkolodny, N.L., Stepanenko, I.L., Podkolodnaya, O.A., Rasskazov, D.A., Miginsky, D.S., Likhoshvai, V.A., Ratushny, A.V., Podkolodnaya, N.N., Kolchanov, N.A.
Database on gene network components
database       summary

 

 

 HTPSELEX
Transcription factor binding site sequences obtained using high-throughput SELEX method
database       summary

 

 JASPAR
Collection of PSSMs for transcription factor DNA-binding sites
database       summary

 

 MAPPER
Putative transcription factor binding sites in various genomes
database       summary

 

 ooTFD Ghosh D.
Transcription factors and gene expression
database       summary

 

 PLACE Higo, K.
Plant cis-acting regulatory elements
database       summary

 

 

PlantCARE Rombauts, S.1, Bonnet, E.1, Thijs, G.3, Marchal, K.3, Moreau, Y.3, Van de Peer, Y.1, Rouze, P.2
Plant cis-acting regulatory elements
database       summary

 

 PlantProm
Plant promoter sequences
database       summary

 

rSNP Guide
Single nucleotide polymorphisms in regulatory gene regions
database       summary

 SELEXdb
Selected DNA/RNA functional site sequences
database       summary

 

 SKY/M-FISH and CGH
Fluorescent images of chromosomes and cytogenetic data
database       summary

 

 SwissRegulon Pachkov M., Erb I., Molina N. and van Nimwegen E.
Genome-wide annotations of regulatory sites in the intergenic regions
database       summary

 

 TESS
Transcription element search system
database       summary

 

 

 TRANSCompel® Kel-Margoulis, O.1, Kel, A.1, Reuter, I.1, Deineko, I.2, Wingender, E.1
Transcriptional regulation, composite regulatory elements
database       summary

 

 TRANSPATH® Krull, M.1, Pistor, S.1, Voss, N.1, Kel, A.1, Reuter, I.1, Potapov, A.1,2, Choi, C.1, Kel-Margoulis, O.1, Wingender, E.1,2
Signal transduction pathways, vizualization and expression data analysis
database       summary

 

 Transterm Jacobs, G.2, Rackham, O.1, Stockwell, P. A.1, Brown, C. M.1
Codon usage, start and stop signals
database       summary

 

 TRED - Transcriptional Regulatory Element Database
Transcriptional regulatory element database
database       summary

 

 TRRD Kolchanov N.A.1,2, Ananko E.A.1,2, Podkolodnaya O.A.1,2, Ignatieva E.V.1,2, Stepanenko I.L.1,2, Khlebodarova T.M.1,2, Merkulova T.I.1,2, Podkolodny N.L.1,2, Grigorovich D.A.1,2, Poplavsky A.S.1,2, Romashchenko A.G.1,2
Transcription regulatory regions of eukaryotic genes
database       summary

 

 TrSDB
Transcription factor database
database       summary

 

PLACE Higo, K.
Plant cis-acting regulatory elements
database       summary

 

PlantCARE Rombauts, S.1, Bonnet, E.1, Thijs, G.3, Marchal, K.3, Moreau, Y.3, Van de Peer, Y.1, Rouze, P.2
Plant cis-acting regulatory elements
database       summary

 




















1
Hosted by www.Geocities.ws

1