Database “Proteomics of malignant cells”
Multimodal database “Proteomics of malignant cells” (http://ef2.inbi.ras.ru) currently contains seventeen interconnected information modules; each includes proteomic and biomedical data on proteins of the corresponding objects of research.
Database “Proteomics of Prostate Cancer”
A multi-level e-database “Proteomics of Prostate Cancer” (http://ef.inbi.ras.ru/) was created based on results of proteomic studies of prostate tissue samples with malignant and benign tumors. The database contains more than 300 protein fractions and is registered in the State Register of databases (registration number 2012620676 from 13.07.2012).
Database “Muscle proteomics”
Multimodal database “Muscle proteomics” (http://mp.inbi.ras.ru/) contains complex modules “myocardial proteins” and “proteins from myoblasts”(totally more than 300 identified muscle proteins).
Database of oligopeptides EROP-Moscow
Database EROP-Moscow (Endogenous Regulatory OligoPeptides) with online access (http://erop.inbi.ras.ru/) contains information about structures and functions of natural oligopeptides.
In order to generate a client-server database, chemical, biological and other characteristics of natural oligopeptides produced via ribosomal template, non-ribosomal template or enzymatic synthesis were compiled. Currently, the database contains more than 9000 structures of natural oligopeptides.
Web-site with the bacterial gene annotations
The site gives a list of the most likely biological functions of the nucleotide sequence under study. The effectiveness of annotations is about 19% higher than that of all existing methods with the same number of false positives (http://genefunction.ru)
Database of periodic DNA regions in major genomes
The database contains information on areas with different periodicity tapes in various genomes. For eukaryotic genomes, these areas on average occupy ~ 8% of the genome (http://victoria.biengi.ac.ru/cgi-bin/indelper/index.cgi)
Web site for searching for hidden periodicity in DNA and amino acid sequences with deletions and insertions
The site allows you to find the hidden periodicity with insertions and deletions, or both amino acid and the nucleotide sequences (http://victoria.biengi.ac.ru/splinter/login.php)
Database of potential reading frame shifts in coding sequences
The database contains information about potential mutations like reading frame shift in a variety of cds from eukaryotic genomes. On average, about 23% cds contains such mutations (http://victoria.biengi.ac.ru/cgi-bin/frameshift/index.cgi)
Web site to search of potential reading frame shifts in cds
The server allows you to find potential mutations of the type of reading frame shift to any cds (http://victoria.biengi.ac.ru/fsfinder/)
Database of Potential promoter sequence in Rice genome
The database is located at: http://victoria.biengi.ac.ru/cgi-bin/dbPPS/index.cgi
The database contains over 150 thousand potential promoter sequences. The creation of a database became possible because a new mathematical method for creating multiple alignments of nucleotide sequences was developed. Potential applications include biotechnology and genetic engineering.
There are no similar databases. This is due to the fact that the identification of promoter sequences by all previously developed mathematical algorithms is impossible. Created by the staff of the group for mathematical analysis of DNA and protein sequences (Head: Dr. E.V. Korotkov).
Rice Genome SINE Repeat Database
The database is located at: http://victoria.biengi.ac.ru/sinerice/
The database contains tens of thousands of new and known SINE repeats from 45 different families. The creation of a database became possible only because a new mathematical method for creating multiple alignments of nucleotide sequences was developed.
Potential applications include biotechnology and genetic engineering.
There are no similar databases for the rice genome. This is due to the fact that the detection of new previously unknown SINE sequences by all previously developed mathematical algorithms is impossible. Created by the staff of the group for mathematical analysis of DNA and protein sequences (Head: Dr. E.V. Korotkov).
Database of dispersed repeats in plant genomes
The database contains dispersed repeats which were found in Arabidopsis thaliana, Capsicum annuum, Daucus carota, Oryza sativa and Zea mays genomes. We applied the IP method based on optimization of position-weight matrices and two-dimensional dynamic programming, which allows one to detect repeats with weak similarity. The method is described in detail in the publications: https://doi.org/10.3390/ijms241310964 and https://doi.org/10.3390/ijms25084441.