Our data contains the most comprehensive rabbit genome and transcriptome data up to now.As we know, the rabbit is a very important species for both biomedical research and agriculture animal breeding.They are not only the most-used experimental animals for the production of antibodies, but also they are widely used for studying a variety of human diseases.The complete genome of rabbit was published in 2012, and many projects have been launched to generate a flood of sequencing data of rabbits until now.
dbDEMC (database of Differentially Expressed MiRNAs in human Cancers) is an integrated database that designed to store and display differentially expressed microRNAs (miRNAs) in human cancers detected by high-throughput methods. In this updated version of dbDEMC, a total of 209 newly published data sets were collected from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA).
A database collected human datasets involved in the transition process from inflammation to cancer. I2CDB collected different kinds of cancer types such as liver cancer, colon cancer, gastric cancer, esophageal cancer and desmoid cancer.
Data collection, data analysis and integration, conbination with clinical medicine are the three main systems support I2C database. This project can not only serve the major research plan, but also serve the whole society by finding the key node and molecular markers of I2C process, realizing the prediction and diagnosis of tumor and providing the guidance to find new tumor drug targets.
PGG.Population database documents 7122 genomes representing 356 global populations from 106 countries, and provides necessary information for researchers and medical doctors to understand genomic diversity and genetic ancestry of human populations. We included miscellaneous functions and a friendly graphical user interface to visualize genomic diversity, population relationship, ancestral makeup, footprints of natural selection, and population history etc.
The National Omics Data Encyclopedia (NODE) stores raw sequence data from "next-generation" sequencing technologies including 454, IonTorrent, Illumina, SOLiD, Helicos and Complete Genomics. In addition to raw sequence data, NODE now stores alignment information in the form of read placements on a reference sequence.
eLMSG (an eLibrary of Microbial Systematics and Genomics) is a web database, which will integrate microbial systematics, genomics, and phenomics (polyphasic taxonomy related phenotypes). The taxonomic system of eLMSG is composed of all validly and some effectively published taxa (from phylum to genus). For species rank, the current version of eLMSG includes merely the type species of all genera.
GTDB is an integrated repository of glycosyltransferases, which collects comprehensive information, including amino acid sequences, coding region sequences, available tertiary structures, protein classification families, catalytic reactions and metabolic pathways involved, from distinct well-known databases or predictions.
The PhenoModifier database is an attempt at collecting information that are scattered throughout the literature to characterize human modifiers, and this in turn will help us elucidating the genetic basis of human phenotypic variation.
SysPTM provides a systematic and sophisticated platform for proteomic PTM research, equipped not only with a knowledge base of manually curated multi-type modification data, but also with four fully developed, in-depth data mining tools. Currently, SysPTM contains data detailing 471109 experimentally determined PTM sites on 53235 proteins, covering more than 50 modification types, curated from public resources including five databases and four webservers and more than three hundred peer-reviewed mass spectrometry papers.
The Han Chinese Genomes Database (PGG.Han) serves as the central repository of the genomic data of the Han Chinese Genomes Project (Phase I). As is its current version, PGG.Han archives genomic data of 114,783 Han Chinese individuals (a.k.a. Han100K), including high coverage WGS data (n = 319), low coverage WGS data (n = 11878), and high-density SNV data with 8,056,973 variants genotyped or partially imputed (n = 102,586).
A database for understanding evolutionary and medical implications of human single nucleotide variations in diverse populations
PGG.SNV is database for understanding evolutionary and medical implications of human single nucleotide variation (SNV) on population levels. It now documents more than 300,000 genomes and 10 billion allele frequencies records for diverse human ethnic groups. PGG.Tools is a web-based toolkit or servers for analyzing huge genomic data. Currently, it is under construction.
To identify and isolate the source of the virus and prevent the virus from spreading further in the population, it is important to understand the source of it. Genomics, epidemiology, clinical symptoms and other data are the basis for studying the source of the virus. For this purpose, ViGTK integrates a variety of data resources, and develops tools that integrate viral genome screening, molecular variation and evolution analysis and visualization, and epitope prediction and screening, that may provide services for COVID-19 research.
An integrative database, aiming to annotating alternative back-splicing and alternative splicing In circRNAs across different cell lines.
An updated circRNA database with integrated information of circRNA annotation, expression, and conservation.
BEable-GPS: A guidance for the potential editable pathegonic point mutations by CRISPR base editors (BEs).
A database to profile the accessibility of base editors (BEs) for targeting pathogenic SNVs and provide gRNA designs for pathogenic SNVs and genomic targets.