Rabbit Genome and Trascriptome Database

Our data contains the most comprehensive rabbit genome and transcriptome data up to now.As we know, the rabbit is a very important species for both biomedical research and agriculture animal breeding.They are not only the most-used experimental animals for the production of antibodies, but also they are widely used for studying a variety of human diseases.The complete genome of rabbit was published in 2012, and many projects have been launched to generate a flood of sequencing data of rabbits until now.

A Database of Differentially Expressed miRNAs in Human Cancers (version 2.0)

dbDEMC (database of Differentially Expressed MiRNAs in human Cancers) is an integrated database that designed to store and display differentially expressed microRNAs (miRNAs) in human cancers detected by high-throughput methods. In this updated version of dbDEMC, a total of 209 newly published data sets were collected from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA).

A database collected human datasets involved in the transition process from inflammation to cancer. I2CDB collected different kinds of cancer types such as liver cancer, colon cancer, gastric cancer, esophageal cancer and desmoid cancer.

Data collection, data analysis and integration, conbination with clinical medicine are the three main systems support I2C database. This project can not only serve the major research plan, but also serve the whole society by finding the key node and molecular markers of I2C process, realizing the prediction and diagnosis of tumor and providing the guidance to find new tumor drug targets.

Drug Repositioning based on Anti-Fibrosis Characteristic

Dr AFC is a comprehensive, freely accessible platform aimed for Drug Repositioning that is based on anti-fibrosis characteristic and an a virtual Knowledge Base accommodating anti-fibrosis compounds, targets and their interactions.

A database for understanding genomic diversity and genetic ancestry of human populations

PGG.Population database documents 7122 genomes representing 356 global populations from 106 countries, and provides necessary information for researchers and medical doctors to understand genomic diversity and genetic ancestry of human populations. We included miscellaneous functions and a friendly graphical user interface to visualize genomic diversity, population relationship, ancestral makeup, footprints of natural selection, and population history etc.

National Omics Data Encyclopedia

The National Omics Data Encyclopedia (NODE) stores raw sequence data from "next-generation" sequencing technologies including 454, IonTorrent, Illumina, SOLiD, Helicos and Complete Genomics. In addition to raw sequence data, NODE now stores alignment information in the form of read placements on a reference sequence.

eLibrary of Microbial Systematics and Genomics

eLMSG (an eLibrary of Microbial Systematics and Genomics) is a web database, which will integrate microbial systematics, genomics, and phenomics (polyphasic taxonomy related phenotypes). The taxonomic system of eLMSG is composed of all validly and some effectively published taxa (from phylum to genus). For species rank, the current version of eLMSG includes merely the type species of all genera.

Glycosyltransferases Database

GTDB is an integrated repository of glycosyltransferases, which collects comprehensive information, including amino acid sequences, coding region sequences, available tertiary structures, protein classification families, catalytic reactions and metabolic pathways involved, from distinct well-known databases or predictions.

A genetic modifier database for elucidating the genetic basis of human phenotypic variation

The PhenoModifier database is an attempt at collecting information that are scattered throughout the literature to characterize human modifiers, and this in turn will help us elucidating the genetic basis of human phenotypic variation.

A systematic resource for proteomic research on post-translational modifications.

SysPTM provides a systematic and sophisticated platform for proteomic PTM research, equipped not only with a knowledge base of manually curated multi-type modification data, but also with four fully developed, in-depth data mining tools. Currently, SysPTM contains data detailing 471109 experimentally determined PTM sites on 53235 proteins, covering more than 50 modification types, curated from public resources including five databases and four webservers and more than three hundred peer-reviewed mass spectrometry papers.

The Han Chinese Genomes Database

The Han Chinese Genomes Database (PGG.Han) serves as the central repository of the genomic data of the Han Chinese Genomes Project (Phase I). As is its current version, PGG.Han archives genomic data of 114,783 Han Chinese individuals (a.k.a. Han100K), including high coverage WGS data (n = 319), low coverage WGS data (n = 11878), and high-density SNV data with 8,056,973 variants genotyped or partially imputed (n = 102,586).

A database for understanding evolutionary and medical implications of human single nucleotide variations in diverse populations

PGG.SNV is database for understanding evolutionary and medical implications of human single nucleotide variation (SNV) on population levels. It now documents more than 300,000 genomes and 10 billion allele frequencies records for diverse human ethnic groups. PGG.Tools is a web-based toolkit or servers for analyzing huge genomic data. Currently, it is under construction.

Virus Genome Toolkits

To identify and isolate the source of the virus and prevent the virus from spreading further in the population, it is important to understand the source of it. Genomics, epidemiology, clinical symptoms and other data are the basis for studying the source of the virus. For this purpose, ViGTK integrates a variety of data resources, and develops tools that integrate viral genome screening, molecular variation and evolution analysis and visualization, and epitope prediction and screening, that may provide services for COVID-19 research.

CIRCpedia: A comprehensive database containing circRNA annotations

An integrative database, aiming to annotating alternative back-splicing and alternative splicing In circRNAs across different cell lines.

CIRCpedia v2: An updated comprehensive database containing circRNA annotations

An updated circRNA database with integrated information of circRNA annotation, expression, and conservation.

TransCirc RNA Database

Using various direct and indirect evidences that support circRNA translation, we conducted an integrative analysis to predict the potential of all circRNAs in coding for functional peptides. The result of such prediction and the supporting evidences were summarized in the TransCirc database.

BEable-GPS: A guidance for the potential editable pathegonic point mutations by CRISPR base editors (BEs).

A database to profile the accessibility of base editors (BEs) for targeting pathogenic SNVs and provide gRNA designs for pathogenic SNVs and genomic targets.

Knowledge Graph of SARS-CoV-2

KGCoV(Knowledge Graph of SARS-CoV-2) structures and matches COVID-19 epidemiological information and SARS-CoV-2 genomic data with combined curation methods, and integrates variation information generated by bioinformatic tools.