Summary:

Data check is a module that verifies whether the input data meet the requirements.

  • Data conversion
    Data in PLINK binary format or VCF format can be uploaded. All uploaded VCF data will be converted to PLINK binary format.
  • Calculate the number of variants and samples
    The number of variants and the sample size of the uploaded data are calculated, and some simple summary statistics are provided.
  • Basic data quality control
    This module will:
    a. Extract biallelic SNVs;
    b. Check chromosome codes (1-22, X [23], Y [24], XY [25], MT [26]);
    c. Remove duplicate variants.
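
The three quality control steps above can be illustrated with a minimal Python sketch. It is not the module's actual implementation. The function names (normalize_chrom, is_biallelic_snv, basic_qc) and the record layout (chromosome, position, reference allele, alternate allele) are hypothetical. The sketch also assumes that a "chr" prefix is stripped, that variants with unrecognized chromosome codes are dropped, and that two variants are duplicates when chromosome, position, and both alleles match; the module may define these differently.

```python
# Sketch of the basic QC steps; all names here are hypothetical.

# Numeric chromosome codes as listed above: 1-22, X=23, Y=24, XY=25, MT=26.
CHROM_CODES = {str(i): i for i in range(1, 27)}
CHROM_CODES.update({"X": 23, "Y": 24, "XY": 25, "MT": 26})

BASES = {"A", "C", "G", "T"}


def normalize_chrom(code):
    """Return the numeric chromosome code, or None if it is not recognized."""
    key = str(code).upper()
    if key.startswith("CHR"):  # assumption: tolerate a "chr" prefix
        key = key[3:]
    return CHROM_CODES.get(key)


def is_biallelic_snv(ref, alt):
    """True when both alleles are single, distinct nucleotides."""
    ref, alt = ref.upper(), alt.upper()
    # A multi-allelic ALT such as "A,T" or an indel such as "AT" fails this test.
    return ref in BASES and alt in BASES and ref != alt


def basic_qc(records):
    """Keep biallelic SNVs on recognized chromosomes, first occurrence only."""
    seen = set()
    kept = []
    for chrom, pos, ref, alt in records:
        code = normalize_chrom(chrom)
        if code is None:  # assumption: unrecognized codes are dropped
            continue
        if not is_biallelic_snv(ref, alt):
            continue
        key = (code, pos, ref.upper(), alt.upper())
        if key in seen:  # assumption: duplicate = same chrom, pos, alleles
            continue
        seen.add(key)
        kept.append(key)
    return kept


records = [
    ("1", 1000, "A", "G"),
    ("chrX", 2000, "C", "T"),
    ("1", 1000, "A", "G"),          # duplicate of the first record
    ("2", 3000, "A", "AT"),         # insertion, not an SNV
    ("3", 4000, "G", "A,T"),        # multi-allelic site
    ("GL000192.1", 500, "A", "C"),  # unrecognized chromosome code
]
print(basic_qc(records))
```

Only the first two records pass all three checks in this example; the X chromosome record is returned with the numeric code 23.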

Users can view the processed files in the data summary.