Data check is a module of the function to check whether the input data meet the requirements. This module mainly includes:

  • Data conversion
    All uploaded data will be converted to Plink binary format
  • Calculate the number of loci and samples
    The number of loci and that of the sample size in the data uploaded are calculated, and some simple summary statistics are provided.
  • Basic data quality control
    This module will:
    a. Extract balletic SNPs;
    b. Check chromosome codes (1-22,X[23), Y[24), XY[25), MT[26]);
    c. Transform reference genome to GRCh37;
    d. Remove site duplication;
    e. Update rsIDs.

Users can view the processed files in the data summary.