Overview

ONETOOL is a tool for family-based big data analyses. It implements the properties of well-known existing family data analysis tools and recently developed methods in a computationally efficient manners, so is suitable for analyzing the vast amount of variant data availble from sequencing family members, providing a rich choice of analysis methods for big data from families.

Overall workflow in ONETOOL is:

_images/onetool_dia_1.png

Input

ONETOOL supports two different sets of input files, PLINK set and VCF set. The PLINK set consists of three files (i.e., .fam, .bed, and .bim) that are used to run PLINK, and the VCF set con-sist of a plink format family file (.fam) and a Variant Call Format (.vcf). The additional phenotypes and covariates are supported through an optional input file (.pheno) for both sets of input files.

ONETOOL also support two different ways to specify the desired analysis options, through a command line and a script file.

The full list of data types and input file formats supported can be found in Input.

Features

The main features in ONETOOL are:

  • InfoQC analysis

    1. Variant information
    2. Sample information
    3. Pedigree information
    4. Mendelian error detection
    5. Relatedness matrix
    6. Pedigree plot
  • Trait analysis

    1. Familial Aggregation
    2. Heritability
    3. Segregation Analysis
  • Linkage analysis

    1. Model-based
    2. Model-free
  • Association analysis

    1. Single variant - common variant
    2. Gene-based - rare variant
    3. Dosage data
  • Epistasis analysis

    1. Multidimensional Dimensionality Reduction (MDR) analysis
    2. Generalized MDR analysis
  • Imputation of missing genotype

  • Data management

    1. Variant filtering
    2. Sample filtering
    3. LD-based prunning

Output

Each method in ONETOOL outputs the result file with the appropriate extension, so that the user can recognize it easily. It has the familiar user interface and the same or similar analysis option names as the existing tools, so no, or only a minimal, learning curve is needed.