Topological data analysis can extract sub-groups with high incidence rates of Type 2 diabetes
by Hyung Sun Kim; Chahngwoo Yi; Yongkang Kim; Uhnmee Park; Woong Kook; Bermseok Oh; Hyuk Kim; Taesung Park
International Journal of Data Mining and Bioinformatics (IJDMB), Vol. 22, No. 1, 2019

Abstract: Type 2 Diabetes (T2D) is a rapidly increasing worldwide scourge, and identifying its genetic contributors is vital. However, analysing multiple disease-contributing factors and their combined interactions remains difficult with traditional approaches. Topological Data Analysis (TDA) reveals the shape of a data set and facilitates clustering analysis by determining which components are close to each other. Applied to Single-Nucleotide Polymorphism (SNP) data, TDA can therefore generate a network that reveals the genetic relatedness of specific individuals, and can derive multiple ordered sub-groups ranging from one with a low patient concentration to one with a high patient concentration. Since it is widely accepted that T2D pathogenesis is affected by multiple genetic factors, we performed TDA on T2D data from the Korea Association REsource (KARE) project, a population-based, genome-wide association study of the Korean adult population. Because KARE data contain follow-up information about the incidence of T2D, we compared the T2D status of each individual at baseline with their status ten years later. For the TDA network-driven sub-groups, ordered by prevalence, we compared the ten-year T2D incidence rate among individuals initially without T2D. We found that the ordered sub-groups had significantly increasing incidence rates, linearly correlated with prevalence (p-value = 0.006914). Our results demonstrate the usefulness of TDA for identifying both genetic contributors (e.g., SNPs) and their interrelationships in the pathology of complex diseases.
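The sub-group comparison described in the abstract can be illustrated with a minimal sketch. It assumes each individual already carries a sub-group label (the TDA step itself is not shown), and it uses a plain Pearson correlation between sub-group prevalence and ten-year incidence; the abstract does not state which test the authors used, so that choice is an assumption. The function names (`subgroup_rates`, `pearson_r`, `make_group`) and all counts are hypothetical and made up for illustration; they are not KARE data.

```python
from math import sqrt


def subgroup_rates(records):
    """Compute (prevalence, incidence) per sub-group.

    records: iterable of (subgroup_id, t2d_at_baseline, t2d_at_followup).
    Prevalence = baseline cases / sub-group size.
    Incidence  = new cases at follow-up / individuals without T2D at baseline.
    """
    stats = {}
    for group, baseline, followup in records:
        s = stats.setdefault(group, {"n": 0, "cases": 0, "at_risk": 0, "new": 0})
        s["n"] += 1
        if baseline:
            s["cases"] += 1
        else:
            s["at_risk"] += 1
            if followup:
                s["new"] += 1
    # Skip sub-groups with nobody at risk, where incidence is undefined.
    return {
        g: (s["cases"] / s["n"], s["new"] / s["at_risk"])
        for g, s in stats.items()
        if s["at_risk"] > 0
    }


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def make_group(group, size, baseline_cases, new_cases):
    """Build toy records for one sub-group (hypothetical helper)."""
    healthy = size - baseline_cases - new_cases
    return (
        [(group, True, True)] * baseline_cases
        + [(group, False, True)] * new_cases
        + [(group, False, False)] * healthy
    )


# Made-up toy data: three sub-groups with rising baseline prevalence.
records = make_group(0, 10, 1, 1) + make_group(1, 10, 3, 2) + make_group(2, 10, 5, 3)

rates = subgroup_rates(records)
ordered = sorted(rates.values())  # order sub-groups by prevalence
r = pearson_r([p for p, _ in ordered], [i for _, i in ordered])
print(rates, r)
```

A positive `r` here only says that, in the toy data, sub-groups with more baseline cases also show more new cases among those initially unaffected; a significance test on real data would need a proper p-value computation in addition.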

Online publication date: Wed, 24-Apr-2019
