Please use this identifier to cite or link to this item:
http://repository.i3l.ac.id/jspui/handle/123456789/913
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lim, Chrysania | - |
dc.date.accessioned | 2023-11-29T09:03:32Z | - |
dc.date.available | 2023-11-29T09:03:32Z | - |
dc.date.issued | 2023-06-12 | - |
dc.identifier.uri | http://repository.i3l.ac.id/jspui/handle/123456789/913 | - |
dc.description.abstract | Genome Wide Association Studies (GWAS) aims to find genomic variants that correlates with the occurrence of certain traits within a population. Various tools and algorithms can be utilized to conduct GWAS, among them are PLINK, Hail, and VariantSpark. In this study, a GWAS analysis pipeline was developed using PLINK, Hail, and VariantSpark to assess their performance in conduction GWAS association analysis. The Late-onset Alzheimer’s (LOAD) GWAS dataset from a previous study by Webster et al. (2009) was used on these softwares. PLINK ran Simple Association, Linear Regression, and Logistic Regression algorithms, while Hail ran Linear regression and Logistic regression. The novel Random Forest algorithm in VariantSpark was ran and compared with the previous algorithms. The softwares was evaluated based on SNP calling accuracy, RAM usage, and processing time. Overall, PLINK’s algorithms performed the best in terms of SNP calling accuracy, RAM usage, and processing time. Hail was comparable with PLINK in terms of RAM and processing time, but PLINK was more accurate in detecting variants. VariantSpark used a lot more RAM and processing time for the same performance as PLINK’s Simple Association algorithm. In conclusion, PLINK and Hail are great tools to conduct GWAS Association Testing. More testing with different datasets is needed for more accurate assessments of these tools. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Indonesia International Institute for Life Sciences | en_US |
dc.relation.ispartofseries | BM 23-001;T202306112 | - |
dc.subject | GWAS | en_US |
dc.subject | Late-onset Alzheimer’s (LOAD) | en_US |
dc.subject | PLINK | en_US |
dc.subject | Hail | en_US |
dc.subject | VariantSpark | en_US |
dc.title | Comparison of GWAS analysis pipelines using Hail/VariantSpark and PLINK on Late-Onset Alzheimer’s Dataset | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | Biomedicine |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Abstract.pdf | Abstract | 141.73 kB | Adobe PDF | View/Open |
BM 23-001_Chrysania Lim.pdf Restricted Access | Full Text | 3.23 MB | Adobe PDF | View/Open Request a copy |
Chapter 1.pdf | Chapter 1 | 150.83 kB | Adobe PDF | View/Open |
Cover.pdf | Cover | 123.22 kB | Adobe PDF | View/Open |
References.pdf | References | 255.55 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.