Please use this identifier to cite or link to this item: http://repository.i3l.ac.id/jspui/handle/123456789/913
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLim, Chrysania-
dc.date.accessioned2023-11-29T09:03:32Z-
dc.date.available2023-11-29T09:03:32Z-
dc.date.issued2023-06-12-
dc.identifier.urihttp://repository.i3l.ac.id/jspui/handle/123456789/913-
dc.description.abstractGenome Wide Association Studies (GWAS) aims to find genomic variants that correlates with the occurrence of certain traits within a population. Various tools and algorithms can be utilized to conduct GWAS, among them are PLINK, Hail, and VariantSpark. In this study, a GWAS analysis pipeline was developed using PLINK, Hail, and VariantSpark to assess their performance in conduction GWAS association analysis. The Late-onset Alzheimer’s (LOAD) GWAS dataset from a previous study by Webster et al. (2009) was used on these softwares. PLINK ran Simple Association, Linear Regression, and Logistic Regression algorithms, while Hail ran Linear regression and Logistic regression. The novel Random Forest algorithm in VariantSpark was ran and compared with the previous algorithms. The softwares was evaluated based on SNP calling accuracy, RAM usage, and processing time. Overall, PLINK’s algorithms performed the best in terms of SNP calling accuracy, RAM usage, and processing time. Hail was comparable with PLINK in terms of RAM and processing time, but PLINK was more accurate in detecting variants. VariantSpark used a lot more RAM and processing time for the same performance as PLINK’s Simple Association algorithm. In conclusion, PLINK and Hail are great tools to conduct GWAS Association Testing. More testing with different datasets is needed for more accurate assessments of these tools.en_US
dc.language.isoenen_US
dc.publisherIndonesia International Institute for Life Sciencesen_US
dc.relation.ispartofseriesBM 23-001;T202306112-
dc.subjectGWASen_US
dc.subjectLate-onset Alzheimer’s (LOAD)en_US
dc.subjectPLINKen_US
dc.subjectHailen_US
dc.subjectVariantSparken_US
dc.titleComparison of GWAS analysis pipelines using Hail/VariantSpark and PLINK on Late-Onset Alzheimer’s Dataseten_US
dc.typeThesisen_US
Appears in Collections:Biomedicine

Files in This Item:
File Description SizeFormat 
Abstract.pdfAbstract141.73 kBAdobe PDFView/Open
BM 23-001_Chrysania Lim.pdf
  Restricted Access
Full Text3.23 MBAdobe PDFView/Open Request a copy
Chapter 1.pdfChapter 1150.83 kBAdobe PDFView/Open
Cover.pdfCover123.22 kBAdobe PDFView/Open
References.pdfReferences255.55 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.