Abstract
Background: Alzheimer's disease (AD) is a complex and severe neurodegenerative disorder that poses a significant challenge to global health. Its hallmark pathological features are the deposition of β-amyloid plaques and the formation of neurofibrillary tangles. An early and accurate biomarker model for AD diagnosis, built with machine learning and bioinformatics analysis, is therefore needed.
Methods: Single-cell data analysis was used to identify cellular subtypes that differed significantly between the diseased and control groups. After NK cells were identified, hdWGCNA and cellular communication analysis were conducted to pinpoint the NK cell subset with the most robust communication effects. Three machine learning algorithms (LASSO, Random Forest, and SVM-RFE) were then used jointly to screen for NK cell subset modular genes highly associated with AD. A logistic regression diagnostic model was built on these characteristic genes, and a protein-protein interaction (PPI) network of the model genes was established. Unsupervised cluster analysis was conducted to classify AD subtypes based on the model genes, followed by analysis of immune infiltration in the different subtypes. Finally, Spearman correlation analysis was used to explore the correlation between model genes and immune cells, as well as inflammatory factors.
Results: Three genes (RPLP2, RPSA, and RPL18A) were identified as highly associated with AD. A nomogram based on these genes provides practical assistance in diagnosing AD and predicting patient outcomes. The interconnected genes screened through PPI are linked to ribosome metabolism and the COVID-19 pathway. Based on the expression of modular genes, unsupervised cluster analysis revealed three distinct AD subtypes. Subtype C3, characterized by high expression, correlates with immune cell infiltration and elevated levels of inflammatory factors. This suggests that the immune environment in AD patients is closely linked to elevated expression of the model genes.
Conclusion: This study established a diagnostic model for AD patients and examined the role of the model genes in shaping the immune environment of individuals with AD. These findings offer insights into early AD diagnosis and patient management strategies.
Keywords: Alzheimer's disease, NK cell, machine learning, diagnostic signature, hdWGCNA, CellChat, single-cell RNA-seq, immune cell subtype distribution pattern.
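
The final step described in Methods, Spearman correlation between model gene expression and immune cell abundance, can be illustrated with a minimal sketch. This is not the authors' code: the function names are hypothetical, the numbers are toy values invented for illustration, and a real analysis would normally rely on an established statistics package. Spearman's coefficient is computed here as the Pearson correlation of the rank-transformed values, with tied values sharing their average rank.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j over the run of values tied with the one at position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation applied to the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy, made-up values (assumption): expression of one model gene and an
# immune cell infiltration score across six hypothetical samples.
gene_expression = [2.1, 3.4, 1.8, 5.0, 4.2, 2.9]
immune_score = [0.10, 0.35, 0.12, 0.40, 0.38, 0.20]

rho = spearman(gene_expression, immune_score)
print(f"Spearman rho = {rho:.3f}")
```

Because the coefficient depends only on ranks, it captures monotonic rather than strictly linear association, which suits expression and infiltration scores measured on different scales.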