The role of ontologies in machine learning: a case study of gene ontology

Authors

DOI:

https://doi.org/10.47989/ir30iConf47575

Keywords:

Ontologies, Gene Ontology, Machine Learning, Knowledge organization systems, Knowledgebases

Abstract

Introduction. Ontologies as knowledgebases have been heavily applied in computational biological studies by implementing into ML models for purposes such as disease-gene associations identification.

Method. We conduct a case study using gene ontology (GO) annotation data and three ML models to replicate the prediction of autism spectrum disorder (ASD)-causing genes.

Analysis. Data were collected from GO and Simmons Foundation Autism Research Initiative (SFARI). The semantic similarities between GO annotation terms on gene products were calculated.

Results. The best-performing model can reach an AUC of .85, which means using GO annotation data for ASD disease-gene prediction can receive a significantly accurate result. However, we stress the importance of constructing knowledgebases in adapting to LLMs and the role of LIS professionals in curating community knowledge for interoperability and reuse.

Conclusions. Biomedical ontologies play a crucial role in the discovery of biomedical knowledge. Knowledge organization and computer science domains require more communication and synchronization in the face of emerging AI and ML technologies.

Published

2025-03-11

How to Cite

Liu, Q., & Qin, J. (2025). The role of ontologies in machine learning: a case study of gene ontology. Information Research an International Electronic Journal, 30(iConf), 108–122. https://doi.org/10.47989/ir30iConf47575

Issue

Section

Peer-reviewed papers

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.