Revolutionizing knowledge organization with machine learning for metadata, classification, and retrieval in libraries
DOI:
https://doi.org/10.29352/mill0223e.37496Keywords:
classification; information retrieval; knowledge organization; machine learning; metadataAbstract
Introduction: This paper is going to explore how machine learning transformations will take library modernization a step further and help in knowledge organisation methods for better information retrieval processes.
Objective: To discuss how to make use of techniques in ML to develop the required library functions like metadata creation, classification systems, and information retrieval in an efficient and accurate manner.
Methods: The techniques used are based on a thorough knowledge of different machine learning algorithms, such as NLP for metadata generation and neural networks for classification.
Results: The case studies indeed prove that these techniques have been applied to and implemented in real library systems, as revealed by major libraries that have begun to implement ML technologies. These deployments lead to more efficiency, accuracy, and user experience. There are also issues highlighted in this research that could be problematic: biased training data, and transparency in algorithmic decisions.
Conclusion: There is a lot of promise in the future of libraries with the use of ML, and integration should be guided with a judicious balance, taking care of ethical issues.
Downloads
References
Ahmed, M., Mukhopadhyay, M., & Mukhopadhyay, P. (2023). Automated knowledge organization AI ML based subject indexing system for libraries. DESIDOC Journal of Library & Information Technology, 43(01), 45–54. https://doi.org/10.14429/djlit.43.01.18619
Audeh, B., Beigbeder, M., Largeron, C., & Ramírez-Cifuentes, D. (2020). Evaluating the usefulness of citation graph and document metadata in scientific document recommendation for neophytes. Proceedings of the 35th Annual ACM Symposium on Applied Computing, 681–689. https://doi.org/10.1145/3341105.3373886
Audeh, B., Beigbeder, M., Largeron, C., & Ramírez-Cifuentes, D. (2021). Improving exploratory information retrieval for neophytes. ACM SIGAPP Applied Computing Review, 20(4), 50–64. https://doi.org/10.1145/3447332.3447336
Baker, J. (2015). The British Library machine learning experiment. British Library Digital. https://shre.ink/3YZK
Das, R. K., & Islam, M. S. U. (2021). Application of artificial intelligence and machine learning in libraries: A systematic review. arXiv. https://arxiv.org/abs/2112.04573
Deolekar, R., & Dangare, A. (2018). Enterprise search: A new dimension in information retrieval. 2018 3rd International Conference for Convergence in Technology (I2CT), 1–6. https://doi.org/10.1109/I2CT.2018.8529602
Guerra-García, C., Pérez-González, H. G., Martínez-Pérez, F., Juárez-Ramírez, R., & Jiménez, S. (2023). Applying mechanisms of data profiling for assuring data quality in the software: A first approach. 2023 11th International Conference in Software Engineering Research and Innovation (CONISOFT), 108–115. https://doi.org/10.1109/CONISOFT58849.2023.00023
Guo, Q., Jin, S., Li, M., Yang, Q., Xu, K., Ju, Y., Zhang, J., Xuan, J., Liu, J., Su, Y., Xu, Q., & Liu, Y. (2020). Application of deep learning in ecological resource research: Theories, methods, and challenges. Science China Earth Sciences, 63(10), 1457–1474. https://doi.org/10.1007/s11430-019-9584-9
H. R, S. Rashmi, B. S, Prof. Anisha, & Kumar. P, Dr. R. (2019). Smart document analysis using AI-ML. International Journal of Innovative Research in Computer Science & Technology, 7(3), 54–70. https://doi.org/10.21276/ijircst.2019.7.3.6
Hjørland, B. (2021). Information retrieval and knowledge organization: A perspective from the philosophy of science. Information, 12(3), 135. https://doi.org/10.3390/info12030135
Kalisdha, A. (2024). The impact of artificial intelligence and machine learning in library and information science. International Journal of Research in Library Science, 10(1), 39–58. https://doi.org/10.26761/ijrls.10.1.2024.1733
Limbu, D. K., Connor, A., Pears, R., & MacDonell, S. (2006). Contextual relevance feedback in web information retrieval. Proceedings of the 1st International Conference on Information Interaction in Context - IIiX, 138. https://doi.org/10.1145/1164820.1164848
Miah, Md. S. U., Sulaiman, J., Sarwar, T. Bin, Naseer, A., Ashraf, F., Zamli, K. Z., & Jose, R. (2022). Sentence boundary extraction from scientific literature of electric double layer capacitor domain: Tools and techniques. Applied Sciences, 12(3), 1352. https://doi.org/10.3390/app12031352
Park, J., & Lu, C. (2009). Application of semi-automatic metadata generation in libraries: Types, tools, and techniques. Library & Information Science Research, 31(4), 225–231. https://doi.org/10.1016/j.lisr.2009.05.002
Pessin, V. Z., Santos, C. A. S., Yamane, L. H., Siman, R. R., Baldam, R. de L., & Júnior, V. L. (2023). A method of mapping process for scientific production using the Smart Bibliometrics. MethodsX, 11, 102367. https://doi.org/10.1016/j.mex.2023.102367
Qin, J. (2020). Knowledge organization and representation under the AI Lens. Journal of Data and Information Science, 5(1), 3–17. https://doi.org/10.2478/jdis-2020-0002
Rashid, F., Gargaare, S. M. A., Aden, A. H., & Abdi, A. (2022). Machine learning algorithms for document classification: Comparative analysis. International Journal of Advanced Computer Science and Applications, 13(4). https://doi.org/10.14569/IJACSA.2022.0130430
Rosa, J. D. L. (2024). Artificial Intelligence at the National Library of Norway. LUSTRE. https://lustre-network.net/artificial-intelligence-at-the-national-library-of-norway/
Soni, D. N. (2023). The role of artificial intelligence in information and library science: Opportunities and ethical considerations. International Journal of Research in Humanities & Soc. Sciences, 11(7), 41–43.
Weinryb-Grohsgal, L. (2020, July 22). Machine learning + libraries: A report on the state of the field. The Signal. The Library of Congress. https://shre.ink/3YF3
Wu, M., Brandhorst, H., Marinescu, M.-C., Lopez, J. M., Hlava, M., & Busch, J. (2023). Automated metadata annotation: What is and is not possible with machine learning. Data Intelligence, 5(1), 122–138. https://doi.org/10.1162/dint_a_00162
Xie, J. (2023). Research on information retrieval service innovation of university library. SHS Web of Conferences, 169, 01088. https://doi.org/10.1051/shsconf/202316901088
Yu, K., Gong, R., Sun, L., & Jiang, C. (2019). The application of artificial intelligence in smart library. Proceedings of the 2019 International Conference on Organizational Innovation (ICOI 2019). https://doi.org/10.2991/icoi-19.2019.124
Zamani, H., Diaz, F., Dehghani, M., Metzler, D., & Bendersky, M. (2022). Retrieval-enhanced machine learning. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2875–2886. https://doi.org/10.1145/3477495.3531722
Zhou, Z., Liu, Y., Yu, H., & Ren, L. (2020). The influence of machine learning-based knowledge management model on enterprise organizational capability innovation and industrial development. PLOS ONE, 15(12), e0242253. https://doi.org/10.1371/journal.pone.0242253
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Millenium - Journal of Education, Technologies, and Health

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who submit proposals for this journal agree to the following terms:
a) Articles are published under the Licença Creative Commons (CC BY 4.0), in full open-access, without any cost or fees of any kind to the author or the reader;
b) The authors retain copyright and grant the journal right of first publication, allowing the free sharing of work, provided it is correctly attributed the authorship and initial publication in this journal;
c) The authors are permitted to take on additional contracts separately for non-exclusive distribution of the version of the work published in this journal (eg, post it to an institutional repository or as a book), with an acknowledgment of its initial publication in this journal;
d) Authors are permitted and encouraged to publish and distribute their work online (eg, in institutional repositories or on their website) as it can lead to productive exchanges, as well as increase the impact and citation of published work
Documents required for submission
Article template (Editable format)

