Revolutionizing knowledge organization with machine learning for metadata, classification, and retrieval in libraries

Authors

DOI:

https://doi.org/10.29352/mill0223e.37496

Keywords:

classification; information retrieval; knowledge organization; machine learning; metadata

Abstract

Introduction: This paper is going to explore how machine learning transformations will take library modernization a step further and help in knowledge organisation methods for better information retrieval processes.

Objective: To discuss how to make use of techniques in ML to develop the required library functions like metadata creation, classification systems, and information retrieval in an efficient and accurate manner.

Methods: The techniques used are based on a thorough knowledge of different machine learning algorithms, such as NLP for metadata generation and neural networks for classification.

Results: The case studies indeed prove that these techniques have been applied to and implemented in real library systems, as revealed by major libraries that have begun to implement ML technologies. These deployments lead to more efficiency, accuracy, and user experience. There are also issues highlighted in this research that could be problematic: biased training data, and transparency in algorithmic decisions.

Conclusion: There is a lot of promise in the future of libraries with the use of ML, and integration should be guided with a judicious balance, taking care of ethical issues.

Downloads

Download data is not yet available.

References

Ahmed, M., Mukhopadhyay, M., & Mukhopadhyay, P. (2023). Automated knowledge organization AI ML based subject indexing system for libraries. DESIDOC Journal of Library & Information Technology, 43(01), 45–54. https://doi.org/10.14429/djlit.43.01.18619

Audeh, B., Beigbeder, M., Largeron, C., & Ramírez-Cifuentes, D. (2020). Evaluating the usefulness of citation graph and document metadata in scientific document recommendation for neophytes. Proceedings of the 35th Annual ACM Symposium on Applied Computing, 681–689. https://doi.org/10.1145/3341105.3373886

Audeh, B., Beigbeder, M., Largeron, C., & Ramírez-Cifuentes, D. (2021). Improving exploratory information retrieval for neophytes. ACM SIGAPP Applied Computing Review, 20(4), 50–64. https://doi.org/10.1145/3447332.3447336

Baker, J. (2015). The British Library machine learning experiment. British Library Digital. https://shre.ink/3YZK

Das, R. K., & Islam, M. S. U. (2021). Application of artificial intelligence and machine learning in libraries: A systematic review. arXiv. https://arxiv.org/abs/2112.04573

Deolekar, R., & Dangare, A. (2018). Enterprise search: A new dimension in information retrieval. 2018 3rd International Conference for Convergence in Technology (I2CT), 1–6. https://doi.org/10.1109/I2CT.2018.8529602

Guerra-García, C., Pérez-González, H. G., Martínez-Pérez, F., Juárez-Ramírez, R., & Jiménez, S. (2023). Applying mechanisms of data profiling for assuring data quality in the software: A first approach. 2023 11th International Conference in Software Engineering Research and Innovation (CONISOFT), 108–115. https://doi.org/10.1109/CONISOFT58849.2023.00023

Guo, Q., Jin, S., Li, M., Yang, Q., Xu, K., Ju, Y., Zhang, J., Xuan, J., Liu, J., Su, Y., Xu, Q., & Liu, Y. (2020). Application of deep learning in ecological resource research: Theories, methods, and challenges. Science China Earth Sciences, 63(10), 1457–1474. https://doi.org/10.1007/s11430-019-9584-9

H. R, S. Rashmi, B. S, Prof. Anisha, & Kumar. P, Dr. R. (2019). Smart document analysis using AI-ML. International Journal of Innovative Research in Computer Science & Technology, 7(3), 54–70. https://doi.org/10.21276/ijircst.2019.7.3.6

Hjørland, B. (2021). Information retrieval and knowledge organization: A perspective from the philosophy of science. Information, 12(3), 135. https://doi.org/10.3390/info12030135

Kalisdha, A. (2024). The impact of artificial intelligence and machine learning in library and information science. International Journal of Research in Library Science, 10(1), 39–58. https://doi.org/10.26761/ijrls.10.1.2024.1733

Limbu, D. K., Connor, A., Pears, R., & MacDonell, S. (2006). Contextual relevance feedback in web information retrieval. Proceedings of the 1st International Conference on Information Interaction in Context - IIiX, 138. https://doi.org/10.1145/1164820.1164848

Miah, Md. S. U., Sulaiman, J., Sarwar, T. Bin, Naseer, A., Ashraf, F., Zamli, K. Z., & Jose, R. (2022). Sentence boundary extraction from scientific literature of electric double layer capacitor domain: Tools and techniques. Applied Sciences, 12(3), 1352. https://doi.org/10.3390/app12031352

Park, J., & Lu, C. (2009). Application of semi-automatic metadata generation in libraries: Types, tools, and techniques. Library & Information Science Research, 31(4), 225–231. https://doi.org/10.1016/j.lisr.2009.05.002

Pessin, V. Z., Santos, C. A. S., Yamane, L. H., Siman, R. R., Baldam, R. de L., & Júnior, V. L. (2023). A method of mapping process for scientific production using the Smart Bibliometrics. MethodsX, 11, 102367. https://doi.org/10.1016/j.mex.2023.102367

Qin, J. (2020). Knowledge organization and representation under the AI Lens. Journal of Data and Information Science, 5(1), 3–17. https://doi.org/10.2478/jdis-2020-0002

Rashid, F., Gargaare, S. M. A., Aden, A. H., & Abdi, A. (2022). Machine learning algorithms for document classification: Comparative analysis. International Journal of Advanced Computer Science and Applications, 13(4). https://doi.org/10.14569/IJACSA.2022.0130430

Rosa, J. D. L. (2024). Artificial Intelligence at the National Library of Norway. LUSTRE. https://lustre-network.net/artificial-intelligence-at-the-national-library-of-norway/

Soni, D. N. (2023). The role of artificial intelligence in information and library science: Opportunities and ethical considerations. International Journal of Research in Humanities & Soc. Sciences, 11(7), 41–43.

Weinryb-Grohsgal, L. (2020, July 22). Machine learning + libraries: A report on the state of the field. The Signal. The Library of Congress. https://shre.ink/3YF3

Wu, M., Brandhorst, H., Marinescu, M.-C., Lopez, J. M., Hlava, M., & Busch, J. (2023). Automated metadata annotation: What is and is not possible with machine learning. Data Intelligence, 5(1), 122–138. https://doi.org/10.1162/dint_a_00162

Xie, J. (2023). Research on information retrieval service innovation of university library. SHS Web of Conferences, 169, 01088. https://doi.org/10.1051/shsconf/202316901088

Yu, K., Gong, R., Sun, L., & Jiang, C. (2019). The application of artificial intelligence in smart library. Proceedings of the 2019 International Conference on Organizational Innovation (ICOI 2019). https://doi.org/10.2991/icoi-19.2019.124

Zamani, H., Diaz, F., Dehghani, M., Metzler, D., & Bendersky, M. (2022). Retrieval-enhanced machine learning. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2875–2886. https://doi.org/10.1145/3477495.3531722

Zhou, Z., Liu, Y., Yu, H., & Ren, L. (2020). The influence of machine learning-based knowledge management model on enterprise organizational capability innovation and industrial development. PLOS ONE, 15(12), e0242253. https://doi.org/10.1371/journal.pone.0242253

Downloads

Published

2026-06-23

How to Cite

Jain, R., Palaniappan, D., Parmar, K., & Premavathi, T. (2026). Revolutionizing knowledge organization with machine learning for metadata, classification, and retrieval in libraries. Millenium - Journal of Education, Technologies, and Health, 2(23e), e37496. https://doi.org/10.29352/mill0223e.37496

Issue

Section

Engineering, Technology, Management and Tourism