Model for automatic lexical disambiguation based on a hybrid measure
Submitted: 2025-06-27
|Accepted: 2025-09-21
|Published: 2025-11-20
Copyright (c) 2025 Journal of Computer-Assisted Linguistic Research

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Downloads
Keywords:
lexical ambiguity, word sense disambiguation, natural language processing, computer-based linguistic resources
Supporting agencies:
Agencia Nacional de Investigación y Desarrollo (ANID) del Ministerio de Ciencia, Tecnología, Conocimiento e Innovación del Gobierno de Chile
Ministerio de Ciencia, Innovación y Universidades del Gobierno de España (MICIU/AEI/10.13039/501100011033)
Abstract:
This research presents the development of a more robust model for measuring semantic similarity than those currently available for solving the problem of word sense disambiguation applied to natural language processing. The model is based both linguistically and statistically on the interaction of two approaches to taxonomic exploration: path-based and information content, through the incorporation of FunGramKB as a sense inventory. In terms of evaluation, the proposed similarity measure consistently generated efficient results from a linguistic perspective in the automatic lexical disambiguation process.
References:
Aussenac-Gilles, Nathalie, and Fabien Gandon. 2013. “From the knowledge acquisition bottleneck to the knowledge acquisition overflow: A brief French history of knowledge acquisition.” International Journal of Human-Computer Studies 71 (2): 157-65. https://doi.org/10.1016/j.ijhcs.2012.10.009
Buchanan, Bruce, and David Wilkins. 1993. Readings in Knowledge Acquisition and Learning: Automating the Construction and Improvement of Expert Systems. San Mateo, CA: Morgan Kaufmann.
Croft, William, and David Alan Cruse. 2008. Lingüística cognitiva. Madrid: Akal.
Daelemans, Walter, and Véronique Hoste. 2002. “Evaluation of machine learning methods for natural language processing tasks.” In Proceedings of the Third International Conference on Language Resources and Evaluation, Las Palmas de Gran Canaria. http://www.lrec-conf.org/proceedings/lrec2002/pdf/94.pdf.
Gangemi, Aldo, Nicola Guarino, Claudio Masolo, and Alessandro Oltramari. 2003. “Sweetening WordNet with DOLCE.” AI Magazine 24 (3): 13-24. https://doi.org/10.1609/aimag.v24i3.1715
Han, Jiawei, Micheline Kamber, and Jian Pei. 2012. Data Mining: Concepts and Techniques. 3rd ed. Waltham, MA: Morgan Kaufmann.
Hoste, Véronique, Iris Hendrickx, Walter Daelemans, and Antal van den Bosch. 2002. “Parameter optimization for machine learning of word sense disambiguation.” Natural Language Engineering 8 (4): 311-25.
Leacock, Claudia, and Martin Chodorow. 1998. “Combining local context and WordNet similarity for word sense identification.” In WordNet: An Electronic Lexical Database, edited by Christiane Fellbaum, 265-83. Cambridge, MA: MIT Press.
Lu, Wenpeng, Fanqing Meng, Shoujin Wang, Guoqiang Zhang, Xu Zhang, Antai Ouyang, and Xiaodong Zhang. 2019. “Graph-based Chinese word sense disambiguation with multi-knowledge integration.” Computers, Materials & Continua61 (1): 197-212.
Moreno-Sandoval, Antonio. 1996. Lingüística computacional. Madrid: Síntesis.
Nevzorova, Olga, Alfiya Galieva, and Vladimir Nevzorov. 2015. “Sentence context and resolving lexical ambiguity for special groups of words on the base of corpus data.” Procedia – Social and Behavioral Sciences 198: 359-66.
Núñez-Torres, Fredy. 2021. Diseño y desarrollo de un modelo de desambiguación léxica automática para el procesamiento del lenguaje natural. Tesis doctoral, Pontificia Universidad Católica de Chile.
Núñez Torres, Fredy, and María Beatriz Pérez Cabello de Alba. 2023. “Desarrollo de un sistema de aprendizaje automático supervisado para la desambiguación léxica automática utilizando DAMIEN (Data Mining Encountered).” Revista Electrónica de Lingüística Aplicada 21 (1): 150-78. https://doi.org/10.58859/rael.v21i1.504
Pasini, Tommaso. 2020. “The knowledge acquisition bottleneck problem in multilingual word sense disambiguation.” In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI-20).
Patwardhan, Siddharth, Satanjeev Banerjee, and Ted Pedersen. 2003. “Using measures of semantic relatedness for word sense disambiguation.” In Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics.
Periñán-Pascual, Carlos. 2012. “En defensa del procesamiento del lenguaje natural fundamentado en la lingüística teórica.” Onomázein 26: 13-48.
Periñán-Pascual, Carlos. 2015. “The underpinnings of a composite measure for automatic term extraction: The case of SRC.” Terminology 21 (2): 151-79.
Periñán-Pascual, Carlos, and Francisco Arcas-Túnez. 2007. “Cognitive modules of an NLP knowledge base for language understanding.” Procesamiento del Lenguaje Natural 39: 197-204.
Periñán-Pascual, Carlos, and Francisco Arcas-Túnez. 2010. “The architecture of FunGramKB.” In Proceedings of the Seventh International Conference on Language Resources and Evaluation, 2667-74. Valletta, Malta: European Language Resources Association (ELRA).
Periñán-Pascual, Carlos, and Ricardo Mairal-Usón. 2010. “La gramática de COREL: un lenguaje de representación conceptual.” Onomázein 21: 11-45.
Rada, Roy, Hafedh Mili, and María Blettner. 1989. “Development and application of a metric on semantic nets.” IEEE Transactions on Systems, Man and Cybernetics 19 (1): 17-30.
Seco, Nuno, Tony Veale, and Jer Hayes. 2004. “An intrinsic information content metric for semantic similarity in WordNet.” In Proceedings of the 16th European Conference on Artificial Intelligence (ECAI). https://doi.org/10.5555/3000001.3000272
Van Gompel, Maarten, and Antal van den Bosch. 2013. “WSD2: Parameter optimisation for memory-based cross-lingual word-sense disambiguation.” In Proceedings of the Second Joint Conference on Lexical and Computational Semantics (SEM 2013), 183-87.
Wagner, Christian. 2006. “Breaking the knowledge acquisition bottleneck through conversational knowledge management.” Information Resources Management Journal 19 (1): 70-83
Zhou, Zili, Yanna Wang, and Junzhong Gu. 2008. “New model of semantic similarity measuring in WordNet.” In Proceedings of the Second International Conference on Future Generation Communication and Networking Symposia.https://doi.org/10.1109/FGCNS.2008.4730937


