
Inma Hernáez-Rioja, Jose A. Gonzalez-Lopez, Heidi Christensen 

Special Issue on Applications of Speech and Language Technologies in Healthcare (2023)

Inge Salomons; Eder del Blanco; Eva Navas; Inma Hernáez; Xabier de Zuazo 

Frame-Based Phone Classification Using EMG Signals (2023)

Salomons, I., del Blanco, E., Navas, E., Hernáez, I. 

Spanish Phone Confusion Analysis for EMG-Based Silent Speech Interfaces (2023)

Ander Cejudo, Arantza Casillas, Alicia Pérez, Maite Oronoz, Daniel Cobos

Cause of Death estimation from Verbal Autopsies: Is the Open Response redundant or synergistic? (2023)

Artificial Intelligence In Medicine

Farwell, A., & Mees, L. (2023). A Taste of Spain: Images of Rioja Wine in Britain and America (1890-1960). Global Food History, 1–26.

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Representation exploration and Deep learning applied to the early detection of pathological gambling risks (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Iakes Goenaga, Edgar Andrés, Koldo Gojenola, Aitziber Atutxa

Advances in Monolingual and Crosslingual Automatic Disability Annotation in Spanish (2023)

BMC Bioinformatics volume 24, Article number: 265

Xabier Larrayoz, Nuria Lebeña, Arantza Casillas, Alicia Pérez

Eating Disorders Detection by means of Deep Learning (2023)

Accepted. MentalRiskES at IberLEF 2023: Early Detection of Mental Disorders Risk in Spanish

Gordo, I., Ibarra, I., Iruskieta, M. and Limpo, T.

Azkar idatzi eta ortografia onarekin! (2023)


Ibarra, I., Martínez-Arbelaiz, A., Arriola, J.M., Iruskieta, M.

El proceso de escritura a mano en 2º de primaria: ¿Cómo interpretar las pausas observadas? (2023)

I Congreso Internacional Infancia, Adolescencia y Juventud. INFAPOST. January 19 and 20.

Irune Ibarra, Asunción Martínez, Jose Maria Arriola

Building bridges between research and schools: Feedback to primary education teachers on handwriting (2023)

20th Biennial EARLI Conference (EARLI 2023), Thessaloniki, Greece, August 23. Book of Abstracts (p. 149).

Irune Ibarra, Asunción Martínez-Arbelaiz, Jose Maria Arriola

Los corpus lingüísticos y los videos de la escritura a mano como herramientas para la mejora de la escritura cursiva (2023)

Actas del 6º Congreso Mundial de Educación EDUCA 2023

Iker De la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa

Overview of ClinAIS at IberLEF 2023: Automatic Identification of Sections in Clinical Documents in Spanish (2023)

Procesamiento del Lenguaje Natural, Revista nº 71, September 2023

Iker de la Iglesia, María Vivó, Paula Chocrón, Gabriel de Maeztu, Koldo Gojenola, Aitziber Atutxa

An Open Source Corpus and Automatic Tool for Section Identification in Spanish Health Records (2023)

Journal of Biomedical Informatics

Oscar Sainz, Oier Lopez de Lacalle, Eneko Agirre, German Rigau

What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories (2023)

In Proceedings of the 12th Global Wordnet Conference, pages 331–342, University of the Basque Country, Donostia - San Sebastian, Basque Country. Global Wordnet Association.

Jose Maria Arriola, Jon Alkorta, Ekain Arrieta, Mikel Iruskieta

Towards automatic essay scoring of Basque language texts from a rule-based approach based on curriculum-aware systems (2023)

Proceedings of the NoDaLiDa 2023 Workshop on Constraint Grammar - Methods, Tools and Applications. Eckhard Bick, Trond Trosterud, Tanel Alumäe (Editors)

Anastasia Klimovich-Gray, Giovanni Di Liberto, Lucia Amoruso, Ander Barrena, Eneko Agirre, Nicola Molinaro

Increased top-down semantic processing in natural speech linked to better reading in dyslexia (2023)


Aitor Ormazabal, Mikel Artetxe, Aitor Soroa

CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models (2023)

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

J.M. Arriola, M. Iruskieta, I. Ibarra, A. Martínez

Semiautomatic Study of Handwriting Development in Basque Children at Primary School (2023)

M. Pikhart, B. Klimova, F. Meunier, I. Ibarra, F. Suñer, K. Zamborova, M. V. Soulé, R. Bartolome, A. Parmaxi, J.M. Arriola

A Systematic Review of the Cognitive Impact of Digital Media Modalities on Reading Comprehension in L2 (2023)

Investigaciones Sobre Lectura, 18(2), 56-87.

Blanca Calvo Figueras, Irene Baucells, Tommaso Caselli

Dynamic Stance: Modeling Discussions by Labeling the Interactions (2023)

Findings of the Association for Computational Linguistics: EMNLP 2023

Masson, M., Roose, P., Sallaberry, C., Agerri, R., Bessagnet, MN., Lacayrelle, A.L.P

APs: A Proxemic Framework for Social Media Interactions Modeling and Analysis (2023)

In: Crémilleux, B., Hess, S., Nijssen, S. (eds) Advances in Intelligent Data Analysis XXI. IDA 2023. Lecture Notes in Computer Science, vol 13876. Springer, Cham.

Joseba Fernandez de Landa, Rodrigo Agerri (2023). HiTZ-IXA at PoliticES 2023: Document and Sentence Level Text Representations for Demographic Characteristics and Political Ideology Detection. In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), Jaén, Spain, September 2023.

Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau and Anar Yeginbergenova (2023). HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine. In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing.

Roberto Centeno and Rodrigo Agerri (2023). Overview of NLP-MisInfo 2023: Workshop on NLP applied to Misinformation. In Proceedings of the Workshop on NLP applied to Misinformation, co-located with the 39th International Conference of the Spanish Society for Natural Language Processing (SEPLN 2023).

Nayla Escribano, German Rigau, Rodrigo Agerri, A modular approach for multilingual timex detection and normalization using deep learning and grammar-based methods, Knowledge-Based Systems, Volume 273, 2023, 110612, ISSN 0950-7051. Abstract: Detecting and normalizing temporal expressions is an essential step for many NLP tasks. While a variety of methods have been proposed for detection, best normalization approaches rely on hand-crafted rules. Furthermore, most of them have been designed only for English. In this paper we present a modular multilingual temporal processing system combining a fine-tuned Masked Language Model for detection, and a grammar-based normalizer. We experiment in Spanish and English and compare with HeidelTime, the state-of-the-art in multilingual temporal processing. We obtain best results in gold timex normalization, timex detection and type recognition, and competitive performance in the combined TempEval-3 relaxed value metric. A detailed error analysis shows that detecting only those timexes for which it is feasible to provide a normalization is highly beneficial in this last metric. This raises the question of which is the best strategy for timex processing, namely, leaving undetected those timexes for which is not easy to provide normalization rules or aiming for high coverage. Keywords: Temporal processing; Multilingualism; Sequence labeling; Grammar-based approaches; Deep learning; Natural language processing

Rodrigo Agerri, Eneko Agirre

Lessons learned from the evaluation of Spanish Language Models (2023)

Procesamiento del Lenguaje Natural (70), pp 157-170

Gorka Urbizu, Iñaki San Vicente, Xabier Saralegi, Rodrigo Agerri, Aitor Soroa

Scaling Laws for BERT in Low-Resource Settings (2023)

Findings of the Association for Computational Linguistics: ACL 2023

Iker García, Rodrigo Agerri, German Rigau

T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks (2023)

Findings of the Association for Computational Linguistics: EMNLP 2023

Oscar Sainz, Jon Ander Campos, Iker García, Julen Etxaniz, Oier Lopez de Lacalle, Eneko Agirre

NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark (2023)

Findings of the Association for Computational Linguistics: EMNLP 2023

Vincent Vandeghinste, Dimitar Shterionov, Mirella De Sisto, Aoife Brady, Mathieu De Coster, Lorraine Leeson, Josep Blat, Frankie Picron, Marcello Paolo Scipioni, Aditya Parikh, Louis ten Bosch, John O’Flaherty, Joni Dambre, Jorn Rijckaert, Bram Vanroy, Victor Ubieto Nogales, Santiago Egea Gomez, Ineke Schuurman, Gorka Labaka, Adrián Núñez-Marcos, Irene Murtagh, Euan McGill, Horacio Saggion. 2023. SignON: Sign Language Translation. Progress and challenges. In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, pages 501–502, Tampere, Finland. European Association for Machine Translation.

Harritxu Gete, Thierry Etchegoyhen, and Gorka Labaka. 2023. What Works When in Context-aware Neural Machine Translation? In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, pages 147–156, Tampere, Finland. European Association for Machine Translation.

Harritxu Gete, Thierry Etchegoyhen, and Gorka Labaka. 2023. Targeted Data Augmentation Improves Context-aware Neural Machine Translation. In Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track, pages 298–312, Macau SAR, China. Asia-Pacific Association for Machine Translation.

María Jesús Aranzabe, Igone Zabala, Izaskun Aldezabal

Goi-mailako testu akademikoak lantzeko baliabideak eta tresnak (2023)

II. CLARIAH-EUS workshop: Europako ikerketa azpiegiturekin lotuta egongo den euskararako ikerketa azpiegitura eraikitzen. Donostia, November 23, 2023. (Poster presented at the workshop.)

Kuzman, Taja ; Ljubešić, Nikola ; Erjavec, Tomaž ; Kopp, Matyáš ; Ogrodniczuk, Maciej ; Osenova, Petya ; Rayson, Paul ; Vidler, John ; Agerri, Rodrigo ; Agirrezabal, Manex ; Agnoloni, Tommaso ; Aires, José ; Albini, Monica ; Alkorta, Jon ; Antiba-Cartazo, Iván ; Arrieta, Ekain ; Barcala, Mario ; Bardanca, Daniel ; Barkarson, Starkaður ; Bartolini, Roberto ; Battistoni, Roberto ; Bel, Nuria ; Bonet Ramos, Maria del Mar ; Calzada Pérez, María ; Cardoso, Aida ; Çöltekin, Çağrı ; Coole, Matthew ; Darģis, Rober

Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.0 (2023)

Slovenian language resource repository CLARIN.SI

Blanka Klimova, Marcel Pikhart, Katerina Fronckova, Christina Sanchez-Stockhammer, Yulia Stukalina, Mikel Iruskieta, Kübra Okumuş Dağdeler, Eve Lejot, Antigoni Parmaxi, Rocío Bartolomé Rodríguez, Antonio Pareja-Lora

Analysis of foreign language teachers' attitudes towards digital teaching in the European Union countries (2023)

Sustainable Multilingualism 23

Begoña Altuna, Goutham Karunakaran, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli

CLinkaRT at EVALITA 2023: Overview of the Task on Linking a Lab Result to its Test Event in the Clinical Domain (2023)

Proceedings of the Eighth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2023), Parma 2023.

Begoña Altuna, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Roberto Zanoli, Manuela Speranza, Bernardo Magnini, Alberto Lavelli, Goutham Karunakaran

Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements (2023)

Procesamiento del Lenguaje Natural, Revista nº 71, pp. 313-320, September 2023.

Itziar Gonzalez-Dios, Javier Alvez, and German Rigau

Exploiting Metonymy from Available Knowledge Resources (2023)

20th International Conference, CICLing 2019, La Rochelle, France, April 7–13, 2019, Revised Selected Papers, Part I. Lecture Notes in Computer Science book series (LNCS, volume 13451), pp 34-43

Álvez, J., Gonzalez-Dios, I., & Rigau, G. (2023, January). Towards Effective Correction Methods Using WordNet Meronymy Relations. In Proceedings of the 12th Global Wordnet Conference (pp. 31-40).

Margot Madina, Itziar Gonzalez-Dios, Melanie Siegel (2023) Easy-to-Read Language: baliabide linguistikoen eta testuen egokitzapena eta tresna automatikoen garapena. V. IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ Kongresuko artikulu-bilduma: Giza Zientziak eta Artea, 35-42.

Unai Atutxa-Barrenetxea, Mikel Iruskieta

Taller de resumen de textos en euskera basado en la coevaluación y feedback automático (2023)


Iker García, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models (2023)

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Iñigo Alonso, Eneko Agirre

Automatic Logical Forms improve fidelity in Table-to-Text generation (2023)

Expert Systems with Applications, Volume 238, Part D, 15 March 2024, 121869

Irene Baucells de la Peña, Blanca Calvo Figueras, Marta Villegas, Oier Lopez de Lacalle

Entailment-based Task Transfer for Catalan Text Classification in Small Data Regimes (2023)

Procesamiento del Lenguaje Natural. v. 71, p. 165-177, sep. 2023

Jose Mari Arriola, Mikel Iruskieta, Irune Ibarra, Asunción Martínez

Semiautomatic Study of Handwriting Development in Basque Children at Primary School (2023)

Conference: The European Conference on Language Learning 2023

Jeremy Barnes

Sentiment and Emotion Classification in Low-resource Settings (2023)

Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis

Jeremy Barnes, Samia Touileb, Petter Mæhlum, Pierre Lison

Identifying Token-Level Dialectal Features in Social Media (2023)

Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)

Margot Madina, Itziar Gonzalez-Dios and Melanie Siegel (2023) Easy-to-Read in Germany: a Survey on its Current State and Available Resources. To appear in the proceedings of the 10th Language & Technology Conference.

Madina, M., Gonzalez-Dios, I., & Siegel, M. (2023, July). Easy-to-Read Language Resources and Tools for three European Languages. In Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments (pp. 693-699).

Celia Soler Uguet, Nora Aranberri

Exploring politeness control in NMT: fine-tuned vs. multi-register models in Castilian Spanish (2023)

Revista Procesamiento del Lenguaje Natural, 70, pp. 199-212.

Galder Gonzalez Larrañaga, Olatz Perez de Viñaspre Garralda

Nor da nor Lur Hiztegi Entziklopedikoan?: euskarazko lehenengo entziklopediaren demografia digital alderatua (2023)

Uztaro: giza eta gizarte-zientzien aldizkaria, no. 124, pp. 25-49

Juan Martinez-Romo, Lourdes Araujo, Xabier Larrayoz, Maite Oronoz, Alicia Pérez

OBSER-MENH at eRisk 2023: Deep Learning-Based Approaches for Symptom Detection in Depression and Early Identification of Pathological Gambling Indicators (2023)

Accepted. Experimental IR Meets Multilinguality, Multimodality, and Interaction: 14th International Conference of the CLEF Association, CLEF 2023, Springer International Publishing, Thessaloniki, Greece.

Jordan Koontz, Maite Oronoz and Alicia Pérez

Evaluating Data Augmentation for Medication Identification in Clinical Notes (2023)

International Conference on Recent Advances in Natural Language Processing (RANLP) (Accepted)

Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heintz, Dan Roth

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey (2023)

ACM Computing Surveys. 27 June 2023

R. Agerri, E. Agirre, I. Aldabe, N. Aranberri, J.M. Arriola, A. Atutxa, G. Azkune, J.A. Campos, A. Casillas, A. Estarrona, A. Farwell, I. Goenaga, J. Goikoetxea, K. Gojenola, I. Hernáez, M. Iruskieta, G. Labaka, O. Lopez de Lacalle, E. Navas, M. Oronoz, A. Otegi, A. Pérez, O. Perez de Viñaspre, G. Rigau, A. Salaberria, J. Sanchez, I. Saratxaga, A. Soroa

State-of-the-Art in Language Technology and Language-centric Artificial Intelligence (2023)

In: Rehm, G., Way, A. (eds) European Language Equality. Cognitive Technologies. Springer, Cham.

Adrián Núñez-Marcos, Olatz Perez-de-Viñaspre, Gorka Labaka

A survey on Sign Language machine translation (2023)

Expert Systems with Applications, Volume 213, Part B. ISSN: 0957-4174

Martin Kaltenböck, Artem Revenko, Khalid Choukri, Svetla Boytcheva, Christian Lieske, Teresa Lynn, German Rigau, Maria Heuschkel, Aritz Farwell, Gareth Jones, Itziar Aldabe, Ainara Estarrona, Katrin Marheinecke, Stelios Piperidis, Victoria Arranz, Vincent Vandeghinste, Claudia Borg

Deep Dive Data and Knowledge (2023)

In: Rehm, G., Way, A. (eds) European Language Equality. Cognitive Technologies. Springer, Cham.

Itziar Aldabe, Aritz Farwell, German Rigau, Georg Rehm, Andy Way

Strategic Plans and Projects in Language Technology and Artificial Intelligence (2023)

In: Rehm, G., Way, A. (eds) European Language Equality. Cognitive Technologies. Springer, Cham.

Sarasola, K., I. Aldabe, A. Diaz de Ilarraza, A. Estarrona, A. Farwell, I. Hernáez, E. Navas (2023). Language Report Basque. In: Rehm, G., Way, A. (eds) European Language Equality. Cognitive Technologies. Springer, Cham.

Sara Gracia, Maite Oronoz, Alicia Pérez

Ideiagintza suizidaren identifikazioa sare sozialetan (2023)

IKERGAZTE NAZIOARTEKO IKERKETA EUSKARAZ. UEU. May 17, 18 and 19, 2023. Donostia

Alicia Pérez, Maite Oronoz, Juan Martinez-Romo, Lourdes Araujo

OBSER-MENH: Digital OBSERvatory of MENtal Health in social networks for Healthcare Institutions based on Language Technologies (2023)

Accepted (not published). Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2023) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2023)

Izaskun Aldezabal, María Jesús Aranzabe

Euskararen eredutik hizkuntza-ereduen euskarara (2023)

David Lindemann (ed.), Miren Azkarateri esker onez, 57-75. Bilbo: UPV/EHUko Argitalpen Zerbitzua

Kepa Sarasola, Itziar Aldabe, Nora Aranberri

Enabling additional official languages in the EU for 2025 with language-centred Artificial Intelligence (2023)

Special issue of 'De Europa' journal "Linguistic rights, multilingualism and language varieties in Europe in the age of artificial intelligence", pp. 93-107. Turin, 2023.

Itziar Aduriz, Manex Agirrezabal, Eneko Agirre, Iñaki Alegria, Xabier Arregi, Jose Mari Arriola, Xabier Artola, Arantza Díaz de Ilarraza, Ainara Estarrona, Izaskun Etxeberria, Nerea Ezeiza, Kepa Sarasola

Morfologia Konputazionala Euskaraz, 35 urte (2023)

Lindemann, D. (ed.). Miren Azkarateri esker onez, 15-30. UPV/EHU Argitalpen zerbitzua. Bilbo.

David Lindemann, Aitzol Astigarraga, Marije Bidaguren, Emilio Delgado, Galder Gonzalez, Kepa Sarasola

Inguma eta Wikidata uztartuz, euskarazko zientziaren ezagutza-graforantz (2023)

Lindemann, D. (ed.). Miren Azkarateri esker onez, 15-30. UPV/EHU Argitalpen zerbitzua. Bilbo.

Murali Kondragunta, Olatz Perez-de-Viñaspre, Maite Oronoz

Improving and Simplifying Template-Based Named Entity Recognition (2023)

In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pages 79–86, Dubrovnik, Croatia, May 2023. Association for Computational Linguistics.

Aner Egaña, Itziar Aldabe, Oier Lopez de Lacalle

Exploration of Annotation Strategies for Automatic Short Answer Grading (2023)

The 24th International Conference on Artificial Intelligence in Education, AIED 2023

Igone Zabala

Euskararen erregistro akademikoen garapenaz: hiztegia eta fraseologia (2023)

Lindemann David (ed.) Miren Azkarateri esker onez. Bilbo: UPV/EHUko Argitalpen Zerbitzua: 313-332

Miriam Peña-Zabala, Nagore Martinez-Merino, Mikel Iruskieta

UBE: Hezkuntza komunitatea elkareraginean (2023)

Estructura modular, metodologías activas y compromiso social en innovación educativa universitaria: La experiencia de la Facultad de Educación de Bilbao, UPV/EHU (2011-2021)

Ainara Estarrona, Izaskun Etxeberria, Manuel Padilla-Moyano, Ander Soraluze

Measuring language distance for historical texts in Basque (2023)

Procesamiento del Lenguaje Natural, Revista nº 70, March 2023, pp. 53-61

Paula Ontalvilla, Aitziber Atutxa, Maite Oronoz

Osasun-arloko entitate izendunen etiketatzea (2023)

IkerGazte 2023 - Ikertzaile Euskaldunen Bosgarren kongresua

Ekain Arrieta, Igor Odriozola, Xabier Arregi, Mikel Iruskieta

HABE-IXA euskarazko idazmen-proben corpuseko idazlanen mailakatze automatikoa (2023)

eHizpide 101

Irune Ibarra, Mikel Iruskieta

Intervención individualizada de la transcripción escrita con smartpen y basada en corpus lingüísticos: casos de 2 niños mellizos con trastorno del desarrollo del lenguaje (TDL) (2023)

GRAO 62: Análisis y estudios, pp. 38-49.

Ander Salaberria, Gorka Azkune, Oier Lopez de Lacalle, Aitor Soroa, Eneko Agirre

Image captioning for effective use of language models in knowledge-based visual question answering (2023)

Expert Systems with Applications, 2023, vol. 212, p. 118669.