publications

Jon Ander Campos, Kyunghyun Cho, Arantxa Otegi, Aitor Soroa, Eneko Agirre, Gorka Azkune

Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning (2020)

Proceedings of the 28th International Conference on Computational Linguistics (COLING). (Pages 2561–2571). Outstanding Paper.

Jan Deriu, Don Tuggener, Pius von Däniken, Jon Ander Campos, Alvaro Rodrigo, Thiziri Belkacem, Aitor Soroa, Eneko Agirre, Mark Cieliebak

Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems (2020)

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). (Pages 3971–3984). Honorable Mention Paper.

Juan J. Lastra-Díaz, Josu Goikoetxea, Mohamed Ali Hadj Taieb, Ana Garcia-Serrano, Mohamed Ben Aouicha, Eneko Agirre, David Sánchez

A large reproducible benchmark of ontology-based methods and word embeddings for word similarity (2020)

Information Systems. Online first.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Translation Artifacts in Cross-lingual Transfer Learning (2020)

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). (Pages 7674–7684).

Jon Ander Campos, Arantxa Otegi, Aitor Soroa, Jan Deriu, Mark Cieliebak, Eneko Agirre

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7302–7314

Ainara Estarrona, Izaskun Etxeberria, Ricardo Etxepare, Manuel Padilla-Moyano, Ander Soraluze

Sintaktikoki etiketatutako euskarazko corpus historikoa eraikitzen (2020)

Fontes Linguae Vasconum 50 urte. Ekarpen berriak euskararen ikerketari. Nuevas aportaciones al estudio de la lengua vasca

Iker de la Iglesia, Mikel Martinez-Puente, Alexander Platas, Iria San Miguel, Aitziber Atutxa, Koldo Gojenola

MEDIA team at the CLEF-2020 Multilingual Information Extraction Task (2020)

Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum Thessaloniki, Greece, September 22-25, 2020.

Uxoa Iñurrieta

Identification and translation of verb+noun multiword expressions: a Spanish-Basque study (2020)

Procesamiento del Lenguaje Natural, 64, pp. 123-126.

Itziar Aduriz, Jose Mari Arriola, Xabier Artola, Zuhaitz Beloki, Nerea Ezeiza, Koldo Gojenola

Morfeus+: Word Parsing in Basque beyond Morphological Segmentation (2020)

Word Structure 13(3), 283-315

Jose Ramom Pichel Campos

Medidas de distância entre línguas baseadas em corpus (2020)

International thesis. Collection of articles.

Kepa Sarasola, Iñaki Alegria, Olatz Perez de Viñaspre

Language Technology for Language Communities: An Overview based on Basque Experience 2020 (2020)

Symposiwm Academaidd Technolegau Iaith Cymru 2020-11-04 // Wales Academic Symposium on Language Technologies 2020-11-04

Itziar Aduriz, Jose Mari Arriola

Testu-corpusen informazio morfosintaktikoaren etiketatze automatikoa hizkuntz ezagutzan oinarrituz: zenbait arazo, hainbat erronka (2020)

Fontes Linguae Vasconum 50 urte. Ekarpen berriak euskararen ikerketari / Nuevas aportaciones al estudio de la lengua vasca.

Amaia Aguirregoitia Martinez, Kepa Bengoetxea Kortazar, Itziar Gonzalez-Dios

Are CLIL texts too complicated? A computational analysis of their linguistic characteristics (2020)

Journal of Immersion and Content-Based Language Education (Available online)

Alberto Blanco, Alicia Pérez, Arantza Casillas

Automatic Classification of Medical Records with Multi-label Classifiers and Similarity Match Coders (2020)

CEUR Workshop Proceedings, Vol 2696 - Working Notes of CLEF 2020 - Conference and Labs of the Evaluation Forum

Alberto Blanco, Olatz Perez de Viñaspre, Alicia Pérez, Arantza Casillas

Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity (2020)

Computer Methods and Programs in Biomedicine, Volume 188, 105264

Alberto Blanco, Alicia Pérez, Arantza Casillas

Extreme multi-label ICD classification: sensitivity to hospital service and time (2020)

IEEE Access, Volume 8, 183534-183545

Alberto Blanco, Alicia Pérez, Arantza Casillas, Daniel Cobos

Extracting Cause of Death from Verbal Autopsy with Deep Learning interpretable methods (2020)

IEEE Journal of Biomedical and Health Informatics

Arantxa Otegi, Jon Ander Campos, Gorka Azkune, Aitor Soroa, Eneko Agirre

Automatic Evaluation vs. User Preference in Neural Textual Question Answering over COVID-19 Scientific Literature (2020)

Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020

S. Santana, A. Pérez, A. Casillas

HapLap at eHealth-KD Challenge 2020 (2020)

Proceedings of the Iberian Languages Evaluation Forum co-located with the 36th Conference of the Spanish Society for Natural Language Processing, IberLEF@SEPLN

Hormaetxe G., Iruskieta M.

Parekoen behaketarekin komunikazio-gaitasuna ebaluatzen: zer dute nahiago ikasleek, errubrika tradizionala ala bideo-behaketa (2020)

e-Hizpide 96

Camacho A., Iruskieta M., Latatu A., Lonbide P.

UEUren Online ikaskuntzarako eredu pedagogikoaren sorrera eta garapena: teoriatik praktikara (2020)

Uztaro

Ibarra, I., Ortube, M., Iruskieta, M.

Loturak landuz: idazketa errazeko programa (2020)

Booktegi.

Mikel Artetxe

Itzulpen automatiko gainbegiratu gabea / Unsupervised Machine Translation (2020)

-

Mikel Artetxe, Gorka Labaka, Noe Casas, Eneko Agirre

Do all roads lead to Rome? Understanding the role of initialization in iterative back-translation (2020)

Knowledge-Based Systems, Volume 206 (online first). Pre-print https://arxiv.org/abs/2002.12867

Eneko Agirre

Cross-Lingual Word Embeddings (Book Review) (2020)

Computational Linguistics 46 (1), 245-248. (https://doi.org/10.1162/COLI_r_00372)

Jan Deriu, Katsiaryna Mlynchyk, Philippe Schläpfer, Alvaro Rodrigo, Dirk von Grünigen, Nicolas Kaiser, Kurt Stockinger, Eneko Agirre, Mark Cieliebak

A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation (2020)

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 897-911.

Xabier Soto, Dimitar Shterionov, Alberto Poncelas, Andy Way

Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation (2020)

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3898–3908. https://www.aclweb.org/anthology/2020.acl-main.359 DOI: 10.18653/v1/2020.acl-main.359

Uxoa Inurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola

Learning about phraseology from corpora: A linguistically motivated approach for Multiword Expression identification (2020)

PLoS ONE 15(8): e0237767. https://doi.org/10.1371/journal.pone.0237767

Jan Deriu, Alvaro Rodrigo, Arantxa Otegi, Guillermo Echegoyen, Sophie Rosset, Eneko Agirre, Mark Cieliebak

Survey on Evaluation Methods for Dialogue Systems (2020)

Artificial Intelligence Review. Online 25 June. Pages 1-56. https://doi.org/10.1007/s10462-020-09866-x

Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondřej Bojar

Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining (2020)

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Pages 255-262

Gorka Azkune, Aitor Almeida, Eneko Agirre

Cross-environment activity recognition using word embeddings for sensor and activity representation (2020)

Neurocomputing (available online 1 September 2020)

José Ramom Pichel, Pablo Gamallo, Marco Neves & Iñaki Alegria

Distância diacrónica automática entre variantes diatópicas do português e do espanhol (2020)

Linguamática, Vol. 12 N. 1, 117–126 ISSN: 1647–0818

C. Pradel, D. Sileo, A. Rodrigo, A. Peñas, E. Agirre.

Question Answering when Knowledge Bases are Incomplete? (2020)

Proceedings of Conference and Labs of the Evaluation Forum.

Sara Santiso, Alicia Pérez, Arantza Casillas, Maite Oronoz

Neural negated entity recognition in Spanish electronic health records (2020)

Journal of Biomedical Informatics (JBI) https://doi.org/10.1016/j.jbi.2020.103419

Javier Álvez, Itziar Gonzalez-Dios, German Rigau

Applying the Closed World Assumption to SUMO-based FOL Ontologies for Effective Commonsense Reasoning (2020)

To appear in the 24th European Conference on Artificial Intelligence, ECAI 2020 (preprint). The ECAI 2020 proceedings, including the main conference and the PAIS papers, will be published open access by IOS Press in the Ebook Series Frontiers in Artificial Intelligence and Applications (FAIA) on August 29.

Arantxa Otegi, Aitor Agirre, Jon Ander Campos, Aitor Soroa, Eneko Agirre

Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque (2020)

Proceedings of The 12th Language Resources and Evaluation Conference, pp. 429–435. European Language Resources Association. ISBN: 979-10-95546-34-4

Itziar Gonzalez-Dios, Kepa Bengoetxea, Amaia Aguirregoitia

LagunTest: A NLP Based Application to Enhance Reading Comprehension (2020)

1st Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI2020), pages 63–69. ISBN: 979-10-95546-44-3 https://www.aclweb.org/anthology/2020.readi-1.10/ https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/READI2020book.pdf

Elena Zotova, Rodrigo Agerri, Manuel Nuñez and German Rigau

Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus (2020)

Language Resources and Evaluation Conference (LREC 2020)

Rodrigo Agerri, Iñaki San Vicente, Jon Ander Campos, Ander Barrena, Xabier Saralegi, Aitor Soroa, Eneko Agirre

Give your Text Representation Models some Love: the Case for Basque (2020)

Proceedings of LREC. Also available at arxiv https://arxiv.org/pdf/2004.00033.pdf

Rodrigo Agerri, German Rigau

Language independent sequence labelling for Opinion Target Extraction (2020)

International Joint Conference on Artificial Intelligence (IJCAI 2020)

Nora Aranberri

With or without you? Effects of using machine translation to write flash fiction in the foreign language (2020)

Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, pp. 165–174, Lisboa, Portugal, November 2020.

Itziar Aduriz, Jose Mari Arriola

Testu-corpusen informazio morfosintaktikoaren etiketatze automatikoa hizkuntz ezagutzan oinarrituz: zenbait arazo, hainbat erronka (2020)

Fontes Linguae Vasconum 50 urte: ekarpen berriak euskararen ikerketari / Nuevas aportaciones al estudio de la lengua vasca. (in press)

Adrián Nuñez-Marcos, Gorka Azkune, Eneko Agirre, Diego López-de-Ipiña, Ignacio Arganda-Carreras

Using External Knowledge to Improve Zero-shot Action Recognition in Egocentric Videos (2020)

International Conference on Image Analysis and Recognition (ICIAR)

Arantxa Otegi, Aitor Soroa, Eneko Agirre, Jon Ander Campos

Cómo gestionar la sobrecarga de información científica sobre COVID-19 (2020)

The Conversation. ISSN 2201-5639. https://theconversation.com/como-gestionar-la-sobrecarga-de-informacion-cientifica-sobre-covid-19-138651

Jose Mari Arriola, Josu Goikoetxea, Mikel Iruskieta

Hizkuntza-teknologiak hizkuntzen ikas-irakaskuntzan: zenbat aukera, hainbat erronka (2020)

e-Hizpide 95: 1–21

Thierry Declerck, Itziar Gonzalez-Dios, German Rigau (editors)

Proceedings of the LREC 2020 Workshop on Multimodal Wordnets (MMWN-2020) (2020)

European Language Resources Association (ELRA), Paris. https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/MMW2020book.pdf ISBN: 979-10-95546-41-2 EAN: 9791095546412

Jon Alkorta, Itziar Gonzalez-Dios

Exploring the Enrichment of Basque WordNet with a Sentiment Lexicon (2020)

Proceedings of the Workshop on Multimodal Wordnets (MMWN-2020), pages 20–24. ISBN: 979-10-95546-41-2 https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/MMW2020book.pdf

Itziar Gonzalez-Dios, Javier Álvez, German Rigau

Towards a Model for Ontologising WordNet Adjectives (2020)

Proceedings of the Workshop on Multimodal Wordnets (MMWN-2020), pages 1–6. ISBN: 979-10-95546-41-2 https://lrec2020.lrec-conf.org/media/proceedings/Workshops/Books/MMW2020book.pdf

Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

EusTimeML: A mark-up language for temporal information in Basque (2020)

Research in Corpus Linguistics 8: 86-104. ISSN 2243-4712. Asociación Española de Lingüística de Corpus (AELINCO) DOI 10.32714/ricl.08.01.06

Kepa Bengoetxea, Itziar Gonzalez-Dios, Amaia Aguirregoitia

AzterTest: Open source linguistic and stylistic analysis tool (2020)

Procesamiento del Lenguaje Natural, 64, 61-68. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6196

Mikel Artetxe, Sebastian Ruder, Dani Yogatama

On the cross-lingual transferability of monolingual representations (2020)

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Mikel Artetxe, Sebastian Ruder, Dani Yogatama, Gorka Labaka, Eneko Agirre

A Call for More Rigor in Unsupervised Cross-lingual Learning (2020)

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Pablo Gamallo, José Ramom Pichel and Iñaki Alegria

Measuring Language Distance of Isolated European Languages (2020)

MDPI Information 2020, 11(4), 181 doi:10.3390/info11040181

Sara Santiso

Adverse Drug Reaction extraction on Electronic Health Records written in Spanish (2020)

Procesamiento del Lenguaje Natural http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6203

Nora Aranberri

Can translationese features help users select an MT system for post-editing? (2020)

Revista Procesamiento del Lenguaje Natural, 64, 93-100.

Mikel Iruskieta, Amaia Arroyo-Sagasta, Abel Camacho, Montse Maritxalar

Teknologia, testuinguru digitala eta konpetentzia digitalak hezkuntzan (2020)

Euskonews 748. ISSN: 1139-3629. URL: http://www.euskonews.eus/zbk/748/teknologia-testuinguru-digitala-eta-konpetentzia-digitalak-hezkuntzan/ar-0748001002E/

Javier Álvez, Itziar Gonzalez-Dios, German Rigau

Towards Word Sense Disambiguation by Reasoning (2020)

Vampire 2018 and Vampire 2019. The 5th and 6th Vampire Workshops. EPiC Series in Computing. Pages 19-29. ISSN: 2398-7340

Jose R. Pichel, Pablo Gamallo, Iñaki Alegria, Marco Neves

A Methodology to Measure the Diachronic Language Distance between Three Languages Based on Perplexity (2020)

Journal of Quantitative Linguistics. DOI 10.1080/09296174.2020.1732177

Oscar Sainz, Oier Lopez de Lacalle, Itziar Aldabe, Montse Maritxalar

Domain Adapted Distant Supervision for Pedagogically Motivated Relation Extraction (2020)

Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). Marseille, France

Andrea Horbach, Itziar Aldabe, Marie Bexte, Oier Lopez de Lacalle and Montse Maritxalar

Linguistic Appropriateness and Pedagogic Usefulness of Reading Comprehension Questions (2020)

Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). Marseille, France

Piroska Lendvai, Sándor Darányi, Christian Geng, Moniek Kuijpers, Oier Lopez de Lacalle, Jean-Christophe Mensonides, Simone Rebora and Uwe Reichel

Detection of Reading Absorption in User-Generated Book Reviews: Resources Creation and Evaluation (2020)

Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). Marseille, France

Oier Lopez de Lacalle, Ander Salaberria, Aitor Soroa, Gorka Azkune and Eneko Agirre

Evaluating Multimodal Representations on Visual Semantic Textual Similarity (2020)

Proceedings of the Twenty-fourth European Conference on Artificial Intelligence, ECAI 2020, June 8-12, 2020, Santiago de Compostela, Spain

Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz

Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches (2020)

BMC Medical Informatics and Decision Making

Thierry Etchegoyhen, Haritz Arzelus, Harritxu Gete, Aitor Álvarez, Inma Hernaez, Eva Navas, Ander González-Docasal, Jaime Osácar, Edson Benites, Igor Ellakuria, Eusebi Calonge, Maite Martin 

MINTZAI: Sistemas de Aprendizaje Profundo E2E para Traduccion Automatica del Habla (2020)

Procesamiento del Lenguaje Natural, 65, 97-100. ISSN 1135-5948