publications

Aitziber Atutxa, Kepa Bengoetxea, Arantza Diaz de Ilarraza, Mikel Iruskieta

Towards a top-down approach for an automatic discourse analysis for Basque: Segmentation and Central Unit detection tool (2019)

PLoS ONE 14(9): e0221639

Eneko Agirre, Arantxa Otegi, Camille Pradel, Sophie Rosset, Anselmo Peñas, Mark Cieliebak

LIHLITH: Learning to Interact with Humans by Lifelong Interaction with Humans (2019)

Procesamiento Del Lenguaje Natural, vol. 63, pp. 147-150. ISSN: 1989-7553

Y Yaghoobzadeh, K Kann, TJ Hazen, E Agirre, H Schütze

Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Unsupervised Neural Machine Translation, a new paradigm solely based on monolingual text (2019)

Procesamiento del Lenguaje Natural 63 (2019): 151-154.

Mikel Artetxe, Holger Schwenk

Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond (2019)

Transactions of the Association for Computational Linguistics 7 (2019): 597-610.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Bilingual Lexicon Induction through Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5002-5007.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

An Effective Approach to Unsupervised Machine Translation (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 194-203.

Mikel Artetxe, Holger Schwenk

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3197-3203.

Aitor Ormazabal, Mikel Artetxe, Gorka Labaka, Aitor Soroa and Eneko Agirre

Analyzing the Limitations of Cross-lingual Word Embedding Mappings (2019)

Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4990-4995.

Ainara Estarrona, Izaskun Etxeberria, Ander Soraluze, Manuel Padilla-Moyano

Spelling Normalisation of Basque Historical Texts (2019)

Procesamiento del Lenguaje Natural, vol. 63, pp. 59-66

Ander Soraluze, Olatz Arregi, Xabier Arregi, Arantza Diaz de Ilarraza

EUSKOR: End-to-end coreference resolution system for Basque (2019)

PLoS ONE 14(9): e0221801. https://doi.org/10.1371/journal.pone.0221801

Damien Sileo, Camille Pradel, Guillermo Echegoyen, Anselmo Peñas, Arantxa Otegi, Jan Deriu, Mark Cieliebak, Ander Barrena, Eneko Agirre

Matching Words and Knowledge Graph Entities with Meta-Embeddings (2019)

Proceedings of CAp2019, Toulouse (France) pages 34-39

Juan J. Lastra-Díaz, Josu Goikoetxea, Mohamed Ali Hadj Taieb, Ana García-Serrano, Mohamed Ben Aouicha, Eneko Agirre

Reproducibility dataset for a large experimental survey on word embeddings and ontology-based methods for word similarity (2019)

Data in Brief. DOI: https://doi.org/10.1016/j.dib.2019.104432

Juan J. Lastra-Díaz, Josu Goikoetxea, Mohamed Ali Hadj Taieb, Ana García-Serrano, Mohamed Ben Aouicha, Eneko Agirre

A reproducible survey on word embeddings and ontology-based methods for word similarity: linear combinations outperform the state of the art (2019)

Engineering Applications of Artificial Intelligence. Volume 85, October 2019, Pages 645-665. DOI: https://doi.org/10.1016/j.engappai.2019.07.010

José Ramom Pichel, Pablo Gamallo, Iñaki Alegria

Measuring diachronic language distance using perplexity. Application to English, Portuguese and Spanish. (2019)

Natural Language Engineering

José Ramom Pichel, Pablo Gamallo, Iñaki Alegria

Cross-lingual Diachronic Distance: Application to Portuguese and Spanish (2019)

SEPLN, 2019

Xabier Soto, Olatz Perez de Viñaspre, Maite Oronoz, Gorka Labaka

Leveraging SNOMED CT terms and relations for machine translation of clinical texts from Basque to Spanish (2019)

Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation

Xabier Soto, Olatz Perez de Viñaspre, Gorka Labaka, Maite Oronoz

Neural Machine Translation of clinical texts between long distance languages (2019)

JAMIA (Journal of the American Medical Informatics Association)

Eneko Agirre, Anders Jonsson, Anthony Larcher

Framing Lifelong Learning as Autonomous Deployment: Tune Once Live Forever (2019)

Dialogue Systems and Lifelong Learning special session at Tenth International Workshop on Spoken Dialogue Systems Technology (IWSDS)

Iñigo Lopez-Gazpio, Montse Maritxalar, Mirella Lapata, Eneko Agirre

Word n-gram attention models for sentence similarity and inference (2019)

Expert Systems with Applications. Volume 132, 15 October 2019, Pages 1-11. https://doi.org/10.1016/j.eswa.2019.04.054.

Anselmo Peñas, Mathilde Veron, Camille Pradel, Arantxa Otegi, Guillermo Echegoyen, Alvaro Rodrigo

Continuous Learning for Question Answering (2019)

Proceedings of the 10th International Workshop on Spoken Dialog Systems (IWSDS 2019) - DSLL Special Session

Jan Deriu, Alvaro Rodrigo, Arantxa Otegi, Guillermo Echegoyen, Sophie Rosset, Eneko Agirre, Mark Cieliebak

Survey on Evaluation Methods for Dialogue Systems (2019)

Pre-print available at arXiv:1905.04071

Mikel Iruskieta

CLARIN Europako sarea: eHumanitateak eta zientzia sozialak lankidetzarako behar duten hizkuntza-azpiegitura sortzen (2019)

Humanitate digitalak: aukerak, erakundeen rol berriak eta elkarlana. UEU. 2019ko ekainaren 20a. Bilbo. URL: https://www.youtube.com/watch?v=EEAwzxPL4GA&t=1346s

Mikel Iruskieta, Abel Camacho

Euskararen irakaskuntza eta IKTak (2019)

Bikaintasuna Euskal Ikasketetan IX. Euskara eta euskal kultura eremu digitalean. Course download: www.labur.eus/cZjwN

Itziar Gonzalez-Dios, Izaskun Etxeberria

Kultura digitalizatua: Europa eta Euskal Herria (2019)

Bikaintasuna Euskal Ikasketetan IX. Euskara eta euskal kultura eremu digitalean

Gorka Urbizu, Ander Soraluze, Olatz Arregi

Deep Cross-Lingual Coreference Resolution for Less-Resourced Languages: The Case of Basque (2019)

Proceedings of the 2nd Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2019), co-located with NAACL 2019

Begoña Altuna, Maria Jesus Aranzabe, Arantza Diaz de Ilarraza

EusTimeBank-TL corpusa: denbora-informaziodun testuetatik denbora-lerroetara (2019)

Olatz Arbelaitz, Urtzi Etxeberria, Ainhoa Latatu, Miren Josu Ormaetxebarria (eds.), III. Ikergazte. Nazioarteko Ikerketa Euskaraz, Giza Zientziak eta Artea (vol. 1), 83-90. Udako Euskal Unibertsitatea (UEU). Bilbo. ISBN: 978-84-8438-682-7

Itziar Aldabe, Josu Aztiria, Francho Beltrán, Myriam Bras, Klara Ceberio, Itziar Cortes, Jean-Baptiste Coyos, Benaset Dazeas, Louise Esher, Gorka Labaka, Igor Leturia, Kepa Sarasola, Aure Séguier, Jean Sibille

LINGUATEC: Desarrollo de recursos lingüísticos para avanzar en la digitalización de las lenguas de los Pirineos (2019)

Procesamiento del Lenguaje Natural (forthcoming). ISSN 1989-7553

Joseba Fernandez de Landa, Rodrigo Agerri, Iñaki Alegria

Large Scale Linguistic Processing of Tweets to Understand Social Interactions among Speakers of Less Resourced Languages: The Basque Case (2019)

Information (MDPI), 10(6), 212. doi: 10.3390/info10060212 https://www.mdpi.com/2078-2489/10/6/212

Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, Alicia Pérez, Xabier Soto

Measuring the Effect of Different Types of Unsupervised Word Representations on Medical Named Entity Recognition (2019)

International Journal of Medical Informatics (https://doi.org/10.1016/j.ijmedinf.2019.05.022)

Aitziber Atutxa, Arantza Diaz de Ilarraza, Koldo Gojenola, Maite Oronoz, Olatz Perez de Viñaspre

Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.015 Link to publication: https://authors.elsevier.com/c/1ZANI4xGJ~syOE

Amir Zeldes, Debopam Das, Erick G. Maziero, Juliano A. Desiderato, Mikel Iruskieta

Proceedings of the Workshop on Discourse Relation Parsing and Treebanking 2019 (2019)

Proceedings of Discourse Relation Parsing and Treebanking (DISRPT2019), pages 1–168. Minneapolis, MN, June 6, 2019. ACL

Amir Zeldes, Debopam Das, Erick G. Maziero, Juliano D. Antonio, Mikel Iruskieta

The DISRPT 2019 Shared Task on Elementary Discourse Unit Segmentation and Connective Detection (2019)

Proceedings of Discourse Relation Parsing and Treebanking (DISRPT2019), pages 144–152. Minneapolis, MN, June 6, 2019. ACL

Mikel Iruskieta, Kepa Bengoetxea, Aitziber Atutxa, Arantza Diaz de Ilarraza

Multilingual segmentation based on neural networks and pre-trained word embeddings (2019)

Proceedings of Discourse Relation Parsing and Treebanking (DISRPT2019), pages 125-133. Minneapolis, MN, June 6, 2019. ACL

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta

Towards discourse annotation and sentiment analysis of the Basque Opinion Corpus (2019)

Proceedings of Discourse Relation Parsing and Treebanking (DISRPT2019), pages 144–152. Minneapolis, MN, June 6, 2019. ACL

Mikel Iruskieta, Chloé Braud

EusDisParser: improving an under-resourced discourse parser with cross-lingual data (2019)

Proceedings of Discourse Relation Parsing and Treebanking (DISRPT2019), pages 62–71. Minneapolis, MN, June 6, 2019. ACL

Gorka Urbizu, Ander Soraluze, Olatz Arregi

Neurona-sareetan oinarritutako euskararako korreferentzia-ebazpena (2019)

III. Ikergazte: Nazioarteko ikerketa euskaraz. pp. 141-147, Baiona. ISBN 978-84-8438-686-5

Alberto Poncelas, Kepa Sarasola, Meghan Dowling, Andy Way, Gorka Labaka, Iñaki Alegria

Adapting NMT to caption translation in Wikimedia Commons for low-resource languages (2019)

SEPLN 2019

Manex Agirrezabal, Begoña Altuna, Lara Gil-Vallejo, Josu Goikoetxea, Itziar Gonzalez-Dios

Creating vocabulary exercises through NLP (2019)

Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, CEUR-WS, vol. 2364, pp. 18-32. ISSN:1613-0073. http://ceur-ws.org/Vol-2364/ http://ceur-ws.org/Vol-2364/2_paper.pdf

Mikel Iruskieta, Arantxa Otegi, Larraitz Uria, Arantza Diaz de Ilarraza, Amaia Artolazabal

Zer i(ra)kas dezakegu geure corpusekin "jolastuz"? (2019)

Traineru bete lagun: Iñaki Gaminde omenduz. UPV/EHU. pp. 35-66.

Sara Santiso, Alicia Pérez, Arantza Casillas

Smoothing dense spaces for improved relation extraction between drugs and adverse reactions (2019)

International Journal of Medical Informatics https://doi.org/10.1016/j.ijmedinf.2019.05.009

Maria Jesus Aranzabe, Aitziber Atutxa, Kepa Bengoetxea, Arantza Díaz de Ilarraza, Iakes Goenaga, Koldo Gojenola, Larraitz Uria

Ekaia, 35, 2019, 291-307. (https://doi.org/10.1387/ekaia.19745). ISSN 0214-9001 e-ISSN:2444-3255

Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

Euskarazko denbora-informazioaren azterketa tratamendu automatikorako (2019)

In Itziar Aduriz and Ruben Urizar (eds.), Hizkuntzalari euskaldunen III. topaketa. Zer berri?, 135-148. Bilbo: Udako Euskal Unibertsitatea (UEU). ISBN: 978-84-8438-679-7

Itziar Gonzalez-Dios

Nautikako terminologia biltzen testu-generoak abiapuntu: nabigazio-egunerokoen eredua (2019)

Hizkuntzalari euskaldunen III. topaketa. Zer berri?. ed. Itziar Aduriz, Ruben Urizar. 79-91. Udako Euskal Unibertsitatea.

Izaskun Etxeberria, Iñaki Alegria, Larraitz Uria

Weighted finite-state transducers for normalization of historical texts (2019)

Natural Language Engineering 25 (2), 307–321 https://doi.org/10.1017/S1351324918000505

Mikel Iruskieta, Montse Maritxalar

Euskaraz i(ra)kasteko baliabideen eta tresnen garapena: aukerak eta mugak, hizkuntza-teknologietatik begiratuta (2019)

Zornotzako Barnetegia. URL: http://aurtenbai.eus/worldcafea.html

Udane Beaskoetxea, Mikel Iruskieta

Ipuin-moldaketa herri-hizkerara egokitzeko, aldatzeko eta modu esanguratsuan kontatzeko markaketa: Ahozko komunikazioa lantzen eta aztertzen Haur Hezkuntzako gelan (2019)

Tantak

Mikel Iruskieta, Jose Mari Arriola

Tresna digitalak hizkuntzak eta gramatikak ikertzeko eta irakasteko (2019)

II Jornadas GrOC/GaiGram 2019. https://hittlinguistics.wixsite.com/groc-euskalherria

Arantza Diaz de Ilarraza, Mikel Iruskieta

Ayuda de las tecnologías lingüísticas en la investigación en Humanidades Digitales (2019)

XVI Simposio Internacional de Comunicación Social (XVI-SICS)

Mikel Iruskieta, Arantza Diaz de Ilarraza

Tecnologías del lenguaje para la enseñanza e investigación en Humanidades Digitales (2019)

Universidad de La Habana

Diez Gaspon, I., Saratxaga, I., Lopez de Ipiña, K.

Deep Learning For Natural Sound Classification

Proc. Internoise 2019

Cooke, M., King, S., Hazan, V., Stylianou, Y., Janse, E., Baskent, D., Hohmann, V., Winneke, A., Hernaez, I.

Enriched communication across the lifespan

Procesamiento del Lenguaje Natural, Vol. 63, 2019, pp. 175-178

Serrano, L., Raman, S., Tavárez, D., Navas, E., Hernaez, I.

Parallel vs. Non-Parallel Voice Conversion for Esophageal Speech

Proc. Interspeech 2019, 4549-4553, DOI: 10.21437/Interspeech.2019-2194.

Raman, S., Serrano, L., Winneke, A., Navas, E., Hernaez, I.

Intelligibility and Listening Effort of Spanish Oesophageal Speech

Applied Sciences, 9(15), 3233, 2019. ISSN 2076-3417. IF: 2.217

Sarasola, X., Navas, E., Tavárez, D., Serrano, L., Saratxaga, I., Hernaez, I.

Application of Pitch Derived Parameters to Speech and Monophonic Singing Classification

Applied Sciences, 9(15), 3140, 2019