publications

Mikel Artetxe, Gorka Labaka, Iñigo Lopez-Gazpio, Eneko Agirre

Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation (2018)

Proceedings of the 22nd Conference on Computational Natural Language Learning (CoNLL 2018), pages 282–291. Brussels, Belgium, October 31 - November 1, 2018. Best paper award

Alicia Pérez, Aitziber Atutxa, Arantza Casillas, Koldo Gojenola, Álvaro Sellart

Inferred joint multigram models for medical term normalization according to ICD (2018)

International Journal of Medical Informatics. Volume 110, February 2018, Pages 111–117

Jorge Pérez, Alicia Pérez, Arantza Casillas, Koldo Gojenola

Cardiology record multi-label classification using Latent Dirichlet Allocation (2018)

Computer Methods and Programs in Biomedicine https://doi.org/10.1016/j.cmpb.2018.07.002

Aitziber Atutxa, Arantza Casillas, Nerea Ezeiza, Iakes Goenaga, V. Fresno, Koldo Gojenola, R. Martinez, Maite Oronoz, Olatz Perez-de-Viñaspre

IxaMed at CLEF eHealth 2018 Task 1: ICD10 Coding with a Sequence-to-Sequence approach (2018)

CLEF 2018 Online Working Notes. CEUR-WS

Begoña Altuna, Maria Jesús Aranzabe, Arantza Diaz de Ilarraza

Adapting TimeML to Basque: Event Annotation (2018)

In Gelbukh A. (eds.) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science (LNCS, vol 9624), 565-577. Springer, Cham. DOI https://doi.org/10.1007/978-3-319-75487-1_43; Print ISBN 978-3-319-75486-4; Online ISBN 978-3-319-75487-1

Klara Ceberio, Itziar Aduriz, Arantza Díaz de Ilarraza and Ines Garzia-Azkoaga

Coreferential Relations in Basque: The Annotation Process (2018)

J Psycholinguist Res (2018) 47, Issue 2. Pages 325-342. https://doi.org/10.1007/s10936-018-9559-6. ISSN 0090-6905. Online ISSN 1573-6555.

Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola

Konbitzul: an MWE-specific Database for Spanish-Basque (2018)

Proceedings of the 11th Language Resources and Evaluation Conference, Miyazaki, Japan. orrialdeak: pages 2500-2504.

Uxoa Iñurrieta, Itziar Aduriz, Ainara Estarrona, Itziar Gonzalez-Dios, Antton Gurrutxaga, Ruben Urizar, Iñaki Alegria

Verbal Multiword Expressions in Basque corpora (2018)

In the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions (at COLING 2018)

Ainara Estarrona, Izaskun Aldezabal, Arantza Díaz de Ilarraza

How the corpus-based Basque Verb Index lexicon was built (2018)

Language Resources and Evaluation. First Online 05 December 2018. DOI: https://doi.org/10.1007/s10579-018-9440-0. Springer Netherlands

Thierry Etchegoyhen, Eva Martínez, Andoni Azpeitia, Gorka Labaka, Iñaki Alegria, Itziar Cortes, Amaia Jauregi, Igor Ellakuria, Maite Martin eta Eusebi Calonge

Neural Machine Translation of Basque (2018)

EAMT 2018. Alicante.

Mark Stevenson, Eneko Agirre

Word Sense Disambiguation (2018)

The Oxford Handbook of Computational Linguistics 2nd edition (2 ed.) Edited by Ruslan Mitkov. Oxford. ISBN: 9780199573691. DOI of the chapter: 10.1093/oxfordhb/9780199573691.013.28

Ander Barrena, Aitor Soroa, Eneko Agirre

Learning text representations for 500K classification tasks on Named Entity Disambiguation (2018)

The SIGNLL Conference on Computational Natural Language Learning CONLL 2018

Rodrigo Agerri, Yiling Chung, Itziar Aldabe, Nora Aranberri, Gorka Labaka, German Rigau

Building Named Entity Recognition Taggers via Parallel Corpora (2018)

In Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), 7-12 May, 2018, Miyazaki, Japan.

Mikel Laburu, Alicia Pérez, Arantza Casillas, Iakes Goenaga, Maite Oronoz

Can I find information about rare diseases in some other language? (2018)

IEEE International Conference on Bioinformatics and Biomedicine. Artificial Intelligence techniques for Biomedicine and Healthcare. Madrid (December, 2018); ISBN: 978-1-5386-5487-3; Pgs: 2102-2108

Olatz Perez-de-Viñaspre, Maite Oronoz, Natalia Elvira

KabiTermICD: Nested Term Based Translation of the ICD-10-CM into a Minor Language (2018)

Workshop "MultilingualBIO: Multilingual Biomedical Text Processing" of LREC 2018. Proceedings of the workshop. Miyazaki (Japan), 8th May 2018.

Izaskun Aldezabal, Xabier Artola, Arantza Diaz De Ilarraza, Itziar Gonzalez-Dios, Gorka Labaka, German Rigau and Ruben Urizar

Basque e-lexicographic resources: linguistic basis, development, and future perspectives (2018)
file2
(2018)

Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence. https://lexdhai.insight-centre.org/Lex_DH__AI_2018_paper_5.pdf

Rodrigo Agerri, German Rigau

Language independent sequence labelling for Opinion Target Extraction (2018)

Artificial Intelligence, 268 (2018) 85-95

Shuyuan Cao

Using Annotated Discourse Information of a RST Spanish-Chinese Treebank for Translation and Language Learning Tasks (2018)

Universitat Pompeu Fabra

Thierry Etchegoyhen, Eva Martı́nez, Andoni Azpeitia, Iñaki Alegria, Gorka Labaka, Arantxa Otegi, Kepa Sarasola, Itziar Cortes, Amaia Jauregi, Igor Ellakuria, Eusebi Calonge, Maite Martin

QUALES: Estimación Automática de Calidad de Traducción Mediante Aprendizaje Automático Supervisado y No-Supervisado (2018)

Procesamiento del Lenguaje Natural, vol. 61, pp. 143-146. ISSN: 1135-5948

Sara Santiso, Alicia Pérez, Arantza Casillas

Exploring Joint AB-LSTM with embedded lemmas for Adverse Drug Reaction discovery (2018)

IEEE Journal of Biomedical and Health Informatics https://doi.org/10.1109/jbhi.2018.2879744

Abel Camacho, Mikel Iruskieta

Euskararen ikaskuntza-prozesuetan teknologiak baliatzeko estrategiak (2018)

Helduen Euskalduntze Alfabetatzea aurrera begira. Glotodidaktikako VI. Topaketa. BIDEOA: https://www.youtube.com/watch?v=A5fyLb3nYEY&t=0s&list=PLu1XiDfqOVIycUaAn-SmkjOcTV_zUqKY4&index=14

Olatz Perez-de-Viñaspre eta Maite Oronoz

Estandar klinikoen ituzlpen automatikoa (2018)

BAT - Soziolinguistika aldizkaria (108), 3. bol, 64-81 orriak

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Unsupervised Statistical Machine Translation (2018)

In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3632–3642, Brussels, Belgium, October-November. Association for Computational Linguistics.

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Arantza Casillas, Arantza Díaz de Ilarraza, Nerea Ezeiza, Maite Oronoz, Alicia Pérez, Olatz Perez de Viñaspre

A Hybrid Approach For Automatic Disability Annotation (2018)

Proceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018). ISSN 1613-073. Vol-2150. Pages 31-36.

Itziar Cortes, Igor Leturia, Iñaki Alegria, Aitzol Astigarraga, Kepa Sarasola eta Manex Garaio

Massively multilingual accessible audioguides via cell phones (2018)

EAMT 2018 (Project/Product Track)

I. Aduriz, I. Alegria, I. Aldezabal, X. Artola, A. Díaz de Ilarraza, N. Ezeiza, K. Sarasola, R. Urizar

Euskara (batua) ingurune digitalean: bidean ikasiXa eta etorkizuneko erronkak (2018)
file2
(2018)

Euskaltzaindiaren XVII. Biltzarra (Arantzazutik mundu zabalera), Arantzazun. “Euskara batuaren etorkizuneko erronkak ingurune digitalean” saioa. Arantzazu 2018

Ainara Estarrona, Izaskun Aldezabal

Towards a Spatial Annotation Scheme for Basque based on ISO-Space (2018)

Proceedings of the Second Workshop on Corpus-Based Research in the Humanities (CRH-2), 75-84

Itziar Gonzalez-Dios, María Jesús Aranzabe, Arantza Díaz de Ilarraza

The corpus of Basque simplified texts (CBST) (2018)

Language Resources and Evaluation 52. pp. 217-247. DOI 10.1007/s10579-017-9407-6

Sara Santiso, Arantza Casillas, Alicia Pérez, Maite Oronoz

Word embeddings for negation detection in health records written in Spanish (2018)

Soft Computing https://doi.org/10.1007/s00500-018-3650-7 https://link.springer.com/content/pdf/10.1007%2Fs00500-018-3650-7.pdf

Mikel Artetxe, Gorka Labaka, Eneko Agirre, Kyunghyun Cho

Unsupervised Neural Machine Translation (2018)

Sixth International Conference on Learning Representations (ICLR 2018)

Mikel Artetxe, Gorka Labaka, Eneko Agirre

A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings (2018)

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 789–798. Melbourne, Australia, July 15 - 20, 2018. c 2018 Association for Computational Linguistics

Rebecka Weegar, Alicia Pérez, Arantza Casillas, Maite Oronoz

Deep Medical Entity Recognition for Swedish and Spanish (2018)

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine. International Workshop on Biomedical and Health Informatics. Pages 1595-1601. Madrid (December, 2018). ISBN: 978-1-5386-5487-3

Jose Ramón Pichel, Pablo Gamallo, Iñaki Alegria

Measuring language distance among historical varieties using perplexity. Application to European Portuguese. (2018)

VarDial 5: Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (at Coling 2018).

Rodrigo Agerri, Xavier Gómez, German Rigau, Miguel Anxo Solla

Developing New Linguistic Resources and Tools for the Galician Language (2018)

Proceedings of the 11th Language Resources and Evaluation Conference (LREC 2018), pages 2322-2325; 7-12 May, 2018, Miyazaki, Japan.

Rodrigo Agerri, Núria Bel, German Rigau, Horacio Saggion

TUNER: Multifaceted Domain Adaptation for Advanced Textual Semantic Processing (2018)

Procesamiento del Lenguaje Natural, 61, 163-166.

Rodrigo Agerri, German Rigau

imple Language Independent Sequence Labelling for the Annotation of Disabilities in Medical Texts (2018)

Proceedings of the Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018), Diann Track, Sevilla, Spain.

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta

Saying no but meaning yes: negation and sentiment analysis in Basque (2018)

Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

Alberto Poncelas, Kepa Sarasola, Andy Way

The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task (2018)

IWLST. 15th International Workshop on Spoken Language Translation. Bruges, Belgium, 2018

Kepa Sarasola

Hizkuntza-teknologia: aukera bat (2018)

ILS-IMH 25 urte. Industry Language Services-Instituto Maquina herramienta. Elgoibar, 2018-12-12

Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz

Ensembles for clinical entity extraction (2018)

Revista: Procesamiento del Lenguaje Natural, Vol 60, p. 13-20, mar. 2018. ISSN 1989-7553. DOI 10.26342/2018-60-1

Iakes Goenaga, Aitziber Atutxa, Koldo Gojenola, Arantza Casillas, Arantza Díaz de Ilarraza, Nerea Ezeiza, Maite Oronoz, Alicia Pérez, Olatz Perez de Viñaspre

Automatic Misogyny Identification Using Neural Networks (2018)

Proceedings of the Third Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2018). ISSN 1613-073. Vol-2150. Pages 249-254.

Javier Álvez, Itziar Gonzalez-Dios, German Rigau

Towards Word Sense Disambiguation by Reasoning (2018)
file2
(2018)

The fifth Vampire workshop

Igone Zabala, Izaskun Aldezabal, María Jesús Aranzabe, Jose Maria Arriola, Itziar Gonzalez-Dios eta Mikel Lersundi

Corpus-driven Terminology Work for Describing Basque Academic Terminology: the Weaving Terminology Networks programme (TSE programme) (2018)
file2
(2018)

EAFT Summit Poster-saioa

Mikel Iruskieta, Arantxa Otegi

Infraestructuras de PLN y tareas básicas con Python: una introducción práctica (2018)

29ª edición Cursos de Verano UNED. https://www.youtube.com/watch?v=RkKPK9pA2U0.

Lide Aizpurua, Udane Beaskoetxea, Joana Karrillo, Mikel Iruskieta

Ipuinetako pertsonaiak ezaugarritzeko komunikazio estrategiak: irakaslearentzako gida xume ulerterraza (2018)

UZTARO 107, 37--51

Sara Santiso and Arantza Casillas and Alicia Pérez

The class imbalance problem detecting adverse drug reactions in electronic health records (2018)

Health Informatics Journal http://journals.sagepub.com/doi/10.1177/1460458218799470

Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola

Analysing linguistic iniformation about word combinations for a Spanish-Basque Rule-Based Machine Translation system (2018)

Multiword Units in Machine Translation and Translation Technologies, John Benjamins Publishing Company, pp. 41-60.

Izaskun Aldezabal, Maria Jesus Aranzabe

Euskara, bizia duen hizkuntza Ingeniaritza eta Zientzian (2018)

In Gidor Bilbao, Pruden Gartzia eta Mari Karmen Garmendia (koord.), Bai, jauna, bai: fisika euskaraz! Jose Ramon Etxebarria irakaslearen omenez, 189-197. Bilbo: UEU. ISBN: 978-84-8438-650-6

Shuyuan Cao, Iria da Cunha, Mikel Iruskieta

The RST Spanish-Chinese Treebank (2018)

In 12th Linguistic Annotation Workshop (LAW XII), 27th International Conference on Computational Linguistics (COLING 2018): 156--166. Santa Fe, New-Mexico, USA.

Mikel Artetxe, Gorka Labaka, Eneko Agirre

Generalizing and Improving Bilingual Word Embedding Mappings with a Multi-Step Framework of Linear Transformations (2018)

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18) pages 5012-5019.

Javier Alvez, Itziar Gonzalez-Dios, German Rigau

Cross-checking WordNet and SUMO Using Meronymy (2018)

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). pages 4570-4577.

Mario Almagro, Soto Montalvo, Arantza Diaz de Ilarraza, Alicia Pérez

MAMTRA-MED at CLEF eHealth 2018: A Combination of Information Retrieval Techniques and Neural Networks for ICD-10 Coding of Death Certificates (2018)

CLEF 2018 Online Working Notes. CEUR-WS; The full CLEF programme is at: http://clef2018.clef-initiative.eu/index.php?page=Pages/programme.html

Igone Zabala

Euskararen lantze funtzionala eta profesionalen komunikazio-gaitasunen garapena osasun-alorrean (2018)

BAT Soziolinguistika Aldizkaria 108, 2018 (3): 11-34

Nora Aranberri, Jose A. Pascual

Towards a Post-Editing Recommendation System for Spanish-Basque Machine Translation (2018)

Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT 2018), 28–30 May, Alacant, Spain, pp. 21-30.

Nora Aranberri, Jose A. Pascual

Ikasketa automatikoko tekniken erabilgarritasun-azterketa euskararako postediziorako gomendio-sistema eraikitzeko (2018)

EKAIA (2018). DOI: 10.1387/ekaia.19700

Noelia Migueles-Abraira, Rodrigo Agerri and Arantza Diaz de Ilarraza

Annotating Abstract Meaning Representations for Spanish (2018)

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. Editor: Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga. Publisher: European Language Resources Association (ELRA).

Igone Zabala

Euskararen terminologiaren garapena Terminologiaren Teoria Komunikatiboaren argitan (2018)

In Ruben Urizar eta Itizar Aduriz (ed.) Hizkuntzalari Euskaldunen III Topaketa. Zer berri? (argitara bidean)

Igone Zabala

The elaboration of Basque in academic and professional domains. (2018)

In Pia Lane eta Unn Røyneland (ed.) Linguistic Minorities in Europe Online. Kees de Bot, Lenore Grenoble: Mouton de Gruyter. (argitara bidean)

Igone Zabala, Jose Ramon Aiartza, Arturo Apraiz, Arantza Aranburu, Jose Maria Arizmendi, Naiara Arrizabalaga, Asier Eiguren, Julio Garcia, Irantzu Martinez, Juan Carlos Odriozola, Martin Olazar, Miren Josu Omaetxebarria, Maren Ortiz, Osane Oruetxebarria, Nerea Zabala, Olatz Zuloaga

Elaboración interdisciplinar de un libro de estilo y de una rúbrica para la evaluación continua y formativa de las competencias comunicativas en euskera en los grados de la Facultad de Ciencia y Tecnología de la UPV/EHU (2018)

XV Foro de la Evaluación de la Calidad de la Investigación y dela Educación Superior. Santander: (2018) (argitara bidean)

Igone Zabala, Jose Ramon Aiartza, Arturo Apraiz, Arantza Aranburu, Jose Maria Arizmendi, Naiara Arrizabalaga, Asier Eiguren, Julio Garcia, Irantzu Martinez, Juan Carlos Odriozola, Martin Olazar, Miren Josu Omaetxebarria, Maren Ortiz, Osane Oruetxebarria, Nerea Zabala, Olatz Zuloaga

Interdisciplinary training assessment of communication skills for students with Basque as instruction language in the Faculty of Science and Technology at UPV/EHU University (2018)

10th International Conference on Education and New Learning Technologies. Preoceedings. Palma de Mallorca: EDULERN18, 6216-6222.

Igone Zabala

Euskararen lexiko espezializatuaren garapenaz eta harmonizazioaz (2018)

In Gidor Bilbao, Pruden Gartzia eta Mari Karmen Garmendia (koord.), Bai, jauna, bai: fisika euskaraz! Jose Ramon Etxebarria irakaslearen omenez. Bilbo: UEU,

Iñaki Alegria, Nora Aranberri, Mikel Artetxe, Izaskun Etxeberria, Antton Gurrutxaga, Uxoa Iñurrieta, Gorka Labaka, Mikel Lersundi, Igor Leturia, Lluis Màrquez, Aingeru Mayor, Maite Oronoz, Kepa Sarasola, Ruben Urizar

TADEEP: Traducción automática en profundidad (2018)

JORNADAS DE SEGUIMIENTO 2018. Subdivisión de Programas Temáticos Científico Técnicos. Área de Ciencias y TIC. Ministerio de Energia, Industria y Competitividad. Madrid 2018-06-20

Josu Goikoetxea, Aitor Soroa eta Eneko Agirre

Knowledge-Based Systems (KNOSYS). Volume 150, 15 June 2018, Pages 218-230. ISSN: 0950-7051. DOI https://doi.org/10.1016/j.knosys.2018.03.017 Preprint at https://arxiv.org/pdf/1804.08316.pdf

Rebecka Weegar, Alicia Pérez, Hercules Dalianis, Koldo Gojenola, Arantza Casillas, Maite Oronoz

Ensembles for clinical entity extraction (2018)

Procesamiento del Lenguaje Natural

Itziar Gonzalez-Dios, María Jesús Aranzabe, Arantza Díaz de Ilarraza

Euskarazko testu idatzien konplexutasunaren azterketa eta sinplifikazio automatikorako proposamena (2018)
file2
(2018)

Ekaia 34 (ale berezia 2018), 209--223. Euskal Herriko Unibertsitateko Argitalpen Zerbitzua. ISSN: 0214-9001; e-ISSN: 2444-3255

Kepa Bengoetxea, Mikel Iruskieta

A Supervised Central Unit Detector for Spanish (2018)

Procesamiento del Lenguaje Natural 60: 29--36. ISSN 1135-5948. DOI 10.26342/2018-60-3

Begoña Altuna, María Jesús Aranzabe, Arantza Díaz de Ilarraza

An Event Factuality Annotation Proposal for Basque (2018)

In Andrew U. Frank, Christine Ivanovic, Francesco Mambrini, Marco Passarotti and Caroline Sporleder (eds.), Proceedings of the Second Workshop on Corpus-Based Research in the Humanities (CRH-2), 1, 15-24. Published by Gerastree Proceedings, GTP1. Dept. of Geoinformation, TU Wien, Austria. ISBN: 978-3-901716-43-0

Jon Alkorta, Koldo Gojenola, Mikel Iruskieta

SentiTegi: building a semantic oriented Basque lexicon (2018)

Proceedings of CICLing 2018

Antton Gurrutxaga, Iñaki Alegria, Xabier Artola

Caracterización computacional de la idiomaticidad: aplicación a la combinación nombre+verbo en euskera (2018)

Estudios de Lexicología y Lexicografía. Homenaje a Eloína Miyares Bermúdez (book chapter)

Hernaez, I., Navas, E., Municio Martín, J.A., Gomez Suárez, J.

RESTORE Project: REpair, STOrage and REhabilitation of speech

. Proc Iberspeech, 2018

Serrano, L., Tavarez, D., Sarasola, X., Raman, S., Saratxaga, I., Navas, E., Hernaez, I.

LSTM based voice conversion for laryngectomees

Proc Iberspeech, 2018

Odriozola, I., Hernaez, I., Navas, E., Serrano, L., Sanchez, J.

The observation likelihood of silence: analysis and prospects for VAD applications

Proc Iberspeech, 2018

Sarasola, X., Navas, E., Tavarez, D., Serrano, L., Saratxaga, I.

Speech and monophonic singing segmentation using pitch parameters

. Proceedings of IberSPEECH 2018, pp. 147 – 151, 2018.

Odriozola, I., Hernaez, I., Navas, E.

An on-line VAD based on Multi-Normalisation Scoring (MNS) of observation likelihoods

. Expert Systems with Applications, 2018; 110, 52 - 61 - 0957-4174