research projects

Openminted: Sharing IXA pipes in the OpenMinTeD platform.


Openminted: Sharing IXA pipes in the OpenMinTeD platform.

(2018 - 2018)

The aim of this project is the integration of IXA pipes (http://ixa2.si.ehu.es/ixa-pipes/), a set of ready to use Natural Language Processing (NLP) tools within the OpenMinTeD platform (http://openminted.eu) . The aim of IXA pipes is to provide a modular set of ready to use Natural Language Processing (NLP) tools. Apart from being easy to train and deploy, they are also a good fit for our aim of providing NLP tools for many languages because every module but the tokenizer is machine learning based. In fact, IXA pipes tries to use the same approach across NLP tasks in order to create robust processors both across domains and languages. This strategy has proven to be very successful for several tasks and languages, such as NER and Opinion Target Extraction (OTE), both in out-of-domain and in-domain evaluations. In the project, we will integrate IXA pipes into OpenMinTed, an open platform that will be a gateway to many types of language data, including tagsets, ontologies, publications and corpora. The platform will also offer services and functionalities that are useful for text and data mining, and allow miners to share their tools and build their own workflows. To this end, the IXA pipes modules will be shared as Docker images as well as previous works in doing similar integrating (e.g., the case of Alvis NLP modules).



Webpage:
Organization:  H2020 OPENMINTED
Main researcher: Rodrigo Agerri
Participants
Rodrigo Agerri, German Rigau , Aitor Soroa


Back

HiTZ is made up of the following research groups: