The main aim of NexusLinguarum is to promote synergies across Europe between linguists, computer scientists, terminologists, and other stakeholders in industry and society, in order to investigate and extend the area of linguistic data science.


We understand linguistic data science as a subfield of the emerging “data science”, which focuses on the systematic analysis and study of the structure and properties of data at a large scale, along with methods and techniques to extract new knowledge and insights from it. Linguistic data science is a specific case, which is concerned with providing a formal basis to the analysis, representation, integration and exploitation of language data (syntax, morphology, lexicon, etc.). In fact, the specificities of linguistic data are an aspect largely unexplored so far in a big data context.


The activities of the Action aim to foster collaboration and knowledge-sharing between the Action members, and include Short-Term Scientific Missions (STSMs), WG meetings, conferences and workshops, training schools, and other dissemination events.


The results of the Action include reports, publications, pointers to relevant systems and resources, as well as collaborations and bridges to related initiatives.


NexusLinguarum is composed of five working groups (WGs 1-5), interoperating and providing mutual feedback between themselves. The core group is responsible for the coordination and management of the whole network, and for the dissemination of its results.


The results of the Action include reports, publications, pointers to relevant systems and resources, as well as collaborations and bridges to related initiatives.

54 documents
  • Armaselu, Florentina, Apostol, Elena-Simona, Khan, Anas Fahad, Liebeskind, Chaya, McGillivray, Barbara, Truică, Ciprian-Octavian, Utka, Andrius, Valūnaitė Oleškevičienė, Giedrė, van Erp, Marieke. (September, 2022). LL(O)D and NLP perspectives on semantic change for humanities research. Zenodo.
  • Chiarcos, Christian, Sérasset, Gilles. (June, 2022). A Cheap and Dirty Cross-Lingual Linking Service in the Cloud. Zenodo.
  • Rosner, Michael, Ahmadi, Sina, Apostol, Elena-Simona, Bosque-Gil, Julia, Chiarcos, Christian, Dojchinovski, Milan, Gkirtzou, Katerina, Gracia, Jorge, Gromann, Dagmar, Liebeskind, Chaya, Oleškevičienė, Giedrė Valūnaitė, Sérasset, gilles, Truicȃ, Ciprian-Octavian. (June, 2022). Cross-Lingual Link Discovery for Under-Resourced Languages. Zenodo.
  • Bajčetić, Lenka, Declerck, Thierry. (July, 2022). UsingWiktionary to Create Specialized Lexical Resources and Datasets. Zenodo.
  • Declerck, Thierry. (July, 2022). Integration of sign language lexical data in the OntoLex-Lemon framework. Zenodo.
  • Seung-Bin Yim, Lenka Bajčetić, Thierry Declerck, John P. McCrae. (July, 2022). EDIE - Elexis DIctionary Evaluation Tool. Zenodo.
  • Gollam Rabby, Farhana Keya, Vojtēc Svátek, Renzo Arturo Alva Principe. (July, 2022). Effect of heuristic post-processing on knowledge graph profile patterns: cross-domain study. Zenodo.
  • Blerina Spahiu, Renzo Arturo Alva Principe, Andrea Maurino. (July, 2022). Profiling Linguistic Knowledge Graphs. Zenodo.
  • Rackevičienė, Sigita, Utka, Andrius, Bielinskeinė, Agnė, Rokas, Aivaras. (April, 2022). Distribution of Terms across Genres in the Annotated Lithuanian Cybersecurity Corpus. Zenodo.
  • Hugo Gonçalo Oliveira. (June, 2022). Exploring Transformers for Ranking Portuguese Semantic Relations. Zenodo.