---
title: "NoDaLiDa 2023"
subtitle: "22nd-24th May 2023. Tórshavn, Faroe Islands"
author: "\\small Arianna Masciolini, Felix Morger, Ricardo Muñoz Sánchez"
theme: "lucid"
logo: "gu.png"
date: "15/06/2023"
---
## The trip

## Old and new friends

## Tórshavn

## The Nordic House

## TODO: some nature

## Workshops
- RESOURCEFUL-2023 (Dana and Felix)
- NLP4CALL (David and Elena)
- Constraint Grammar Workshop
## RESOURCEFUL-2023
- 8 talks, 10 posters
- Keynotes:
- Jörg Tiedemann. _Democratizing Machine Translation with OPUS and OPUS-MT_
- Darja Fišer. _The role of the CLARIN research infrastructure in the era of data-intensive language studies_
- Panel discussion
## NLP4CALL
- 12th edition
- emphasis on GED/GEC
- shared task

## The conference
- 23rd-24th May 2023
- Keynotes:
- Georg Rehm. _Towards Digital Language Equality in Europe: An Overview of Recent Developments_
- Marta Costa-Jussà. _No-language-left-behind: Scaling Human-Centered Machine Translation and Toxicity at Scale_
- Invited talk: Hjalmar Petersen: _Aspects of the Structure of Faroese_
## SBX at NoDaLiDa 2023
- \footnotesize __Felix Morger__. _Are There Any Limits to English-Swedish Language Transfer? A Fine-grained Analysis Using Natural Language Inference_
- __Elena Volodina__. Christopher Bryant, Andrew Caines, Orphée De Clercq, Jennifer-Carmen Frey, Elizaveta Ershova, Alexandr Rosen and Olga Vinogradova, _MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection_
- __Elena Volodina__, __Yousuf Ali Mohammed__, __Aleksandrs Berdicevskis__, __Gerlof Bouma__ and Joey Öhman. _DaLAJ-GED - a dataset for Grammatical Error Detection tasks on Swedish_
- __Arianna Masciolini__. _A query engine for L1-L2 parallel dependency treebanks_
- __Niklas Zechner__. _Length Dependence of Vocabulary Richness_
- __Aleksandrs Berdicevskis__. Viktor Erbro, _You say tomato, I say the same: A large-scale study of linguistic accommodation in online communities_
- __Dimitrios Kokkinakis__, __Ricardo Muñoz Sánchez__ and Mia-Marie Hammarlin. _Scaling-up the Resources for a Freely Available Swedish VADER_
## TODO: Ricardo
### Dyslexia paper 👀
Marina Björnsdóttir, Nora Hollenstein, Maria Barrett.
_Dyslexia Prediction from Natural Reading of Danish Texts_
### Turns out normal ASR does not work well with children 🧒👶
Agnes Luhtaru, Rauno Jaaska, Karl Kruusamäe, Mark Fishel.
_Automatic Transcription for Estonian Children’s Speech_
## Arianna's digest
### Evaluation of UD parsers on a specific domain
Sara Stymme, Carin Östman and David Håkansson. _Parser Evaluation for Analyzing Swedish 19th-20th Century Literature_
### A DaLAJ-like dataset
David Samuel and Matias Jentoft. _NoCoLa: The Norwegian Corpus of Linguistic Acceptability_
## Felix' impressions
### Benchmarks
- _Standardized datasets: NorBench – A Benchmark for Norwegian Language Models_ (David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina)
- _ScandEval: A Benchmark for Scandinavian Natural Language Processing_
(Dan Saattrup Nielsen)
### Translationese
- _Machine vs. Human: Exploring Syntax and Lexicon in German
Translations, with a Spotlight on Anglicisms_ (Anastassia Shaitarova, Anne Göhring, Martin Volk)
## Thank you!
