---
title: "NoDaLiDa 2023"
subtitle: "22nd-24th May 2023. Tórshavn, Faroe Islands"
author: "\\small Arianna Masciolini, Felix Morger, Ricardo Muñoz Sánchez"
theme: "lucid"
logo: "gu.png"
date: "15/06/2023"
---

## The trip

![](img/smyril_horizontal.JPG)

## Old and new friends

![](img/birds.JPG)

## Tórshavn

![torshavn](TODO)

## The Nordic House

![nordic house](TODO)

## TODO: some nature

![nature](img/nature.jpg)

## Workshops

- RESOURCEFUL-2023 (Dana and Felix)
- NLP4CALL (David and Elena)
- Constraint Grammar Workshop

## RESOURCEFUL-2023

- 8 talks, 10 posters
- Keynotes:
    - Jörg Tiedemann. _Democratizing Machine Translation with OPUS and OPUS-MT_
    - Darja Fišer. _The role of the CLARIN research infrastructure in the era of data-intensive language studies_
- Panel discussion

## NLP4CALL

- 12th edition
- emphasis on GED/GEC
- shared task

![](img/shared_task.png)

## The conference

- 23rd-24th May 2023
- Keynotes:
    - Georg Rehm. _Towards Digital Language Equality in Europe: An Overview of Recent Developments_
    - Marta Costa-Jussà. _No-language-left-behind: Scaling Human-Centered Machine Translation and Toxicity at Scale_
- Invited talk: Hjalmar Petersen. _Aspects of the Structure of Faroese_

## SBX at NoDaLiDa 2023

- \footnotesize __Felix Morger__. _Are There Any Limits to English-Swedish Language Transfer? A Fine-grained Analysis Using Natural Language Inference_
- __Elena Volodina__, Christopher Bryant, Andrew Caines, Orphée De Clercq, Jennifer-Carmen Frey, Elizaveta Ershova, Alexandr Rosen and Olga Vinogradova. _MultiGED-2023 shared task at NLP4CALL: Multilingual Grammatical Error Detection_
- __Elena Volodina__, __Yousuf Ali Mohammed__, __Aleksandrs Berdicevskis__, __Gerlof Bouma__ and Joey Öhman. _DaLAJ-GED - a dataset for Grammatical Error Detection tasks on Swedish_
- __Arianna Masciolini__. _A query engine for L1-L2 parallel dependency treebanks_
- __Niklas Zechner__. _Length Dependence of Vocabulary Richness_
- __Aleksandrs Berdicevskis__ and Viktor Erbro. _You say tomato, I say the same: A large-scale study of linguistic accommodation in online communities_
- __Dimitrios Kokkinakis__, __Ricardo Muñoz Sánchez__ and Mia-Marie Hammarlin. _Scaling-up the Resources for a Freely Available Swedish VADER_

## TODO: Ricardo

### Dyslexia paper 👀

Marina Björnsdóttir, Nora Hollenstein, Maria Barrett. _Dyslexia Prediction from Natural Reading of Danish Texts_

### Turns out normal ASR does not work well with children 🧒👶

Agnes Luhtaru, Rauno Jaaska, Karl Kruusamäe, Mark Fishel. _Automatic Transcription for Estonian Children’s Speech_

## Arianna's digest

### Evaluation of UD parsers on a specific domain

Sara Stymne, Carin Östman and David Håkansson. _Parser Evaluation for Analyzing Swedish 19th-20th Century Literature_

### A DaLAJ-like dataset

David Samuel and Matias Jentoft. _NoCoLa: The Norwegian Corpus of Linguistic Acceptability_

## Felix' impressions

### Benchmarks

- _Standardized datasets: NorBench – A Benchmark for Norwegian Language Models_ (David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina)
- _ScandEval: A Benchmark for Scandinavian Natural Language Processing_ (Dan Saattrup Nielsen)

### Translationese

- _Machine vs. Human: Exploring Syntax and Lexicon in German Translations, with a Spotlight on Anglicisms_ (Anastassia Shaitarova, Anne Göhring, Martin Volk)

## Thank you!

![](img/bandanagang.jpg)