# Evaluation Results - Test Data

**Date:** 07/07/2022

## Notes

<i class="fa fa-file" style="font-size:24px"></i> **[Qualitative report](https://drive.google.com/drive/folders/1ka7MwnU31jILxuhzVysALzZkwdZQIE7d?usp=sharing)**

## 1. Data

In this report, annotation systems are evaluated on single-annotated data from session 5 and on single-annotated and cross-annotated data from sessions 1 to 4.

![](https://i.imgur.com/wVOMza2.png)

## 2. Metrics

### 2.1 Tagging

![](https://i.imgur.com/16O4DBT.png)

### 2.2 Linking

#### 2.2.1 Tag distribution

![](https://i.imgur.com/d7rUVfL.png)

#### 2.2.2 Results

![](https://i.imgur.com/EEnCMed.png)

### 2.3 End-to-end

![](https://i.imgur.com/0OZoAkr.png)

## 3. PR curves

### 3.1 ELQ fine-tuning

#### 3.1.1 Tagging

![](https://i.imgur.com/KKKz7Oq.png)

#### 3.1.2 End-to-end

![](https://i.imgur.com/mXEJXHf.png)

### 3.2 ELQ off-the-shelf

#### 3.2.1 Tagging

![](https://i.imgur.com/vf9zM2O.png)

#### 3.2.2 End-to-end

![](https://i.imgur.com/PVvmTne.png)
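
For readers unfamiliar with how PR curves like those in Section 3 are typically produced, the sketch below sweeps a confidence threshold over scored predictions and records precision and recall at each step. It is illustrative only: the report does not state how its curves were computed, and the function name `pr_curve`, its parameters, and the sample values are hypothetical. It assumes each predicted mention carries a confidence score and a flag saying whether it matched the gold annotation.

```python
from typing import List, Tuple


def pr_curve(
    scores: List[float], is_correct: List[bool], n_gold: int
) -> List[Tuple[float, float, float]]:
    """Return (threshold, precision, recall) points, highest threshold first.

    scores     -- confidence of each prediction (assumed to be available)
    is_correct -- whether each prediction matches a gold annotation
    n_gold     -- total number of gold annotations (recall denominator)
    """
    if n_gold <= 0:
        raise ValueError("n_gold must be positive")
    if len(scores) != len(is_correct):
        raise ValueError("scores and is_correct must have the same length")

    # Rank predictions from most to least confident.
    ranked = sorted(zip(scores, is_correct), key=lambda p: p[0], reverse=True)

    points = []
    true_positives = 0
    for kept, (score, ok) in enumerate(ranked, start=1):
        true_positives += int(ok)
        # Emit one point per distinct score, after all ties are counted.
        is_last_of_tie = kept == len(ranked) or ranked[kept][0] != score
        if is_last_of_tie:
            precision = true_positives / kept
            recall = true_positives / n_gold
            points.append((score, precision, recall))
    return points


if __name__ == "__main__":
    # Made-up values for illustration, not taken from the report.
    for threshold, precision, recall in pr_curve(
        [0.9, 0.8, 0.7, 0.6], [True, False, True, True], n_gold=5
    ):
        print(f"threshold={threshold:.2f} P={precision:.2f} R={recall:.2f}")
```

Lowering the threshold keeps more predictions, so recall can only stay flat or rise while precision may move in either direction. This is why the curves are usually read from the high-precision end toward the high-recall end.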