# TU/e master project pitches
## Pitches for this cycle
1. Improving knowledge graph completeness with schemas: Wikidata and ShEx
In the collaboratively built knowledge base [Wikidata](https://www.wikidata.org/wiki/Wikidata:Main_Page) some editors would appreciate suggestions of how to improve the completeness of items. Currently some community members use an existing tool, [Recoin](https://www.wikidata.org/wiki/Wikidata:Recoin), described in [this paper](https://doi.org/10.1145/3184558.3191641), to get suggestions of relevant properties to use to contribute additional statements. This process could potentially be greatly improved by leveraging the knowledged captured in [prexisting schemas](https://book.validatingrdf.com/index.html). We propose to study and develop extensions to Recoin that would generate relevant properties based on a ShEx schema from [Wikidata's schema namespace](https://www.wikidata.org/wiki/Wikidata:Database_reports/EntitySchema_directory).
This work will help improve the quality and completeness of Wikidata, as a global open knowledge resource for humanity.
Futher reading:
* Vevake Balaraman, Simon Razniewski, Werner Nutt. Recoin: Relative Completeness in Wikidata. WWW (Companion Volume) 2018: 1787-1792.
* Michael Luggen, Julien Audiffren, Djellel Eddine Difallah, Philippe Cudré-Mauroux. Wiki2Prop: A Multimodal Approach for Predicting Wikidata Properties from Wikipedia. WWW 2021: 2357-2366.
2. Explaining schema conformance for knowledge graphs: conformance reporting for WikiProjects members
[Wikidata](https://www.wikidata.org/wiki/Wikidata:Main_Page) is an open collaboratively built knowledge base. In the Wikidata community groups of editors who share interest in specific topics form [WikiProjects](https://www.wikidata.org/wiki/Wikidata:WikiProjects). As part of their regular work, members of WikiProjects would like to regularly test the conformance of entity data in Wikidata against [schemas](https://book.validatingrdf.com/index.html) for [entity classes](https://www.wikidata.org/wiki/Wikidata:Database_reports/EntitySchema_directory). We propose to study schema conformance checking for WikiProjects and develop a tool that would generate weekly conformance reports in tabular form for schemas of interest to WikiProjects. These tables would report overview statistics on the subset of entities covered by the schema and detailed information on items that are not in conformance. For non-conformant data the table would provide understandable rationale text for why data is not in conformance. Time permitting, we would also consider the evolution of schema conformance over time within projects as data and/or schemas are updated, and support for inspecting and explaining the temporal dynamics of conformance.
This work will help improve the quality and completeness of Wikidata, as a global open knowledge resource for humanity.
Wikidata logo for website:
https://commons.wikimedia.org/wiki/File:Wikidata-logo-en.svg
## Pitches for another time
3. Conformance Reports for External Data Providers
Data providers external to the Wikidata community would like to know how data from their organization is changing after contribution to Wikidata. We propose a ShEx schema-driven conformance report that provides an overview of data covered by the schema. The report will detail items that have not changed as well as items that have changed and provide information on how they have changed
Note: This could potentially overlap work on the Mismatch Finder