---
tags: liber-dslib
---

# Important documents

- [LIBER DSlib book of notes](https://hackmd.io/@nehamoopen/liber-dslib)
- [DSLib WG: 2022 Planning](https://hackmd.io/@nehamoopen/Skd-K-yXq)
- [Zotero library](https://www.zotero.org/groups/4344603/liber_dslib/library)
- [Literature Review Working Document](https://docs.google.com/document/d/1cKBWovMKuZQ1SmmTKXE_UrXlSMafhDBl6QfvIEN9krA/edit#heading=h.h9lvnjxod372)

---

# WG Meeting Notes

## 2026-02-04 WG Meeting #37

### Agenda

* Membership change
* Reflection on the LIBER Annual Conference in Lausanne and the upcoming event in Norway.
* Updates on the LIBER AI Task Force (roadmap and landscape study).
* Contributions from the WG on Data Science to the AI Task Force.
* Suggestions for WG activities.
* Any interesting events or publications to share

### Participants

* Athanasia Salamoura
* Peter Kiraly
* Arben Hajra
* Simone Cocchi
* Jez Cope
* Birgit Schmidt

### Membership change

- Angela Vorndran (DNB, Germany) left the group.
- David Tréfás, the new liaison person between the Exec Board and the working groups
- The DSLib web page needs an update to reflect the current list of members.

### LIBER Annual Conferences

Announcement of Digital Scholarship & Data Science Topic Guides https://libereurope.github.io/ds-topic-guides/

our workshop: Uniting Networks: LIBER and RDA Europe Advancing Research Together https://www.rd-alliance.org/news/uniting-networks-liber-and-rda-europe-advancing-research-together/

The questions we discussed at the Knowledge Café part:

- What are your current data management tools and practices? Do you have a wish list?
- How do you manage skill development in your library? Do you organise courses? Do you have funds to support librarians to participate in courses elsewhere? Do you support self-education?
- What are the current or planned collaborations/activities taking place between research data and traditional library holdings? What types of barriers do you see here?
- How do you support practices following FAIR and CARE principles? Do you monitor the level of support? How do you respond when there are improvements needed?

### LIBER AI Task Force

- Several areas are in focus, such as: copyright and law, ethics; security and infrastructures; networking within and beyond LIBER; AI Literacy and Applications.
- DSLib contributions to the AI Task Force:
  - [Area: AI Literacy] - Sharing skills, knowledge and experiences, highlighting the challenging practices, developing resources and materials that can be made openly available, to facilitate good support for AI use, etc.
  - [Area: Application] - Metadata Enrichment Services
  - Organizing workshops/webinars
- Draft position paper about the European Data Union strategy and TDM, and a question from EARE

In the draft, you will find sections highlighted in yellow where you are asked to share real examples of the importance of TDM use in public-private partnerships, and **concrete examples of how licensing schemes have blocked TDM** exceptions in your daily work or the work of your members. Please share your feedback by returning the document with tracked changes or by sending bullet points via email by 10 February.

[Draft position paper](https://share.zbw.eu/s/8LrLkyDaEDLx3AM)

### WG activities

Suggestions for WG activities / Workplan:

- Internal: webinars, presentations, or inviting external guests.
- what activities are other groups involved in?
- collaborations with other groups
- External: contributing to LIBER events with workshops, presentations, etc.
- Institutions can provide a summary of case studies (data science, AI, LLMs, etc.) Ex.
Research Data Management Case Studies https://libereurope.eu/research-data-management-case-studies/

### Interesting events or publications

- Péter Király, on the published book “Library Catalogues as Data” [[Library Catalogues as Data](https://www.routledge.com/Library-Catalogues-as-Data-Research-Practice-and-Usage/Gooding-Terras-Ames/p/book/9781783306589)]. Includes 10 essays from different perspectives, theoretical and practical.

---

## 2025-XX-YY WG Meeting #36

### Agenda

### Participants

### Updates on DS activities

### Updates on work areas

### AOB

## 2025-05-07 WG Meeting #35

### Agenda

* Update on data science activities
* Organisation of webinars
* LIBER AI Task force

### Participants

* Peter Verhaar
* Péter Király
* Annika Lindh
* Simone Cocchi
* Asimina Vlachaki
* Arben Hajra
* Helen Ezeokoli
* Birgit Schmidt

### Update on data science activities

- Annika - Use of the OpenAlex, ORCID, and Crossref APIs
- Arben - working on similar issues related to author identification and citation data harvesting. The process involves identifying authors using ORCID and also leveraging Wikidata as a linking hub to locate PIDs. OpenAlex is used for author clustering, though some issues with accurate identification remain. Additionally, exploring agentic AI and RAG applications. Current efforts include RAG implementations based on catalogue data and AI-powered chatbots for frequently asked questions (FAQs), using Mistral, Ollama, Qdrant, ...
- Péter - book under development (ToC added below), library catalogues as data, editors (M Terras, ...), how to use library catalogues in the context of DH
- Birgit - R&D department recently reported that they are developing an ML/genAI-based subject classification tool for subject librarians. Invited to an event, AI realities for libraries panel, https://www.charleston-hub.com/the-charleston-conference/welcome/charleston-in-between/ - someone interested to take over?
- Peter - ERC project, reusing library data summer school, learning materials

### Webinars

Topics, speaker ideas?

- Open data sharing, Creative Commons, copyright
- AI Workshop of last year's LIBER conference, https://cdsleiden.github.io/exploring-ai/intro.html
- Library catalogues as data: how to extract raw data, normalize, analyze; demos - Peter, Péter, Leo Lahti (U Turku)

### AI task force

Meeting on 14 May 2025, coordinated by Karin.

Ambition to develop a strategic agenda for AI, to be integrated in the next LIBER strategy.

WG chairs are invited: how do the WGs engage with AI?

3 most important AI-related topics, goals and activities over the next 12 months:

- webinar on AI basics and hands-on
- survey on the use of DS in libraries

### Study / paper writing in between the WG meetings

Writing sprint in 2 weeks, Wednesdays at the same time. Peter will send a calendar invite.

### Events

* https://2025.computational-humanities-research.org
* https://www.charleston-hub.com/the-charleston-conference/welcome/charleston-in-between/

### Literature

- Library Catalogues as Data.
Research, Practice and Usage. ToC:
  - Introduction: The Library Catalogue Data Ecosystem
  - Making the Conceptual Concrete: Defining, Describing and Visualising Collective Collections
  - Effects of Open Science and the Digital Transformation on the Bibliographical Data Landscape
  - Data Quality in Library Catalogues and Its Impact on Access, Analysis and Reuse
  - Data Bias and the Natural Language Processing of Metadata
  - ‘Contains Scenes of Mild Peril’: Illuminating the Catalogues of Dark Archives
  - Book Formats, Printing Practices and Reading Habits in Early Modern Europe
  - ‘[S]hut Not Thy Heart, nor Thy Library’: Realising the Potential of Historical Library Borrowing Data
  - ChatGPT for Bibliometrics: Potential Applications and Limitations
  - Using Generative AI to Turn 19th-Century Library Catalogues into Data: Applications and Limitations
  - A Corpus Linguistic Analysis of Catalogue Data: Understanding Curatorial Practice Over Time
- Walder, A. (2025). Künstliche Intelligenz in der Literaturrecherche. Bibliothek Forschung und Praxis, 49(1), 109–120. https://doi.org/10.1515/bfp-2024-0063
- Corey Davis (2025) Unlocking Web Archives: LLMs, RAG, and the Future of Digital Preservation.
https://web.uvic.ca/~coreyd/LLM_RAG.pdf
- 2025 Library Systems Report https://americanlibrariesmagazine.org/2025/05/01/2025-library-systems-report/
- Transforming Metadata: Getting ready for AI https://www.oclc.org/research/presentations/2024/ALA-2024-transforming-metadata-ready-for-ai.html
- Sarah Oberbichler, Cindarella Petz: Working Paper: Implementing Generative AI in the Historical Studies (2025) https://zenodo.org/records/14924737

Two conferences:

- SemDH 2025 – Second International Workshop of Semantic Digital Humanities https://semdh.github.io/
- DH 2025 Programme https://dh2025.adho.org/browse-the-program-agenda/

## 2025-04-02 WG Meeting #34

### Agenda

* DS Topic Guides: Guest presentation by Nora McGregor (British Library)
* Report on the landscape study
* LIBER 2025 Conference: Workshops and papers
* Setting the objectives for 2025

### participants

* Nora McGregor (guest)
* Peter Verhaar
* Annika Lindh
* Arben Hajra
* Helen Ezeokoli
* Jez Cope
* Simone Cocchi
* Péter Király

### DS Topic Guides: Guest presentation by Nora McGregor (British Library)

https://libereurope.github.io/ds-topic-guides/

Source at GitHub: https://github.com/libereurope/ds-topic-guides

Issue queue: https://github.com/libereurope/ds-topic-guides/issues

### LIBER 2025 Conference: Workshops and papers

Accepted submissions:

- Presentation: DS Topic Guides (with Digital Scholarship WG)
- Workshop: Uniting Networks: LIBER and RDA Europe Advancing Research Together https://docs.google.com/document/d/1WmmrsopOINTDhfeFCXHxik64EfFi4q3d9e6paidRh6s/edit?tab=t.0

### Setting the objectives for 2025

LIBER Strategy 2023-2027 https://libereurope.eu/strategy/

our overview report: Landscape Study “Data Science in Academic Libraries” https://docs.google.com/document/d/1_ZlIO2hS7tLSpa1nL-AhDT952miON495T32t0vENDQo/edit?tab=t.0#heading=h.rxm46to2dpwz

Where should we publish it? LIBER Quarterly would be a natural choice.

Annika: start the meetings with a round of the participants.
One sentence or two about current activities as an ice breaker to start discussion.

Jez: data science in Humanities network formed during the pandemic in connection with software ?? initiative: Data science for Galleries, Libraries, Archives & Museums (GLAM) https://glamdatasci.network/

Peter: there is a similar Dutch initiative. Organising two workshop days on applying DS in libraries. Python, linked open data, etc.

Annika: discussion on what your challenges are.

In the next meeting we can discuss what topic we should work on.

### sharable links

AI’s role in the future of library services (Clarivate, 2025) https://discover.clarivate.com/Generative_AI_and_the_future_of_library_services

Marshall Breeding: Harnessing the power of AI in academic libraries (the summary of the Clarivate report) https://librarytechnology.org/pr/31209

Generative AI for library and information professionals (IFLA AI SIG, 2024) https://www.ifla.org/g/ai/generative-ai/

Peter E. Murray: Generative AI in Libraries (Thursday Threads, Issue 109, February 27, 2025) https://dltj.org/article/issue-109-llm-library/

data publications:

Kruusmaa, K., Tinits, P., & Nemvalts, L. (2025). Curated Bibliographic Data: The Case of the Estonian National Bibliography. Journal of Open Humanities Data, 11: 16, pp. 1–15. DOI: https://doi.org/10.5334/johd.280

Et Kgl. Bibliotek bibliografisk datasæt med bl.a. danske monografier fra ca.
1600-1900 https://loar.kb.dk/handle/1902/49106

## 2025-01-15 WG Meeting #33

### Participants

apologies: Jez Cope, Kiera McNeice

- Péter Király

### Agenda

- topic related news
- proposals for the next LIBER conference (Lausanne, 2-4 July, https://liberconference.eu/)
- writing sprint for the landscape study “Data Science in Academic Libraries” (https://docs.google.com/document/d/1_ZlIO2hS7tLSpa1nL-AhDT952miON495T32t0vENDQo/edit?tab=t.0)

### topic related news

- Fantastic Futures 2024 conference videos are available: https://www.nfsa.gov.au/fantastic-futures-conference-canberra-2024

## 2024-12-04 WG Meeting #32

### Participants

- Péter Király
- Annika Lindh
- Arben Hajra
- Asimina
- Simone Cocchi
- Yusuf Ozkan
- Birgit Schmidt

### Agenda

- working on the paper

### paper / study

- [draft document](https://docs.google.com/document/d/1_ZlIO2hS7tLSpa1nL-AhDT952miON495T32t0vENDQo/edit?tab=t.0#heading=h.rxm46to2dpwz)
- suggestions: create a folder for the paper
- survey - presented at the LIBER2023 conference:
  - notes: https://bit.ly/dslib-2023-notes
  - slides: https://bit.ly/dslib-2023
- a dedicated Zotero library ("Landscape Study" inside the group library) to support the paper: https://www.zotero.org/groups/4344603/liber_dslib/collections/TK59IED2
- LIBER Quarterly uses APA-7 citation style (see https://liberquarterly.eu/author-guidelines)
- use British English

### AOB

## 2024-11-06: WG Meeting #31

### Participants

- Annika Lindh
- Arben Hajra
- Asimina Vlachaki
- Bernat Montaña Suñé
- Helen Ezeokoli
- Peter Verhaar
- Simone Cocchi (UniMoRe)
- Birgit Schmidt
- Péter Király

### Agenda

- membership changes
- Do you have any relevant news to share (papers, conferences, events, other activities)?
- DS Essentials Topic Guide Writing Sprint: status report (the next sprint will be on next Tuesday, 5th of November.
Registration: https://forms.office.com/e/iw7FVA0L5c, More details: https://libereurope.github.io/ds-essentials/contributing.html)
- main topic: brainstorming about publishing the results of our survey (and other activities)

### membership change

- Bernat is a final-year LIS student from Barcelona
- Helen is the new liaison person for LIBER

### news

- Peter Verhaar: Leiden symposium on AI and academic publishing (https://www.library.universiteitleiden.nl/news/2024/09/leiden-university--elsevier-symposium-on-ai-and-academic-publishing)
- Annika Lindh: Focus group for COUNTER on generative AI
- Association for Computers and the Humanities: new working group: DH in Libraries
- Call for participants in AI4LAM Focus Group. University of Michigan School of Information Student Group. They will be gathering feedback from AI4LAM community members about the short and long-term goals of the organization. https://forms.gle/QTETa7HxdtJPSwVLA
- Birgit: guild on AI in universities. Topics include: how to handle master's theses. Is there anyone you could suggest as a speaker on the AI in LAM theme?

### brainstorming about publishing the results of our survey

https://docs.google.com/document/d/1_ZlIO2hS7tLSpa1nL-AhDT952miON495T32t0vENDQo/edit?tab=t.0#heading=h.rxm46to2dpwz

Peter: to give a state-of-the-art overview to the LIBER community. The document is a draft created about a year ago.

outline: Introduction, terminology, potential applications of DS (categorised), challenges, list of recommendations.
We should collect all texts/materials such as:

- documents like this
- submitted proposals
- presentations
- workshop materials

Potential target: LIBER Quarterly (https://www.liberquarterly.eu/)

We can set the deadline for the beginning of February.

Would-be contributors:

- Peter
- Arben
- Annika
- Asimina
- Péter

Next steps:

- send around an email asking who would like to contribute
- in two weeks: those who decided to contribute select a section
- in the December meeting: writing sprint

## 2024-09-04: WG Meeting #30

Participants:

- Camilla Lindelöw
- Peter Verhaar
- Arben Hajra
- Angela Vorndran
- Niamh Malin
- Simone Cocchi
- Péter Király

### Agenda

- Discussion of LIBER Summer Conference in Limassol
- LIBER DS Essentials Topic Guides (aka learning hub)
- The current status of the paper review process
- Planning for second half of 2024: Brainstorm about the priorities for the working group. What are you expecting from these meetings, what would be your priorities?
- Proposal for LIBER Winter Event (Maribor, Slovenia on the 26-27 November, https://libereurope.eu/event/liber-winter-event-2024/). Is anyone planning to go?
- Suggestions for paper to discuss next time?
- Would you like to give a presentation about your own data science related activities? Who would you like to see as guest presenter?

### LIBER Summer Conference in Limassol

Peter: beginning of July. Many sessions on AI in libraries. There are recordings at the LIBER YouTube channel (https://www.youtube.com/playlist?list=PLHA3lUmrYM3s4T9REFdAZtibsQkBAO0Y6). The closing session was very interesting. Our workshop was a hands-on, practical introduction to AI. It was fully booked (40 participants). Slides (https://zenodo.org/records/13682212) and Jupyter notebooks (https://cdsleiden.github.io/exploring-ai/) are available. The discussion on the first notebook went well, but the second one, on LLMs and RAG, was longer and took quite a long time to run. Still, the workshop went well, and there was a lot of good feedback.
The slides give a general structure of the workshop and show the most important concepts. Some keywords: vectorisation, deep learning, the origin of bias, object detection, LLM, generative AI, RAG. A Finnish library director would like to set up a workshop for her institution. Conclusion: maybe a workshop should cover slightly fewer concepts, and the LLM/RAG section should run quicker. Nora McGregor and other organisers were in the room, and helped participants if they had questions.

### LIBER DS Essentials Topic Guides

https://libereurope.github.io/ds-essentials/

### The current status of the paper review process

reviews: https://drive.google.com/drive/folders/1jWlqn0DwVPOrSfkVIsb4ns34HpUpSDm0

Zotero library: https://www.zotero.org/groups/4344603/liber_dslib

### Planning for second half of 2024

Camilla: we should approach the Swedish Nat. Lib's data science laboratory. Kris Hafenden (Ü) is their supervisor. Camilla will contact them.

Niamh: there is a new Jisc analytics (?) group in the UK that has a similar profile.

Camilla: group for everybody including beginners or for experts?

Niamh: overview of data collections for librarians. We can use the DS Essentials and write a blog post on the LIBER site.

Peter: experiment about Open Data. The impact of Open Science. OpenAlex, Crossref API. Pitfalls, and advantages.

Camilla: The new Leiden index is based on OpenAlex. The bibliometrics community also has a lot in common with data science. A master's student researches how AI could help libraries, e.g. in metadata enhancement. Librarians/students need actual examples of usage of AI.

Arben: extract keywords/subject, interfaces in search, RAG applications. AI services Dashboard (short demonstration)

### Suggestions for paper to discuss next time

(Péter's pick: Rebecca Sutton Koeser, and Zoe LeBlanc. "Missing Data, Speculative Reading". Cultural Analytics Vol. 9, Issue 2, 2024 May 29.
https://doi.org/10.22148/001c.116926)

### presentation plans

## 2024-06-05: WG Meeting #29

Participants:

- Peter Kiraly, GWDG
- Jez Cope, The British Library
- Niamh Malin, University of Cambridge
- Kiera McNeice, Cambridge University Press
- Birgit Schmidt, SUB Göttingen
- Simone Cocchi, XX
- Barbara van der Vaart, LIBER

### News

OCLC-LIBER “Building for the future” programme - Closing Plenary Session (Thursday 6 June 2024 at 15:00) https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023

Thinking About Culture (not) as Data: a ‘Code & Culture’ talk by Lev Manovich https://www.youtube.com/watch?v=uiUznEEZHzM

Robert Sanderson: Linked Data Enlightenment: Lessons Learned from LUX

- event: https://pro.europeana.eu/event/linked-data-enlightenment-lessons-learned-from-lux
- recording: https://www.youtube.com/watch?v=DEtfWZPJUBg

Text Analysis Pedagogy Institute summer courses https://constellate.org/tap-institute?source=list

LIBER 2024: Pre-Conference Workshop 2 'Exploring AI Hands-On: Shaping the Future of Research Library Services' preparation meeting https://doodle.com/meeting/participate/id/b21wk31a

Digital Scholarship & Data Science Essentials for Library Professionals - Collections as Data: Getting Started https://libereurope.github.io/ds-essentials/collectionsasdata.html

Building a simple Retrieval-Augmented Generation (RAG) pipeline https://colab.research.google.com/drive/1wJyQQ4BgQohJFTObU7qHI1HDTmSWlaVL?usp=sharing

What Is Metadata?
A Discussion with Cyril Heude https://open.spotify.com/episode/748All8hGu4Dg7Wl9tdxOD?si=SEQMVa_USEOkhkUeNN511Q

#### Calls for papers

- Computational Humanities Research CfP Aarhus (Denmark), 2024-12-04/06 http://2024.computational-humanities-research.org/cfp/
- Call for Papers: New special collection of the Journal of Open Humanities Data “Amplifying GLAM Collections: Scalable and Inclusive Data Practices” https://openhumanitiesdata.metajnl.com/announcements#call-for-papers-amplifying-glam-collections-scalable-and-inclusive-data-practices [name=Jez Cope] - deadline for abstract is 1st of August

Birgit:

- The AI literacy working group is in formation phase.
- Report by Royal Society just published on AI application in hard sciences, https://royalsociety.org/news-resources/projects/science-in-the-age-of-ai/

LIBER2024: Kiera, Peter Verhaar,

Next meeting: second half/end of August

## 2024-05-08: WG Meeting #28

Participants:

- Peter Verhaar
- Jez Cope
- Annika Lindh
- Arben Hajra
- Birgit Schmidt
- Kiera
- Péter

Register: LIBER DS Essentials Topic Guide Writing Sprint (Online) Tuesday 14 May 2024, 9:00-11:00 (BST)/10:00-12:00 (CET) via MS Teams https://forms.office.com/Pages/ResponsePage.aspx?id=t0ykIcP5AE-a-r0ejoi82X7ZIEJh0odPq9ej1FnpQEJUNTlTTTQ3WjBWU0gxOTdaNk1WSUpVV1FPRS4u

The book itself: https://libereurope.github.io/ds-essentials/contributing.html

agenda: https://docs.google.com/document/d/1XEzP6LAHQMdUiw0aqSie4e2Ae8WdvN6Fogx-NI6D_vg/edit#heading=h.1sivh53y3qq0

- Welcome and Introduction (10 minutes)
- Explanation of the Process (15 minutes)
- Topic Selection (15 minutes)
- Writing Phase (60 minutes)
- Wrapping Up (15 minutes)
- Closing Remarks (5 minutes)

submission checklist: https://docs.google.com/document/d/1cmiDeTbR16hJPRypcRqnpoR3Rv2_JhKVZaFHZs-K2fA/edit#heading=h.y2qyj0hjzf9h

Annika: the sessions were well prepared.
There is a lot of interest, but most people are at the beginning phase and were surprised to hear that some people are already using AI, and that there were some in-house developments. More on the data science side.

Birgit: AI literacy WG is in infant stage. There will be a call for members. They are shaping the goals. Probably will launch in autumn. EU commission website: [guide for AI in research](https://ec.europa.eu/info/funding-tenders/opportunities/portal/screen/support/news/30086).

Jez: it might be a problem that using [ChatGPT] is so easy

Annika: there is a responsibility on the reviewers' part - sometimes they reject text because it was written by a non-native speaker. ChatGPT helps to improve the language.

Peter: I started on Jupyter notebooks for the LIBER conference (will be available on Google Colab: https://colab.research.google.com/drive/1mwncu3AimoHboIE0J522XcomXcKUtWJL#scrollTo=PRZUbbb-yvor). An overview of LLMs, using the HuggingFace library/model repository. E.g. summary generation, sentiment analysis, named entity recognition.

Arben: use an LLM to convert questions into keywords, then use these in digital library related tasks. Llama 3 from Meta works very well in this context. The templates are configurable. We also use LangChain (https://www.langchain.com/). The evaluation part is very complicated, because you always get different results. The metadata experts created some test case scenarios (including questions, keywords, set of results etc.)

Annika: certain types of biases (e.g. gender biases) happened in the past when working with transformer models. Even if you use Google Translate. Other biases: region, language, disciplines, etc.

## 2024-04-03: WG Meeting #27

Agenda

1. membership change: Neha Moopen (founder and co-chair) left the group
2. relevant news to share
3. upcoming event: OCLC-LIBER session on AI and Machine Learning (Wednesday 17 April)
4. results of the submitted proposals for the LIBER conference
5. learning hub: writing sprint and next steps
6. literature review: next steps

Participants

- Birgit Schmidt
- Arben Hajra
- Yusuf Ozkan
- Simone Cocchi
- Camilla Lindelöw
- Rosie Allison
- Annika Lindh
- Asimina Vlachaki
- Péter Király

### Membership change

- Neha Moopen (founder and co-chair) left the group
- the LIBER Office Liaison Rosie Allison will have a new position at Research Data Alliance, we will have a new contact person in the future

### News

- Survey: integrating volunteer and AI-enriched metadata into collections systems (https://collectivewisdomproject.org.uk/survey-integrating-volunteer-and-ai-enriched-metadata-into-collections-systems/) - by Mia Ridge, previous member of the WG
- Helsinki Digital Humanities Research Seminar – Spring 2024 Schedule (https://www.helsinki.fi/en/digital-humanities/teaching/digital-humanities-research-seminar)
- CfP: Computational Humanities Research, December 4-6, 2024 at Aarhus University, Denmark (http://2024.computational-humanities-research.org/)
- CfP: 28th International Conference on Theory and Practice of Digital Libraries, 24-27 September 2024, Ljubljana, Slovenia (https://tpdl2024.nuk.si/)
- AI in Libraries Network Group - The Conference of European National Librarians (https://www.cenl.org/networkgroups/ai-in-libraries-network-group/)
- AI for humanists – tutorials (https://www.aiforhumanists.com/tutorials/)
- Knowledge Exchange report released: Barker, M., & Chue Hong, N. (2024). Approaches to scaling up reproducibility in research organisations. Zenodo. https://doi.org/10.5281/zenodo.10663903
- New research project in Sweden, AI as a risk and opportunity for the authenticity of archives (https://wasp-hs.org/project/artificial-intelligence-as-a-risk-and-opportunity-for-the-authenticity-of-archives/), with PhD student Johannes Widegren focussing specifically on Sámi people (https://lnu.se/en/staff/johannes.widegren/)

Q that came up: is there anybody with experience of using LLMs for extracting metadata? If you have, please contact Camilla Lindelöw.
### OCLC-LIBER session on AI and Machine Learning

17 April 2024, Peter and others facilitate the discussion

Registration is closed but send a message to Rosie (rosie.allison@libereurope.org) if you would like to join (waiting list)

https://libereurope.eu/event/oclc-liber-building-for-the-future-facilitated-discussion-3-ai-machine-learning/

### Results of the submitted proposals for the LIBER conference

- Digital Scholarship & Data Science Essentials for Library Professionals
  - not accepted
  - evaluation: one very supportive, the other one less positive
- Exploring AI Hands-On: Shaping the Future of Research Library Services
  - accepted as a PRE-CONFERENCE WORKSHOP
  - abstract submitted: https://docs.google.com/document/d/1LI0LeB7qzw51qI_2JC1ZlNlI7WaRiY1s5rOWqKYAsfc/edit

### Learning hub: writing sprint and next steps

The aim of this training resource is to make it easier for librarians to quickly find the most relevant, current and most useful training materials covering foundational topics in digital scholarship and data science in libraries. It aims to present a series of topic guides containing recommendations for specific learning activities. Led by Nora McGregor (British Library)

Edit-a-Thon document https://docs.google.com/document/d/1Yi4jZEUuKcs2RPyyv7-ElUkCUfsN_EHsHSJWjhKMrsc/edit#heading=h.1sivh53y3qq0

GitBook Structure & Topic Guides Edit-a-Thon https://docs.google.com/document/d/1TcrREAmIK6YYWf9AhzNX_beZZk5gxoDLoHXbJEKjkHY/edit#heading=h.haw4xf7q8mfh

Topic Guide: Submission Checklist https://docs.google.com/document/d/1cmiDeTbR16hJPRypcRqnpoR3Rv2_JhKVZaFHZs-K2fA/edit#heading=h.y2qyj0hjzf9h

The output: https://libereurope.github.io/ds-essentials/

an example page: Understanding Large Language Models https://libereurope.github.io/ds-essentials/llms.html

The GitBook source https://github.com/libereurope/ds-essentials

It is based on [Quarto](https://quarto.org/) technology.

### Literature review: next steps

---

## 2024-03-06: WG Meeting #26

Agenda

1. new member: Yusuf Ozkan, Research Outputs Analyst, Library Services, Imperial College London. Athanasia Salamoura left the group.
2. Writing sprint with WG Digital Scholarship about data science learning hub in The Hague, March 12-13 (hybrid event)
3. Workshop on AI planned with RDM working group at the Summer Conference
4. LIBER Carpentries membership
5. OCLC-LIBER Discussion #3: AI, Machine Learning, and Data Science, 17 April 15:30 (https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023)
6. LIBER Winter Event 2024 will take place on 26-27 November at the University of Maribor Library, Slovenia.

Participants

- Péter Király
- Peter Verhaar
- Annika Lindh
- Kiera McNeice
- Niamh Malin
- Rosie Allison
- Simone Cocchi
- Jez Cope
- Birgit Schmidt
- Yusuf Ozkan

### Writing sprint in The Hague at KB

- joint initiative of WG Digital Scholarship and WG DSLib
- purpose: to collect good tutorials, and provide a guide for librarians about DS (in general sense)
- https://nehamoopen.github.io/liber-learning-hub/
- Nora McGregor's initiative (British Lib.)
- some part of the event will be hybrid
- later there will be an online writing event
- Nora will give a talk at the LIBER summer conference

### Yusuf

- scholarly communication team member
- Power BI dashboard
- support academics with bibliometrics, indicators
- previously at the Open Access team
- PhD study at King's College

### Workshop on AI planned with RDM working group at the Summer Conference

- abstract submitted: https://docs.google.com/document/d/1LI0LeB7qzw51qI_2JC1ZlNlI7WaRiY1s5rOWqKYAsfc/edit
- the workshop has been accepted for LIBER conf. in Cyprus
- purpose: hands-on workshop on AI for people with little or no previous knowledge
- AI, Large Language Models
- entity recognition, sentiment analysis, making summaries
- we had a meeting some weeks ago:
- which LLM to use?
Alpaca, Bart, ChatGPT
- creating exercises
- previous presentation: https://docs.google.com/presentation/d/1TwsEOgYIWyOHT3E-MGY_rzwzkANJxOZk/edit?usp=sharing&ouid=109195792506806970866&rtpof=true&sd=true
- datasets: full text, bibliographical data, data about usage of library resources
- hosted notebooks (Google Colab)
- AI for Humanists: https://www.aiforhumanists.com/
- An incubator Carpentries lesson: https://carpentries-incubator.github.io/machine-learning-librarians-archivists/
- Annika Lindh: there is a new plugin in Excel
- Jez: compare very general models with specific tools (trained to ...) -- but it might be too much for a workshop
- Annika Lindh: write a sentence in different tones. Then ask to get a DOI of papers on some specific topics. What do people think about the usage of AI?
- Peter: we asked similar questions in the Winter event

### LIBER Carpentries membership

- basically there is no budget for that this year
- there are instructors already in the community. Maybe we can collect the names at least.
- the office does not have the capacity to organise events

### OCLC-LIBER Discussion #3: AI, Machine Learning, and Data Science, 17 April 15:30 (https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023)

- registration: https://www.oclc.org/oclc-forms/en/events/2024/liber-webinar-ai-machine-learning-data-science.html?_gl=1*15q15mv*_gcl_au*MjA0OTIzMjY0LjE3MDQyMTAyMzc.
- Annika: the format is similar to last time (intro speech + small discussion rooms + outro)
- good balance between ethics and other more technical topics
- no public recordings, and the meeting minutes are also not public
- blog: https://hangingtogether.org/libraries-support-data-driven-decision-making/

### LIBER Winter Event 2024 will take place on 26-27 November at the University of Maribor Library, Slovenia.
- https://libereurope.eu/article/save-the-date-liber-announces-date-and-host-of-2024-winter-event/

### Update on creating an AI WG

Birgit: Discussion at LIBER Exec Board meeting. There is a suggestion from U Bergen to create an AI WG - specific topic: AI literacy, collection management.

### News / Reading

- K. Devlin. How will AI impact mathematics research? MAA blog. 5 March 2024. https://www.mathvalues.org/masterblog/how-will-ai-impact-mathematics-research
- Invitation to the Slack channel: https://join.slack.com/t/liberdslib/shared_invite/zt-2e8ejid6c-C4fGjEi_l7fIbvWLm_gxvg

## 2024-01-17: WG Meeting #25

Agenda

1. Membership change
2. News
3. LIBER-OCLC event series
4. State of literature review
5. LIBER summer conference
6. Reading

Participants

* Péter Király
* Peter Verhaar
* Arben Hajra
* Rosie Allison (LIBER)
* Jez Cope (British Library)
* Annika Lindh

### Membership change

Matthijs de Zwaan (VU University Library) left the WG

- Rosie has removed Matthijs from website and mailing list

### News

* Cultural Heritage Data School applications close on 21 January 2024 -- https://www.cdh.cam.ac.uk/about/news/applications-for-the-next-cultural-heritage-data-school-close-on-21-january-2024/
* Collections as Data funding scheme
* Lawsuits against companies behind large language models
* the LIBER Intellectual Property Rights WG is planning to investigate some legal topics related to LLMs (e.g. regulations regarding training materials).
* there is an EU law regarding text and data mining

### LIBER-OCLC event series

Facilitated discussion 2: Data-driven decision making https://connect.oclc.org/en/oclc-liber-building-for-the-future-series-2023

Wednesday 7 February 2024 at 14:30 (GMT) / 15:30 (CET) (Duration: 90 minutes)

Today we are awash in data.
This session will focus on collaborative data-analytics informing library collection management, contract management and negotiations with publishers, research information management and data-driven decision making in other areas. This session will take place virtually and will be a 90-minute discussion including three breakout sessions focusing on data-driven decision making. There is nothing to prepare in advance, just come as you are, bringing your own knowledge, perspective, and experience to the conversation. Please plan to come to listen, contribute, and participate. This is a rich opportunity for us to learn from each other and to find mutual support for efforts that challenge us all. Note that seating for this session is limited.

https://docs.google.com/document/d/1snpMJGOQwIPXIdq1w8LviHCVVlDqcCUTl_kfvA9Tp14/edit#heading=h.ehlfg6n7v0rt

### State of literature review

The reviews: https://drive.google.com/drive/folders/1jWlqn0DwVPOrSfkVIsb4ns34HpUpSDm0

Peter will create a first draft of the landscape study in March.

### LIBER summer conference

Current draft proposals

* Digital Scholarship & Data Science Essentials for Library Professionals: https://docs.google.com/document/d/1Im3yXEf7QOHWbzO6Qo0d7OqoiRqSQ5ChohyzRzx8QoA/edit
* AI in data management and metadata: https://docs.google.com/document/d/1LI0LeB7qzw51qI_2JC1ZlNlI7WaRiY1s5rOWqKYAsfc/edit

All these are cooperations with other working groups.

Annika: OpenAlex uses language models.

Jez: Recording of the degree of certainty that a given metadata entry is "correct" seems like an increasingly important thing for provenance (Of course, that opens up questions of the accuracy of human-created metadata too...) https://www.loc.gov/marc/bibliographic/bdapndxj.html

### Reading

Alkemade, H., Claeyssens, S., Colavizza, G., Freire, N., Lehmann, J., Neudecker, C., Osti, G., & van Strien, D. (2023). Datasheets for Digital Cultural Heritage Datasets. Journal of Open Humanities Data, 9:17, pp. 1–11.
DOI: https://doi.org/10.5334/johd.124

* Peter: it is not clear what collections they are talking about.
* Annika:
* Jez: FYI it is one of the most viewed/downloaded papers in JOHD's history