KCL e-Research

@kcl-e-research

https://docs.er.kcl.ac.uk/

Public team

Joined on Sep 11, 2024

  • Where scheduled workshops have a link, this should take you to the relevant SkillsForge page to sign up. Alternatively, all of our events can be seen on the e-Research SkillsForge page. Training calendar Jan Feb Introduction to HPC with CREATE Performance Profiling & Optimisation for Python Introduction to Containers with Docker and Singularity March
     Like  Bookmark
  • Questions from Peter: will they be using Jupyter notebooks? Slides / background we have 90 min for this needs to introduce things like "what is a CPU?", "when is using HPC useful?", "what's a virtual environment?", "what is parallelisation?", etc we can base this on existing material from the Bioinfo MSc and Intro to HPC slides should have more focus on practical applications than theoretical computer science will need to check with Mary/Neil that we're pitching it at the right level and not missing out things that might seem obvious to us!
     Like  Bookmark
  • Slides Motivation feedback on existing services / support increase awareness guide strategy development and future work evidence need for growth increase academic profile Focus on feedback and future workAwareness and outputs will happen if we do it well
     Like  Bookmark
  • 2025-05-07 KCL RSE CommunityWant to host events on other campuses Room booking during term time is difficult - much easier out of term time Next events would be late-August / early-September and another before Christmas break The Berlin Thing (CONVERSE v0.1) Starting 22nd May, roughly fortnightly from then, but irregularDetails available for first 3/4 events Will be a page at Humboldt we can link to
     Like  Bookmark
  • External stakeholders to invite: John Lavelle (KHP Digital Health Hub) Alessandra Vigilante (HAB) Someone from KDL 2025-04-30 Agenda: Introductions for people outside e-Research
     Like  Bookmark
  • Platform: KEATS (moodle)Coderunner plugin?
     Like  Bookmark
  • Friday 11th April 2025 Great Hall and Safra Lecture Theatre, Strand Campus, King's College London Agenda 14:00-14:10 - Introduction and opening address Richard Christie, Director of e-Research 14:10-14:30 - PharosAI: a New Initiative to Democratise Cancer AI Anita Grigoriadis, Professor of Molecular and Digital Pathology, Comprehensive Cancer Centre, KCL 14:30-14:50 - Social Media Data Capture and Analysis with KingsCAT Liliana Bounegru, Senior Lecturer in Digital Media, Culture and Society, Department of Digital Humanities, KCL
     Like  Bookmark
  • // add links for more info Anita Grigoriadis (Professor of Molecular and Digital Pathology, Comprehensive Cancer Centre, KCL) led with an introduction to the recently launched PharosAI project which aims to provide rich multi-modal data and tooling to support the development of AI models in cancer research and healthcare. https://www.kcl.ac.uk/news/platform-research-drug-discovery-ai-powered-cancer-care Up next was Liliana Bounegru (Senior Lecturer in Digital Media, Culture and Society, Department of Digital Humanities, KCL) who spoke about the KingsCAT tool for data collection and analysis of content from a wide range of social media platforms. This tool supports researchers to https://www.kcl.ac.uk/research/kingscat
     Like  Bookmark
  • Course Description: Many research projects involve processing of large datasets or running intensive computational pipelines using high performance computing (HPC) clusters. There are often multiple different data storage locations associated with an HPC cluster, which can make it challenging to understand where and how to best store your data and code. This workshop introduces principles for organising your data and code, as well as tools for moving data around and for managing your code. The course will be structured in an interactive approach, alternating between lecturing, discussion and extended hands-on activity. Live demos will use the CREATE HPC cluster. Learning objectives: After this training, you should be able to: Identify the most appropriate location to store different kinds of data
     Like  Bookmark
  • Hello world
     Like  Bookmark
  • Join us at the e-Research annual symposium on Friday 11th April for our poster competition! This is your chance to share your current work, highlight how you've used our e-Research resources, and connect with researchers from across disciplines. It's a great way to get feedback, spark new collaborations, and gain valuable exposure. The judging panel will award a prize of £100 to the poster that they believe best: Communicates your research clearly and engagingly to a broad audience. Demonstrates innovative and effective use of e-Research resources (e.g. HPC, RDS, TRE). You will also be offered the opportunity to present a 1-minute lightning talk to encourage people to come and see your poster. These lightning talks will be held immediately before the poster session from 16:20. The poster session will run during the networking session at the end of the symposium and provide plenty of time for people to speak to you about your research. The prize of £100 will be given as a gift card.
     Like  Bookmark
  •  Like  Bookmark
  • Conclusions - text suggested for paper These algorithm benchmarks were collected on the CREATE HPC cluster at King's College London using the scratch storage filesystem. This filesystem uses the Ceph distributed storage system across a large number of storage nodes with diverse disk types and properties. The CREATE HPC scratch storage in file access benchmarks was found to have a throughput of 645MiB/s sequential read (1179MiB/s for cached files) and a random read latency of 37.8 ms (sdev 51.4 ms) or 12.9 ms (sdev 39.1 ms) for cached files. Sequential write throughput was 494MiB/s with a random write latency of 48.1 ms (sdev 86.4 ms). These storage benchmarks were performed during a period of low usage and were found to be subject to occasional degradation due to factors out of our control within the HPC cluster and its Ceph storage cluster. Notes
     Like  Bookmark
  • [ ] (Louise) Get some resources on testing for HackMD doc or similar. [ ] (Jost) Change testing example to something where we make a change to a function, e.g. numpy instead of list [ ] (Louise) Add timeit to Profiling section (also defaulttimer) [ ] After premature optimisation section, add a new section about not doing all steps manually and letting optimal Python/C code be used wherever possible. Include content from start of NumPy section. [ ] (Jost) Revise content of dictionary and sets sections to link in with new section above [ ] (Jost) Think about whether diagrams would be useful for the secrching section, Louise has volunteered to help if needed. Or use library analogy. [ ] Skip bytecode section? Perhaps use smaller example. [ ] Skip scope section, not going to be useful for substantial loops etc. [ ] "Built in Functions Operators" make edit to title. [ ] Add an example to Built-in Functions section. Or maybe link to earlier material?
     Like  Bookmark
  • Key messages STEP-UPWork with us Successes of RSE to emulate Related roles and how things are tied together What could we have done better? What's possible now that wasn't 10 years ago
     Like  Bookmark
  • Why are we doing this? We've started to receive requests for it Recording who's been trainedAnd what they've been trained with Supports compliance Allows us to do onboarding with many people in parallel - currently doing a lot of this one-to-one Address common queries e.g. "can I have internet access?" Who is this for?
     Like  Bookmark
  • Points to make Short-term engagement Example scopes Any domain Try to assign someone with relevant background if possible How to request
     Like  Bookmark
  • We should where possible endeavor to make the software we create: correct accessible maintainable extensibile (re)usable performant Default practices
     Like  Bookmark
  • Web Development Backend See also https://roadmap.sh/backend flowchart subgraph Fundamentals direction TB python[Backend language<br>e.g. Python, PHP, JavaScript] end
     Like  Bookmark
  • 2024-11-04 High priority areas Asset register Risk flow chart Mapping out what kinds of project / software we work one.g. planning to commercialise? How to structure this work Start with a list of problemsWhat do we need to agree to avoid them - document Where do we want to point at policy to avoid a difficult conversation / protect our ways of working
     Like  Bookmark