# EM software meeting 2021-07-02

Last meeting minutes: https://confluence.diamond.ac.uk/display/SCI/EM+Meeting+2021-05-21

## Agenda:

### Review action points from the last meeting:

* [ ] Anton Lawrence/Satheesh/Martin: compile a concise list of requirements/issues with the database regarding write access via SQLAlchemy
  * Neil suggests a review at the end of the run, opening up the EM tables for writing, then another review afterwards.
* [x] James Hall: add fields to allow multiple data collections in one visit (raw2, ... folders)
* [x] EM data analysis team: improve pipeline resilience
  * In contact with Relion developers to see if we can address this at both ends, with more resilience on our side and atomic writing on the Relion side.
* [ ] EM data analysis team: utilise all database fields that we can
  * This is blocked by us not yet making use of the Zocalo Buffer table, which we now have. Should have this sorted by the next meeting.
* [x] EM data analysis team/Dan Clare/Yuriy: consider which features are desired in SynchWeb, in collaboration with the new starter and eBIC staff. One output of this meeting will be a 6 month plan for the new starter.
  * We have a proper meeting series for this now. Positive development, things are moving.
* [ ] Dan Clare: show Martin Walsh the Relion pipeline and SW interface.
  * Showed James and Andy the pipeline, but not yet MW.

### Review Agreed Time-table from initial meeting on 2021-04-09:

#### Main Relion stuff:

Future:

#### Q3

- logic to tie data together
- [x] new SW developer start
- fully automate analysis
  - Data collection registration and control processing via SW from next run
    - https://jira.diamond.ac.uk/browse/SC-3114
    - No answer for 2+ weeks; escalate via Nigel next week, and if there is still no traction escalate via SM
    - DC will get into contact with Martin
  - Analysis is triggered automatically once the data collection is in ISPyB (via user input)
    - Keep the options on SynchWeb (e.g. do 2D/3D classification) unless they can be automatically determined (e.g. file extension)
  - We need to know when to stop
    - current logic: stop when there have been no new files for 2 days and all files have been processed
    - after 7 days the process will die due to cluster policy
    - sometimes we iterate on the last batch rather than ending
    - we could use the number of holes as given by EPU, but may have to consider users adding more holes during data collection
- all relevant data in ISPyB
  - driven by visualisation needs, i.e. once we want something shown in SynchWeb, ensure it goes into ISPyB
- SynchWeb features:
  - Particle picking view
  - 2D classes
  - Icebreaker summary report
  - Icebreaker running update

#### Q4

- Discuss Q4 targets in September meeting
- continual improvements
- support different user needs as they arise

### Updates on current progress:

1) SynchWeb
   * new starter has started and is making progress
2) ISPyB
   * new particle picking column added, buffer table set up
3) Relion python/Zocalo
   * it is running and so far there haven't been too many support requests
   * there is still Josh's classic pipeline around, which MG believes is no longer used. This will be removed in the next shutdown
4) EM
   * Want to process EER files; need an option in SynchWeb to add the .eer extension.
     * Setting this extension will use the Relion-internal motion correction
   * Also need to add a new integer field to control the EER fractionation
     * "How many frames do you want in your movie?"
     * then do the maths for users
     * This brings in a rounding complication: if the number of frames is not an integer multiple of this field then frames will be thrown away, which introduces an error into the ~~dose~~ electron fluence calculation.

### AOB:

* none

## Action points

* See unresolved items at the top of the page
* Next meeting in 6 weeks
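
Sketch for the "We need to know when to stop" item under Q3. This is a minimal illustration of the current stop rule as minuted (no new files for 2 days and everything processed) next to the 7 day cluster limit. It is not the pipeline's actual code: the function names, arguments and the idea of comparing the two deadlines are assumptions made for illustration only.

```python
from datetime import datetime, timedelta

# Both limits are taken from the minutes; everything else here is hypothetical.
IDLE_LIMIT = timedelta(days=2)      # "no new files in 2 days"
CLUSTER_LIMIT = timedelta(days=7)   # cluster policy kills the process after this


def should_stop(now: datetime, last_new_file_at: datetime,
                files_seen: int, files_processed: int) -> bool:
    """Current rule: stop once idle for IDLE_LIMIT and nothing is left to process."""
    idle_long_enough = now - last_new_file_at >= IDLE_LIMIT
    all_processed = files_processed >= files_seen
    return idle_long_enough and all_processed


def idle_stop_after_cluster_kill(job_started_at: datetime,
                                 last_new_file_at: datetime) -> bool:
    """True if the earliest possible idle stop is not before the cluster deadline.

    In that case the process would be killed by cluster policy before the
    idle rule gets a chance to end it cleanly.
    """
    earliest_idle_stop = last_new_file_at + IDLE_LIMIT
    return earliest_idle_stop >= job_started_at + CLUSTER_LIMIT
```

A hole count from EPU, as suggested in the discussion, would replace the idle timer with an expected total; that is not modelled here because users may add holes during collection.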
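
Sketch of the EER "maths for users" from the EM update. It assumes the new integer field holds the number of movie frames (fractions) the user asks for, that each fraction is the sum of a fixed whole number of raw EER frames, and that leftover raw frames are dropped. All names below are hypothetical and nothing here is taken from Relion or SynchWeb code; it only illustrates where the rounding error in the fluence comes from.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EerFractionation:
    grouping: int          # raw EER frames summed into each output fraction
    fractions: int         # output fractions (the value the user asked for)
    frames_used: int       # raw frames that end up in some fraction
    frames_discarded: int  # raw frames thrown away by the rounding


def plan_fractionation(total_frames: int, requested_fractions: int) -> EerFractionation:
    """Work out the grouping for a requested number of movie frames."""
    if total_frames <= 0 or requested_fractions <= 0:
        raise ValueError("frame counts must be positive")
    if requested_fractions > total_frames:
        raise ValueError("cannot request more fractions than raw frames")
    grouping = total_frames // requested_fractions   # integer division rounds down
    frames_used = grouping * requested_fractions
    return EerFractionation(
        grouping=grouping,
        fractions=requested_fractions,
        frames_used=frames_used,
        frames_discarded=total_frames - frames_used,
    )


def fluence_per_fraction(total_fluence: float, total_frames: int,
                         plan: EerFractionation) -> float:
    """Fluence per fraction based on the raw frames actually summed.

    Dividing total_fluence by the number of fractions instead would
    overestimate the value whenever frames were discarded.
    """
    fluence_per_raw_frame = total_fluence / total_frames
    return plan.grouping * fluence_per_raw_frame
```

For example, 1000 raw frames split into 30 movie frames gives a grouping of 33, so 990 frames are used and 10 are discarded; the fluence per fraction should then be computed from 33 raw frames, not from 1000 / 30.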