---
tags: temporal
---
# COVID-19 SciCloj Online Hackathon I
_This document is temporal: it is not meant to be updated after 2020-03-21_
This document is public and editable. Please add your own ideas, and consider which projects you're most interested in :)
## Projects
Projects highlighted by the organizing team before we split into groups to work.
### Explore article: COVID-19 data in the REPL
On March 6th, Dave Liepmann published [COVID-19 data in the REPL][1] on the Applied Science Studio blog. The article explores COVID-19 spread using the Johns Hopkins data, applies it to Germany, and discusses how to think about visualizing the data. A [very brief intro to the project](https://youtu.be/-Yk4KoSwPms?t=3230)
Beginner path: load up the data in the REPL and reproduce the graphs in the article.
Beginner path: "at this location, what was the time where there was a change in the trend?"
Advanced path: Adapt the COVID19 data in the REPL blog post to new local datasets.
Link to Zulip stream: https://clojurians.zulipchat.com/#narrow/stream/227504-covid-19/topic/project-covid-19-data-in-the-repl
Participants:
- John (Practicalli) add UK GeoMap and specific views of UK
- David Schmudde. Add Italy and use it to describe local trends.
- Noor and Ashima. Explore the repo and visualize India's data.
### Explore article: Baby steps with Covid-19 data for (Clojure) programmers
https://dragan.rocks/articles/20/Corona-1-Baby-steps-with-Covid-19-for-programmers
Beginner path: see that it works at your machine, and try to ask&answer new questions.
[Example project](https://github.com/practicalli/data-science-corvid-19) that you can open `src/practicalli/design_journal.clj` in your editor, start a REPL and evaluate expressions starting at the top.
[Video walk-through of the article](https://youtu.be/-Yk4KoSwPms).
### SEIR model Epidemic simulations
[Chat Channel](https://clojurians.zulipchat.com/#narrow/stream/227504-covid-19/topic/project-seir-infection-model)
Demonstrate epidemic model SEIR (Susceptible → Exposed → Infected → Removed) in Clojure for Covid-19 to similate community spread. If time permits, create a risk exposure calculator.
* [Clojure seirlib](https://github.com/anthonyghr/seirlib) Looks promising
* [Clojure SEIR infectious disease modeling](https://github.com/kaz-yos/epi501-clojure) - Probably not what we need
* Example [Epidemic calculator](https://gabgoh.github.io/COVID/index.html) and [epcalc source](https://github.com/gabgoh/epcalc/blob/master/src/App.svelte#L95) - In svelte language
### Look for existing notebooks
(Daniel)
### Discuss how we best can contribute as a community
How can we better help? What is the broader picture? Where do Clojurians fit in, what is their unique skill that they can contribute?
Possibly applicable for a bigger team. Discuss, and summarize the disucssion. Could turn into a blog post.
Possible topic: data quality. We test too few, and tread confirmed infected as infected. This leads to lagging indicators of infected.
Possible topic: Concern about how we present our findings
Format: video chat might provide higher bandwidth than Zulip
Participants:
- Teodor
Discussion with one person doesn't make a ton of sense. So please ping me if you'd like to chat!
## Project Ideas
Community contribution: where can we help?
- Adapt the [COVID19 data in the REPL][1] blog post to new local datasets
- Related technology: Oz/Vega-lite, geoJSON
- [repository](https://github.com/appliedsciencestudio/covid19-clj-viz)
- Build a Clojure interface to the [Outbreaks data set][2]
- Related technology: R
- Even better: build a plug-and-play tool to automatically convert Rdata to a neutral data format (e.g. JSON or EDN)
- this would be **incredibly** useful
- perhaps with babashka?
- or as a single-public-function library?
- same but for Pickle data from Python-land
- Chart or map [Germany data][5] (by date & region)
- Apply linear & exponential models to case reporting data
- NB: this would be purely for our own fun/education, because it's not rigorous enough for actual science and especially not for public release
- Provide Clojure-accessible data translation of [Mexico data][6]
- Play with interactive visualization as a way of understanding COVID-19 spread. An article like this has already been published in Washington Post. For inspiration about interactive visualization, see Bret Victor
- Reference: [Why outbreaks like coronavirus spread exponentially][3] (Washington post)
- Reference: [Learnable programming][4] (Bret Victor)
- Paint some data art from [coronavirus genome data][7]
- Possible technologies: Quil &more
- Find and visualize data source for pollution changes since date of lockdowns
- e.g. [NYC](https://www.dec.ny.gov/cfmx/extapps/aqi/aqi_forecast.cfm)
- Challenging: trawl news sources to try to determine the degree of under-reporting & under-testing in various regions (especially Iran & USA)
- Manual but incredibly useful effort: collect data pitfalls, e.g.:
- [Germany not testing the dead][8], causing their death rate to be artificially low
- [criticisms of WHO data](https://ourworldindata.org/coronavirus#why-we-stopped-relying-on-data-from-the-world-health-organization)
- Cross-country/region under-reporting calculator: extrapolate from deaths to what the infection rate must be, compared to reported cases
- Cross-country data analysis can provide insight into the effectiveness of policy decisions.
- Policy dimensions can be soft and difficult to measure: transparency of reporting, sense of urgency from leadership, etc….
- Policy dimensions can be quantifiable: number of people tested, date of first quarantine order, pandemic response budget, rate of spread after two weeks, etc….
- My goal is to encourage people to stay confined by doing data-driven predictions. Questions that I would like to be answered:
- Given a country, how likely do I have COVID-19 without knowing it?
- What could be the consequence on other people if I don't stay confined at all?
- If I decide to visit my parents, what is the likelyhood of them getting COVID? likely of them dying?
- If I remove luck out of the equation, how would their life expectency be reduced?
- Create repository under Scicloj for people to share explorations?
- Not data science, but important for people under lockdown/quarantine: apps to keep in touch, check on each other, maintain solidarity and good habits (home cooking, home workouts)
- Not data science, and not sure this is a good idea, but: social distancing tracker. Would ping you to write down the people you come into contact with while in lockdown. It would have to ping the user every few hours (2x/day minimum) which might be annoying enough to kibosh the whole plan. But tracking our often-imperfect social distancing *might* be helpful. E.g. "roommate (every day), supermarket cashier (Tuesday), took an uncrowded metro (Friday)".
- If you can invest just an hour or two: I have ~2.6K lines of clojure code. Help me please with cleanup, refactoring, better names, TODOs etc. See [Bost/corona_cases](https://github.com/Bost/corona_cases). 🙏 Thanks in advance.
[1]: http://www.appliedscience.studio/articles/covid19.html
[2]: https://github.com/reconhub/outbreaks
[3]: https://www.washingtonpost.com/graphics/2020/world/corona-simulator/
[4]: http://worrydream.com/LearnableProgramming/
[5]: https://www.citypopulation.de/en/germany/covid/
[6]: https://github.com/guzmart/covid19_mex/
[7]: https://www.ncbi.nlm.nih.gov/genbank/sars-cov-2-seqs/
[8]: https://www.reddit.com/r/Coronavirus/comments/fi1k9s/germany_currently_not_doing_postmortem_tests_for/