changed 4 years ago

Data Dojo Würzburg 3

August 2021

Token: ftrBCaE4EMXu9ac4

Participants

Please add your name to the list (click the pen icon at the top left to edit) if you plan to come. And please remove it if you can not make it.

  • Markus (R/tidyverse)
  • Laura (R/Python (pandas))
  • David (python, a bit of R)

General Question

Should we decide on a tool/programming language and collaboration workflow beforehand? Maybe even stick to one programming language? Idea: people can add their preferred language/tool upon sign-up. If there are at least three participants with the same preference they'll use that in a break out room. If there are not enough people for a tool they can join other rooms or agree on using any other tool together.

Collaborative Tools and Workflow

For Notebooks (R, python, julia, js, ) with real time collaboration CoCalc seems to be the best option right now. It worked great last time so we'll stick to it for now. You need to register an account there (it is free).

Real time collaboration tools

Feel free to add suggestions to this list

Unfortunately, notebook support with VS Code Live Share is an experimental feature which is currently broken (but expected to be fixed soon): https://docs.microsoft.com/en-us/visualstudio/liveshare/reference/notebooks and https://github.com/MicrosoftDocs/live-share/issues/4291. Update: on June 22, this issue was anounced as fixed. Unfortunately there is a new (unrelated) issue that prevents notebook output from being shown for guests: https://github.com/MicrosoftDocs/live-share/issues/4342. This issue is being worked on...

Once this is available, it seams to be the ideal solution especially as it has a unified interface independent of the language (python/julia/R/)

If notebook support is fixed the VS Code Insiders version would be required as long as it is experimental. This can be installed alongside regular VS Code without problems.

Dataset

UN Votes

Question Pool:

  • Generic
    • What kind of information is stored in the table?
    • How much data is missing?
    • Is the dataset clean or are there any clear outliers?
  • Specific
    • Which country has the highest fraction of "Yes" ("No") votes?
    • How did the fraction of "Yes" votes change over time for each country (by topic)?
    • Which pair of countries (dis)agrees the most?
    • What are the topics of the votes that Germany voted "No"?
    • Which vote had the least amount of "Yes" votes?
    • What was the amount of general agreement over the years?
    • How many votes went unanimous?
  • Further Ideas
    • Create a world map where you can select a year and countries will be color coded based on the fraction of "Yes" votes in that year
    • Same map as before but you can select specific votes and see the "Yes"/"No"/"Abstain" per country
    • Country graph with connections depending on the correlation of voting behaviour

Future Suggestions

Add your suggestions to the list and

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →
to the end of a line you are interested in

Datasets

Kinds of Questions

Tools/Languages

Skills

  • interactive maps
  • dashboards
  • animations
Select a repo