# Introduction to data analysis with R
This is our joint workshop document, where we can collect additional resources, feedback, propose data sets, etc.
Feel free to add other information that you find helpful.
# Additional Resources
Add links to additional resources that helped you and provide a short description what the resource is about
- [tidyverse styleguide](https://style.tidyverse.org/): A book with best pratice code styling, variable naming, commenting, ... Very useful if you want to write nicely readable and sharable code according to best practice
- [styler package](https://styler.r-lib.org/): A package that helps you to automatically style your code (e.g. by applying the tidyverse style guide). On the top of the webpage, you can navigate to some articles that help you get started with the package
# Ideas for Bring your own data
## Project proposals
If you have a project you want to propose, just list it below with a short description. Others can see it and add their names next to the project if they are interested.
The description should include a short description of the data set and questions that can be invesigated. If you already have methods in mind (statistical methods, visualization, ...) you can add that to the project description.
Franz: cannot share the dataset unfortunately
## I'd rather work alone on my data
If you would rather work alone on your data set, that's totally fine. However, you will still be grouped with other people that also want to work alone on their data set. In order for the groups to be not totally random, please list your name below and shortly describe which data you have or which methods you intend to try.
* Franziska, tidy and plot ICP data sets
* Charlotte, reprocessing my data and making the plots look nice and tidy for a paper
* Franz
* ~Making a dataset ready for the (pharmacokinetic) modelling software NONMEM~ (cannot share the dataset unfortunately)
- Data from intensive care unit patients receiving dialysis(SLED) and the antibiotic drug meropenem as a infusion
- Probably lots of dplyr and tidyr
- this project would include the incorporation of several Excel sheets into one which could be used for the modelling process
- Here are the requirements for the dataset (https://ascpt.onlinelibrary.wiley.com/doi/10.1002/psp4.12404)
- the original dataset consists out of 3 excel sheets which need to be combined to one
- empty cells must be filled
- the EVID column is not correct and must be replaced
- formats of the dates and times must be checked and changed if necessary
- the Excel file could be found here
* Nilofer, format parts of my dataset (as it is quite big and covers >100 years) and explore the functionalities of R I learn't in the workshop. Specifically, I am looking forward to investigating if group sizes impact the productivity/reproductive output of the females. For this I want to explore the data with scatter plots, and use Linear regression and other functions relevant to the topic, I learn today.
## I don't have any data and don't want to joint a project
If you don't want to joint any of the projects proposed above and you don't want to work on your own data set, add your name below. I will provide some example data sets to experiment with.
* Yomna Nassar: I would prefer something on data cleaning and arrangments (dplyr/tidyr)+ I can also improve some of my plots for poster presentation
* Lena
* Baile
* Tingting
* Esther
* Melanie
* Dilem
* Christin
# Feedback & Suggestions
Please feel free to provide feedback on tasks, slides, etc. below.
If you find feedback that you agree with, you can also add a :thumbsup: next to it, then I know which points many people agree on.
To give feedback on the tasks, please just add any emojy next to the bullet point that you agree with:
## General comments
- Slides on linear models are a too detailed :thumbsup: :thumbsup: :thumbsup:
## Day 3
#### General Feedback
#### Tasks
- **1: Tests**
- too easy
- easy
- hard
- too hard
- just right :+1:
Comments to task 1 can be added here
- **2: Linear models**
- too easy
- easy
- hard :+1:
- too hard
- just right
## Day 2
#### General Feedback
- It would be nice if group works have specified times when they stop and we are called back...sometimes I lost track of time
#### Tasks
- **1: ggplots**
- too easy
- easy :+1: :+1: :+1:
- hard
- too hard
- just right :+1: :thumbsup::thumbsup:
If you have comments just provide them here
- **2: data transformation with dplyr**
- too easy
- easy
- hard :+1:
- too hard
- just right :+1: :+1: :thumbsup::thumbsup: :+1:
If you have comments just provide them here
-with a little more time, it would have been ok
- **3: tidy data with tidyr**
- too easy
- easy
- hard :thumbsup:
- too hard
- just right :+1: :thumbsup: :+1:
If you have comments just provide them here
## Day 1
#### General Feedback
#### Tasks
- **1: Create RStudio project**
- too easy
- easy :+1::thumbsup: :thumbsup::thumbsup: :thumbsup::thumbsup: :+1: :+1:
- hard
- too hard
If you have comments just provide them here
- **2: Vectors and data types in R**
- too easy
- easy :thumbsup: :thumbsup::thumbsup: :+1:
- challenging :+1:
- hard
- too hard
- just right :thumbsup: :thumbsup::thumbsup::thumbsup:
If you have comments just provide them here
- **3: Tibbles**
- too easy
- easy :+1:
- hard
- too hard
- just right :thumbsup::thumbsup: :+1: :thumbsup::thumbsup: :thumbsup::thumbsup:
If you have comments just provide them here
- **4: Readr**
- too easy
- easy :thumbsup: :+1:
- hard:thumbsup:
- too hard
- just right :thumbsup: :+1: :thumbsup::thumbsup::thumbsup:
If you have comments just provide them here