(*DS course, Data manipulation, analysis and visualisation in Python, UGent, June 2022*)
# DS course Python for data analysis, June 2022
## README first
------------- *do not change this section* -------------
For each of the following sections/questions, add your answer to the list. If someone else has already listed your answer, do NOT add it as a new item; instead, add a `*` in front of the existing answer.
As an example, suppose the question is *What's your favorite Simpsons character?* and the current answers are:
```
* Lisa
* Bart
```
If your answer is *Bart*, the result after your edit will be:
```
* Lisa
** Bart
```
If your answer is *Homer* instead, the result after your edit will be:
```
* Bart
* Lisa
* Homer
```
___!! Respond in between the triple ticks!!.___
OK? Ready to go!
## I've been working with Python since...
(*respond with a `year`*)
```
******2022
****2021
2020
2019
2018
*** 2017
2016
2015
* 2014
2013
<2012
```
_!! Respond in between the triple ticks!!._
## My first programming language was...
(_for example `C`, `C++`, `Pascal`, `Go`, `R`, `Python`,..._)
```
******Matlab
* R
** Python
** C
* Fortran
* Java
```
_!! Respond in between the triple ticks!!._
## I have experience with the following programming languages:
(*multiple answers are possible*)
```
********Matlab
********Python
*******LaTeX
*Java
*** R
** bash
*** C
* Fortran
VBScript
SQL
```
_!! Respond in between the triple ticks!!._
## I currently create graphs mostly with...
(_for example `plotly`, `gnuplot`, `excel`, `tableau`, `ggplot`, `matplotlib`, `bokeh`, `altair`, `matlab`,..._)
```
*******Excel
**matplotlib
***matlab
*R
* ggplot
*SPSS
```
_!! Respond in between the triple ticks!!._
## The size of the data sets I typically work with is around...
(*if lots of different data sets, pick those you want to handle in Python*)
```
**** < 1MB
1MB < . < 10MB
*10MB < . < 100MB
***** 100MB < . < 1GB
*1GB < . < 10GB
* 10GB < . < 100GB
* 100GB < . < 1TB
> 1TB
```
_!! Respond in between the triple ticks!!._
## Currently, I mainly do data manipulation with...
(_for example `python`, `R`, `excel`, `SQL`, `bash`, `matlab`,..._)
```
****Python
*******Excel
**matlab
**bash
*R
**SPSS
```
_!! Respond in between the triple ticks!!._
## What I like most about Python is...
(_can be both technical and/or non-technical reasons_)
```
*******freeware
******relatively user friendly syntax
****reproducible graphs
*basic syntax
*****increasingly popular programming language in my research domain
*fast
```
_!! Respond in between the triple ticks!!._
## Python packages/modules I'm using a lot:
(_hence, what you typically `import ...` to start with; multiple answers are possible_)
```
*** statistics
*** seaborn
**** matplotlib
** os
* re
***** numpy
**** pandas
* glob
* plotly
* scipy
```
_!! Respond in between the triple ticks!!._
## What I hope to learn this week...
```
How I can use Python for data manipulation and visualization in my research group
********* smooth data importation and visualisation
****** clean coding
****** visualization
*** being more efficient with my code
*** big data efficiency
**** best practices
** better understanding of my errors
```
_!! Respond in between the triple ticks!!._