<!-- .slide: data-background-image="https://i.imgur.com/Gncl5wR.gif"-->
<!--https://www.nytimes.com/interactive/2015/11/24/upshot/thanksgiving-flight-patterns.html-->
# <span style="color:#00bfbf">Data Studies 21 // S01</span>
---
<!--Pablo-->
Pablo Velasco ([pablov.me](https://pablov.me)) & Midas Nouwens ([@midasnouwens](https://twitter.com/midasnouwens))
<img src="https://pablov.me/pres/media/pablo.png" width=25%>
<img src="https://cc.au.dk/typo3temp/_processed_/csm_Midas_Nouwens_LK_556_PURE_square_f913e09f43.png" width=25%>
\+ Monika Mortensen
\+ Jakob Adolph
\+ Sine Jehle
// Information Studies @ Aarhus University
<!-- philosophy, blockchain, floss, draw, fat-->
---
### Plan for the day:
* instructors
* corona guidelines
* course structure <!-- workshop issues "instruktor time"-->
* *data studies* and survey *results*
---
<!-- .slide: data-background-image="https://media.giphy.com/media/hrRJ41JB2zlgZiYcCw/giphy.gif"-->
# <span style="color:white">BUMPY RIDE</span>
<!--
-tech issues unavoidable
-changes to calendar
-adapt
-->
---
## [Corona info](https://hackmd.io/@xpablov/rk7zKDgXP)
<img src="https://i.imgur.com/Ox8nkEh.jpg" width=400px>
---
### Course structure
* lectures
* workshops (discussion + tools and troubleshoot)
* groupwork: mini projects
* term project (starts W41) + final presentation (W49)
* exam (based on term project)
----
### Learning outcomes:
* **Knowledge**:
* Demonstrate an understanding of the role of data in society
* <span class="censor">Critically reflect on the use of data</span> to conclude general conditions in the world and in digital environments
* **Skills**:
* <span class="censor">Use digital tools to collect, analyse and present data</span>
* Critically reflect on the production and use of data in specific cases
* **Competences**:
* Critically <span class="censor">analyse and consider the role of data in society</span>, as well as the use and design of digital technologies for data collection and production
---
# DS
1. <span class="censor">What is DS?</span>
2. <span class="censor">What are the problems/issues DS deals with?</span>
3. <span class="censor">How to research data?</span>
----
## FIELDS inspiring or related to DS
Data Science
Data Rights/Governance
Digital Humanities
Cultural anaytics
Software Studies
Human-Computer Interaction
Science and Technology Studies
Digital Sociology
**Digital Methods** (Rogers 2017)
* <span class="censor">"the deployment of online tools and data for the purposes of social and medium research"</span>
* <span class="censor">"indications of societal concern"</span>
* <span class="censor">"rethink conditions of proof"</span>
* <span class="censor">"inquiries into the extend to which the medium is affecting the findings"</span>
<!--the medium can be google, but also a programming language-->
---
# SURVEY
<img src="https://i.imgur.com/e2kqNhx.jpg" width=500px>
<span class="pinky">"Why all these funny questions"</span>
<span class="pinky">"Weird questionnaire"</span>
----
### What do you understand by "data"?
<iframe style='width: 637px; height: 573px;' src='https://voyant-tools.org/tool/Cirrus/?stopList=keywords-7cfaf7cac239a91293f9b45ee1e30613&whiteList=&visible=500&corpus=57322bc00b12369c7148b0dfc43920a5'></iframe>
<!--
*voyant: https://voyant-tools.org/?corpus=9fcdb76f51f1ba4df8a842da03311224
-->
----
#### Incomplete sample
<span class="pinky">"You get 6 answers back and the responses can be considered data, which can be used for further analysis."</span>
<img src="https://i.imgur.com/ZRlICa4.png" width="85%">
<!--49 completed: it, but didnβt finish (curious about their reasons
-->
----
#### Irregular inputs
<!--
danish, html, upercase, typos: cleaning problem. and context
-->
<img src="https://i.imgur.com/W5EjXIQ.png" width=45%>
<img src="https://i.imgur.com/pV1DQBE.png" width=45%>
<span class="pinky">"Oversigt over typer af kriminalitet med Γ₯rstal, placering og tidspunkt ;) " </span>
<!-- emojis convey another kind of complexity-->
----
### Mention an example of "data"
<iframe style='width: 637px; height: 574px;' src='https://voyant-tools.org/tool/Cirrus/?stopList=keywords-011a2150a9889b353dd0e3ee684053e9&whiteList=&corpus=d2bd11978eea1ab5db04c8342775f29d'></iframe>
<!--voyant: https://voyant-tools.org/?corpus=d2bd11978eea1ab5db04c8342775f29d -->
----
<img src="https://i.imgur.com/2L666Ce.png" width=49%>
<img src="https://i.imgur.com/7Ibr9LW.png" width=49%>
<span class="pinky">"Don't put radio buttons with only two options for gender in a survey (or don't ask). π "</span>
<!--
-Q: Whatβs the difference between these 2 questions?
What is wrong here?
-binaries, preconcieved categories, lack of options, lack of signing out of options, alck of the possbility of creating new options-->
----
<span class="pinky">"Data is thus limited to how the real world can be translated into digital formats, and when extracting something in the world for datafication, certain elements, may or may not be able to be represented internally in a digital system"</span>
<img src="https://i.imgur.com/9dGFCu1.png" width=100%>
<span class="pinky">"information gathered by when choices are made"</span>
<span class="pinky">"a way to make knowledge concrete"</span>
----
<span class="pinky">"Weather data. Temperature, pressure and all that jazz"</span>
<img src="https://i.imgur.com/LNWtmMK.png" width=100%>
<span class="pinky">"Big chunks of information stored in different ways and through different programs and ways. Incredibly difficult to datafy everything"</span>
----
<img src="https://i.imgur.com/lW3WTcj.png" width=100%>
<span class="pinky">"a series of simple symbols that, when strung together and given context can say something about pretty much anything"</span>
----
<span class="pinky">"Messenger (i don't know if it counts)"</span>
<img src="https://i.imgur.com/eLTa6Ch.png" width=60%>
<span class="pinky">"that webpages track and see which items and how long we look at an item in order to provide other suggestions for that kind of product that has caught your intrest"</span>
<!-- f
"...and youtube (if that one counts)"
-tiktok, finally
-Q: should we count fb messenger as fb? What about instagram? Are talking about a company or an interface?
-Q: problem: instagram with a point
-"my roomates"
why assume social network is a platform? -->
----
<span class="pinky">"Also Data can be misused if it comes into the wrong hands"</span>
<span class="pinky">"Data is every character we have ever writen, every click we ave ever made, every word we have ever said and recorded. Everything around us is data."</span>
<img src="https://i.imgur.com/mdAU2dO.png" width=65%>
<!--
-popularity: youtube relies on other platforms
-Q: common fmb without fb?-->
----
<img src="https://i.imgur.com/p73YvZQ.png" width=60%>
<span class="pinky">"numbers"</span>
----
<img src="https://i.imgur.com/ikGKUDF.png" width=60%>
<span class="pinky">"an attempt to quantify natural phenomenons"</span>
<!--
Good for outliers
βScientificβ
Bad for making a good
-->
----
----
<img src="https://i.imgur.com/5mGeLKn.png" width=70%>
<!--
Q: Reasonable prediction?
Q: How did you gather the data? Is this reliable?
-->
<span class="pinky">"I hope the programming will not be too challenging:) "</span>
<span class="pinky">"I hope that even though programming isnt my bestfriend that I'll still enjoy the course"</span>
---
## A (SHORT) NOTE ON ETHICS
<img src="https://i.imgur.com/GGdDm5e.png" width="30%">
<span class="pinky">"Au has a lot of data about me, such as my birthday, name, and so on. This data about me is a set of data. Also when i submit this survey, i submit a set of data about myself."</span>
----
### GDPR (basics for "data collectors")
Processing (articles 4 and 6):
βcollection, recording, organisation, structuring, storage, adaptation or alteration, retrieval, consultation, use, disclosure by transmission, dissemination or otherwise making available, alignment or combination, restriction, erasure or destructionβ
----
<span class="pinky">"Personal information, such as cpr, adress and phone number"</span>
* Personal data (mainly articles 2, 4, 5): identifiable or re-identifiable
* a name and surname;
* a home address;
* an email address such as name.surname@company.com;
* an identification card number;
* location data (for example the location data function on a mobile phone);
* an Internet Protocol (IP) address;
* the advertising identifier of your phone;
* data held by a hospital or doctor, which could be a symbol that uniquely identifies a person.
<!--
Q - your data is βanonymizedβ but, are you re-identifiable?
-->
----
* Sensitive data:
* ethnic origin
* political, religious, and philosophical beliefs
* trade union affiliations
* genetic / biometric data
* health
* sex life & sexual orientation
<!--
Q - why is this kind of data particularly relevant?
[power, cohersion, manipulation, etc]
-->
----
*Have you ever accepted Terms of Agreements (in a website or app) without reading them?*
<img src="https://i.imgur.com/yBJKJfQ.png" width=60%>
----
* Is this ethical?
* Can this harm groups or individuals? Whom?
* Does this violates regulation (is this legal)? Made by whom?
* Is this legitimate?
* Who uses twitter/reddit, etc?
* Whenever we use a tool: who made this? Is my data safe? How can I know that? Is it open/close? Free for some populations? Which company is behind it?
<br><br>
---
## DS is a hands-on approach to the life of data, i.e. a technical practice which considers contextual, ethical, political, and methodological phenomena associated with the collection, processing and presentation of data.
---
<span class="pinky">"A lot of data gathering right of the bat"</span>
π₯ π π π€ π€© π π€ π π€ π€ π ποΈββοΈ
π π ;-) π :3 π π΄ π€ π΄ π π
π π¨πΌβπ» π π€ π₯² π π€ π π π πΊ π
π₯° π ποΈ π βΊοΈ π€ π₯° π πββοΈ π» π π
<span class="pinky">"LetΒ΄s do this baby !!"</span>
---
## For tomorrow:
* download and install: Chrome + [Data Miner plugin](https://data-miner.io/)
* think of a website you'd lke to scrape
* bring your name-tag
<style>
.reveal{
font-family:mono;
font-size: 25px;
}
.reveal .censor{
background:black;
color:white;
}
.reveal .censorw{
background:white;
color:black;
}
.reveal .pinky{
color:#e5157d;
font-style:italic;
font-size: .8em;
}
.reveal section img {
border: none;
box-shadow: none;
}
.reveal section left{
width:50%;
}
</style>
{"metaMigratedAt":"2023-06-16T08:58:23.794Z","metaMigratedFrom":"YAML","title":"DS21S01","breaks":true,"slideOptions":"{\"theme\":\"white\",\"transition\":\"slide\"}","contributors":"[{\"id\":\"088a33aa-785b-401b-a225-d782cd214529\",\"add\":18798,\"del\":8082}]"}