owned this note
owned this note
Published
Linked with GitHub
---
title: DS21S08
tags: class
slideOptions:
theme: white
transition: slide
---
## <span class="censor">Data Studies 2021 // S08. Networks and platform-specific data</span>
<!--image for class-->
<img src="https://hackmd.io/_uploads/SJSzy8uuw.png" width=65%>
Pablo Velasco // Information Studies // [pablov.me](https://pablov.me)
---
## Plan for the day:
* Graph theory & Sociometrics
* ANT epistemology & networks
* *Digital Methods* and Digital Methods Initiative
* Digital Objects: hyperlink & hashtag
* TCAT server
---
<!-- .slide: data-background-image="https://gitlab.com/xpablov/data-studies/-/raw/master/DS19/S07/koenigsberg.png" data-background-size="auto 100%"-->
<br><br><br><br><br><br><br><br><br><br><br><br><br>
<span class="censor">The seven bridges of Königsberg problem: is it possible to cross each bridge once and return to the starting point?</span>
----
<!-- .slide: data-background-image="https://gitlab.com/xpablov/data-studies/-/raw/master/DS19/S07/euler-graph.jpg" data-background-size="auto 100%"-->
### <span class="censor">Graph theory</span>
<br><br><br><br><br><br><br><br>
<span class="censor">Leonhardt Euler solved the problem, and inaugurated the field of graph theory: all points must have an even degree (an even number of connections).</span>
<!--
Route is irrelevant
Problem has no solution
Abstraction of the problem: nodes and edges
As a walker, number times one enters is equal as number of times one leaves (except starting and ending point): thus, the number of edges must be even
-->
----
Baran (1964)
* Structures based on Nodes and Edges
* Statistical/Quantitative relations: clusters, centrality, density
* Centralized, Decentralized and Distributed
<img src="https://gitlab.com/xpablov/data-studies/-/raw/master/DS19/S07/baran.png" width="100%">
----
<!-- .slide: data-background-image="https://i.imgur.com/kYHBgzT.png" data-background-size="100% auto"-->
----
<!-- .slide: data-background-image="https://pablov.me/pres/media/moreno-background.png" data-background-size="auto 100%"-->
### <span class="censor">Sociometrics
* <span class="censor">Moreno (1934)
* <span class="censor">Discover social life through network properties & topologies
* <span class="censor">Analysis of relational data
* <span class="censor">Gain insight into social relations and make them available to intervention (Guggenheim 2012) -> participants become observers of their own problems</span>
----
<img src="https://gitlab.com/xpablov/data-studies/-/raw/master/DS19/S07/moreno-rooms-girls.png" width="45%">
<img src="https://gitlab.com/xpablov/data-studies/-/raw/master/DS19/S07/moreno-topological-girls.png" width="40%"><br>
Likes and dislikes in a girl's boarding school
<span class="refs">Gießmann, S. (2017). Jacob Levy Moreno, Sociometry, and the Rise of Network Diagrammatics</span>
<!-- – how to study inter-personal life on a micro-scale. - In a series of five maps, the concrete topography of the camp with its 435 girls in sixteen houses is transformed into a topological structure. Attraction (red), rejection (black), attraction/rejection (red/ black), and indifference (blue) are represented graphically.
-to help explain why girls attempted to escape
-->
----
### ACTIVITY: trace a network of the class
* What are the nodes?
* What are the edges?
* What is the *starting point*?
* How do you measure the weight of the edges?
Go to: https://kinopio.club/ds21s08-XQXwaV8m0qglvM_h3BElR
(you need to sign up, but not obligatory!)
---
### Actor Network Theory(ANT)
Actor Network Theory emphasizes a "flat" approach to the social, and argues that it can be study as a network consisting of any kind of elements.
<img src="http://pablov.me/pres/media/latour-reass.jpg" width="30%">
<span class="refs">Latour, B. (2007). Reassembling the Social: An Introduction to Actor-Network-Theory</span>
----
### ANT
* **Relations** between “technology” and “society”
* **1-level**: neither nature and society, nor micro and macro
* **Actors**: computers, text, humans, etc (and mostly the places they "collide")
* Properties emerge from networks
<img src="https://i.imgur.com/DibiU8i.png" width="90%">
<span class="refs">Latour, B., Jensen, P., Venturini, T., Grauwin, S., & Boullier, D. (2012). ‘The whole is always smaller than its parts’–a digital test of Gabriel Tardes’ monads.</span>
----
* No point of departure (but an epistemological point of entry)
<img src="https://i.imgur.com/TULeDRo.png" width="80%">
<img src="https://i.imgur.com/63gd1KS.png" width="80%">
<img src="https://i.imgur.com/yr1lP78.png" width="80%">
<span class="censor">More subtle than the notion of system, more historical than the notion of structure, more empirical than the notion of a complexity, the idea of network is the Ariadne's thread of these interwoven stories (Latour 1993)</span>
----
#### Actors in a social network:
* By their engagement roles:
* sources
* leaders
* commenters
* lurkers
* etc.
* By their human / non-human categorization:
* bots
* hashtags
* servers
* etc.
* By their structural role:
* bridges
* hubs
* stars
* etc.
---
*How to do empirical research from this epistemological approach?*
# Digital <span style="color:hotpink">methods</span>
----
## *Digital Methods* (big umbrella)
* Digital methods: “the use of online and digital technologies to collect and analyse research data"
<span class="refs">Snee et al (2016). Digital methods for social science</span>
* Snee et al include:
* web-based surveys
<span class="refs">Dillman, D. A. (2011). Mail and Internet Surveys: The Tailored Design Method</span>
* online interviewing and focus groups
<span class="refs">Kazmer, M. M., & Xie, B. (2008). Qualitative Interviewing in Internet Studies: Playing with the media, playing with the method</span>
* computer mediated discourse-analysis
<span class="refs">Herring, S. C. (2004). Computer-Mediated Discourse Analysis</span>
* digital ethnographies e.g. virtual ethnography
<span class="refs">Hine, C. (2000). Virtual Ethnography</span>
----
## Redistribution of methods
<span class="refs">Marres, N. (2012). The redistribution of methods: On intervention in digital social research, broadly conceived</span>
* redistribution of research: not so much an opposition between IT firms and researchers (e.g. Savage and Burrows 2007), but a reconfiguration of agents in social research
* **4 views on the redistribution of methods** (along a spectrum):
* **methods as usual**: old social methodologies incorporated into digital devices
* **big-methods**: vast datasets allow us to perform large-scale analysis on *real* network dynamics**
* **virtual methods**: adaptation of the social research methods into the digital
* **<span style="color:hotpink">digital methods**</span>: adapt digital devices for the purposes of social research
----
## "Natively digital" methods & objects
<span class="censor">The rise of the Internet enables new research methods that deploy specifically digital devices such as links, comments and shares</span>
<span class="refs">Rogers, R. (2009). The End of the Virtual: Digital Methods</span>
* **methods** embedded in online devices: of crawling, scraping, folksonomy
* **digital objects**: tweet, username, timestamp, <span style="color:hotpink">hyperlink, hashtag</span>
* built upon existing services
### *Places* of research (*spheres*)
<span class="refs">Rogers, R. (2013). Digital Methods</span>
1. mid to late 90s: hyperlink, individual website analysis
2. early to mid 00s: blogosphere, search engine critique
3. late 00s: location-aware, web 2.0, social media
**"cross-spherical analysis”: web, blogs, news, twitter, etc**
----
## Digital Methods Initiative

https://wiki.digitalmethods.net/Dmi/ToolDatabase
---
<!-- .slide: data-background-image="https://pablov.me/pres/media/google-whitepaper.png" data-background-size="auto 100%"-->
## <span class="censor">Digital object: *hyperlink*</span>
<span class="refs" style="background:#333333;color:white">Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine.</span>
----
DMI tool: Issuecrawler
<img src="https://i.imgur.com/abkB5t3.png" width="70%">
https://issuecrawler.net
----
## <span class="censor">Digital object: *hashtag*</span>
<!-- .slide: data-background-image="https://cdn.theculturetrip.com/wp-content/uploads/2017/07/weechat_0-3-5_-_es-wikipedia_at_irc-wikimedia-org_-1024x576.png" data-background-size="auto 100%"-->
<span class="refs" style="background:#333333;color:white">Early use: [IRC(Internet Relay chat)](https://webchat.freenode.net/)</span>
----
## Ad-hoc hashtags
* *ad hoc* : emergent "as necessary" / "when needed"
* coordination between distributed, and possibly disconnected actors
* '#'
* user generated
* competing specific themes and uses
* evolving and non-deterministic
* community: common topics and (direct) engagement
* issue network (Marres): comunities generated from contested topics
<!-- how to find the right/relevant hashtag-->
<!--not trendy topics -->
<!-- issue network: BLM WLM-->
----
<a href="https://edition.cnn.com/2020/10/04/us/proud-boys-twitter-hashtag-gay-men-trnd/index.html" target="_BLANK"><img src="https://hackmd.io/_uploads/rkSpPxwuw.png" width="70%"></a>
----
Top hashtags in the Climate Change space on Twitter over time (March-Jun 2012)
<img src="https://i.imgur.com/cF3PGQ5.jpg" width="60%">
<span class="refs">(see project information [here](http://blogs.cim.warwick.ac.uk/issuemapping/cases/issue-lifelines/))</span>
---
### Visual Network Analysis
(tbc in workshop)
* evolving, non-deterministic data
* force-vector: *gravity*
* not spatial, but relative position of the nodes (in relation to each other)
* borders are not exact, communities are fluid and can overlap
* mixed methods: qualitative inquiry before, after, or alongside the quantitative analysis
<span class="refs">Venturini, T., & Jacomy, M. (2014). Visual Network Analysis</span>
----
<img src="https://i.imgur.com/WI43yHD.jpg" width="70%">
Based on Venturini et al (2014), which structures can you identify in this network? Is it significant, regardless of its specific content?
---
## DMI tool: TCAT
https://ds.cc.au.dk/
(access code on [workshop page](https://brightspace.au.dk/d2l/le/lessons/25343/lessons/656580))
* API based
* Built by keywords OR geolocations
* Can be thoroughly queried (if you have a complicated query on a big bin, it will take some time)
* Different data and formats can be exported from it (depending on the amount of data and the bin size, this process may take some time)
<style>
.reveal{
font-family:mono;
font-size: 25px;
}
.reveal .censor{
background:#333333;
color:white;
}
.reveal .censorw{
background:white;
color:black;
}
.reveal .pinky{
color:#e5157d;
font-style:italic;
font-size: .8em;
}
.reveal section img {
border: none;
box-shadow: none;
}
.reveal section left{
width:50%;
}
.reveal .refs {
color: grey;
font-size: small;
text-align: left;
}
</style>