owned this note
owned this note
Linked with GitHub
title: The Scene lives!
description: Echtzeit x GLAMhack 2021
- Presenting (see [intro](https://docs.google.com/presentation/d/1tABLP7omjEkfDQlStdd-XP8LLstOmxaI0tdC73Ifir0/edit?usp=sharing)) [Echtzeit](https://echtzeitkultur.org/) = Contributors to international demoscene
- [GLAMhack](https://openglam.ch) / [OpenGLAM](https://openglam.org/) = Community of international culture-data wranglers
- Our **#GLAMhack 2021** project: https://hack.glam.opendata.ch/project/114
- Prototype web app: **https://scene.rip/**
- Source code app: https://github.com/we-art-o-nauts/the-scene-lives-app
- Prototype API: **https://api.scene.rip/productions**
- Data + API: https://github.com/we-art-o-nauts/the-scene-lives
- Slack channel: [#team-26-echtzeit-x-openglam](https://glamhack2021.slack.com/archives/C01UBFPD7V4)
# 10'000 m goals
1. Make the [demoscene](https://en.wikipedia.org/wiki/Demoscene) more accessible to people who may have never heard about it.
(:wave: hello #GLAMhack!)
3. Create an open dataset of [demoscene](https://en.wikipedia.org/wiki/Demoscene) productions, which could be filtered to individual themes or platforms.
4. Support the [UNESCO digital heritage](http://demoscene-the-art-of-coding.net/) application, or just explore the history of the ['scene](https://docs.google.com/presentation/d/1tABLP7omjEkfDQlStdd-XP8LLstOmxaI0tdC73Ifir0/edit?usp=sharing).
..Or just tune into [SceneSat](https://www.scenesat.com/) :headphones: and enjoy electronic art at the hackathon!
[![Elevated by Urszula "Urssa" Kocol](https://media.demozoo.org/screens/s/c5/a8/8d52.pl82672.jpg)](https://demozoo.org/graphics/88841/)
Elevated by Urszula "Urssa" Kocol
## Our GLAMhack roadmap
- [x] `10 Mine some data from` [demozoo](#demozoo)
- [x] `20 Collect a few bytes from` [pouët](#trumpets)
- [x] `30 Create an initial` [Data Package](#package)
- [x] `40 Document process of` [aggregation](#scraping)
- [x] `50 Set up a basic demo` [service](#service)
- [x] `60 Push dataset as open` [repository](#repository)
- [x] `70 Propose a standard(TM)` [schema](#schema)
- [x] `80 Create and demo the` [prototype](#prototype)
- [ ] ... :money_mouth_face:
# :point_down: The drilldown
[![Road by PG and R0ger](https://media.demozoo.org/screens/s/2d/97/c2a1.182016.png)](https://demozoo.org/productions/201118/)
Road by PG and R0ger
[![Pouet logo by tomaes](http://content.pouet.net/logos/_20080804_4.jpg)](http://www.pouet.net/user.php?who=619)
Pouet logo by tomaes
The oldest and most well known central repository, pouët, makes daily data exports available at https://data.pouet.net/ with a JSON API endpoint at https://api.pouet.net/ and the open source code of it at https://github.com/pouetnet/pouet2.0-api
We downloaded and tried to parse the raw JSON with a couple of tools, and didn't manage to get far. Convoluted structure and formatting errors were rather demotivating. Nevertheless it influenced our thoughts about a "demoscene data standard", and brainstorm ideas of improving overall data quality (for example, we immediately noticed mismatched dates and missing values).
This API was also recently used to do some terrific [data analysis](https://twitter.com/pouetdotnet/status/1345056403338231808), and we reached out to the authors to find out if we can reuse their scripts. We have also reached out via the [#pouet Discord](https://discord.com/channels/534447164043165696/534975603573522433/832566846016323584) for guru meditation.
The Demozoo API is basic but usable: http://demozoo.org/api/v1/
As it's a paginated web service, it would require a bit of scraping code to aggregate. So we used the 'nuclear option' of getting the [database dump](http://data.demozoo.org/demozoo-export.sql.gz) in raw SQL format. Importing this into a local SQLite database (inspired by [all of Wikipedia in SQLite](https://old.reddit.com/r/DataHoarder/comments/mlc9gv/updated_ive_saved_all_of_wikipedia_into_a_sqlite/)) and then re-exported the tables in CSV format. This should be done differently for automated data updates.
We have reached out via the [#demozoo Discord](https://discordapp.com/channels/440807545662078977/440807545662078979/832549024900972564) and [GitHub](https://github.com/demozoo/demozoo/labels/API) for some further ideas.
[![Demos Take Control by Der Piipo and Kustaa and Mazor](https://media.demozoo.org/screens/s/cf/b3/284a.pl55208.jpg)](https://demozoo.org/graphics/50817/)
A popular current way to crowdsource open data in a distributed way is the [Data Package](https://frictionlessdata.io/), the preferred format for doing this using the Frictionless Data project, which has a [create tool](https://create.frictionlessdata.io/) to generate an initial `datapackage.json`.
An initial data package based on the Demozoo archive is at [GitHub](https://github.com/loleg/the-scene-lives), which compiles and aggregates the data from several tables using the Python [dataflows library](https://github.com/datahq/dataflows).
![Toolbox by Frictionless Data](https://frictionlessdata.io/img/home/toolbox.png "Toolbox" =320x240)
Exploring and transforming the data gives us a frame of reference based on which to think about some commonalities and differences between different archives' approaches. We did some research to see what effort in this direction was already made, and reached out to the Demozoo and Pouet communities.
Each of the data sources have a schema of their own, and some attempts at consolidation have been made. We started with a simplified version of the Demozoo model, created a [Table Schema](https://specs.frictionlessdata.io/table-schema/) which can be used for validation or annotation as [JSON Schema](http://json-schema.org/specification.html).
[![Ascii Attack by Latex](https://media.demozoo.org/screens/s/44/0e/ad34.jw63527.png =320x240)](https://demozoo.org/productions/38800/)
Our dataset repository clearly explains its sources, but also points out there are many other places which could be future data acquisition targets. These notable scene repositories and data sources include:
- https://demozoo.org/ (as above)
- https://pouet.net/ (as above)
[![Dump V1.71 by Dead Hackers Society](https://media.demozoo.org/screens/s/d9/ea/5a60.85932.png)](https://demozoo.org/productions/95633/)
While the Data Package is nice to look at in it's JSON glory, most people (and programmers) will want some kind of interface to it. We wrote a small server using the [Falcon Framework](https://falconframework.org/) to produce a barebones API. Since the data is loaded using a Frictionless Data wrapper for the [Pandas library](https://pandas.pydata.org/), it can incorporate various advanced sorting and filtering routines. Our proto-service is currently running at a private VPS hosted on Linode, but should also work on 'lambda function' hosts like Vercel or Heroku.
[![Dream by Vantage](https://media.demozoo.org/screens/s/64/8c/4c76.36647.jpg)](https://demozoo.org/productions/43074/)
Still from [Traffic Jam by Chainsaw](https://demozoo.org/productions/120261/)
If we make it this far, we would like to make a basic example of data usage. After all this data wrangling, we didn't have time to really explore the space of user interface possibilities. But we have a small application that demonstrates the API with an infinite-scrolling user interface showing productions.
See for yourself at https://scene.rip
### Ideas to build upon
- Establish live feed graph view of demoscene productions
- Create an infographic that helps to explain the demoscene
- An interactive app for scrolling through prods (e.g. [Netflix](https://github.com/Basitowaisi/VueJS-netflix-clone) or [Giphy](https://github.com/ademilter/bricklayer/) clone)
- A cheatsheet to learn the most important terms and famous groups/prods
- A virtual reality exhibition (like other teams are working on)
## More or less irrelevant links
- Thread showing various graphs of prod release stats from Pouët https://twitter.com/pouetdotnet/status/1345056403338231808
- Discussion of "greets graph" https://www.pouet.net/topic.php?which=12099&page=1
- Some debate about classification https://www.pouet.net/topic.php?which=12098
- Discussion of Demozoo API https://demozoo.org/forums/66/
- Oleg's blog explaining the demoscene, partially on the topic of "letting the data speak for itself" https://blog.datalets.ch/010/
- Teaser Revision 2017 seminar "Graph databases and the demoscene universe" https://2017.revision-party.net/events/seminars
- Ideas from demosceners on GitHub https://github.com/nesbox/TIC-80/discussions/1286
- Internet Archive gallery https://archive.org/details/softwarelibrary_c64_demos *"To some, the heart of the Demoscene - the self-playing examples of programming and artistic prowess of the last 30 years on the underpowered but extremely flexible C64."*
# Demo or :skull_and_crossbones: die!
[![Dig Your Own Hole by Scape / Mist](https://media.demozoo.org/screens/s/b4/31/369d.pl46352.png)](https://demozoo.org/graphics/39112/)
### :sheep: Thanks for watching!
:love_letter: email@example.com :bird: [@seismist](https://twitter.com/seismist)
Pass it forward: **hackmd.io/@oleg/the-scene-lives**
<small>This presentation is shared under [CC BY 4.0](http://creativecommons.org/licenses/by/4.0/)</small>