<style>
.reveal a {
color: #189ab4;
}
.reveal ul, .red {
color: #b53434;
}
.reveal ul li ul li {
font-size: 50%;
}
.reveal ul li span {
color: #383D3D;
}
.reveal img {
width: 4%;
padding: 5px 5px 10px 10px;
}
</style>
Mathew Vis-Dunbar
Data & Digital Scholarship Librarian
<span class = "red">\- \- \-</span>
mathew.vis-dunbar@ubc.ca
<span class = "red">\- \- \-</span>
2023-09-19
---
# Finding <span class = "red">Data</span>
---
## Provenance
<span class = "red">\- \- \-</span>
Are the data <span class = "red">generated?</span>
Are they <span class = "red">saved?</span>
Are they <span class = "red">distributed?</span>
---
# Questions
---
### What person, body, or organization <span class = "red">generates</span> the data<span class = "red">?</span>
<span class = "red">\- \- \-</span>
This question is also critical to understanding who "owns" the data. "Ownership" can be read in a Western legal context (<span class = "red">*eg*</span> copyright) or interpreted through other frameworks of governance (<span class = "red">*eg*</span> Indigenous Peoples' Data).
---
### Who has the responsibility to <span class = "red">record</span> the data<span class = "red">?</span>
<span class = "red">\- \- \-</span>
Not everything that is produced is preserved.
---
### Who has the responsibility to <span class = "red">manage</span> or curate the data<span class = "red">?</span>
<span class = "red">\- \- \-</span>
recorded data <span class = "red">≠</span> useable data
---
### Who has an interest or a responsibility to <span class = "red">amalgamate</span> the data<span class = "red">?</span>
<span class = "red">\- \- \-</span>
hospital <span class = "red">></span> health authority <span class = "red">></span> province <span class = "red">></span> federal agencies <span class = "red">></span> international bodies
---
### Who has an interest or a responsibility to <span class = "red">disseminate</span> the data<span class = "red">?</span>
<span class = "red">\- \- \-</span>
creator, amalgamator, third party<span class = "red">...</span>
---
### At what level of <span class = "red">granularity</span> are the data available<span class = "red">?</span>
<span class = "red">\- \- \-</span>
Which variables are available? Are non-summarized values readily available<span class = "red">...</span>
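Note:
A minimal sketch of why granularity matters, assuming a made-up set of record-level emergency department visits. The hospital and health authority names, the `age` variable, and the `summarize` helper are all hypothetical and exist only for illustration. Each step of amalgamation keeps the counts but drops the variables below it.

```python
from collections import Counter

# Hypothetical record-level data: one row per emergency department visit.
# None of these names or values come from a real source.
visits = [
    {"hospital": "Hospital A", "health_authority": "Authority 1", "age": 34},
    {"hospital": "Hospital A", "health_authority": "Authority 1", "age": 71},
    {"hospital": "Hospital B", "health_authority": "Authority 1", "age": 52},
    {"hospital": "Hospital C", "health_authority": "Authority 2", "age": 19},
]


def summarize(records, level):
    """Count records at one level of amalgamation (illustrative helper)."""
    return dict(Counter(record[level] for record in records))


# Finer granularity: counts per hospital.
by_hospital = summarize(visits, "hospital")

# Coarser granularity: counts per health authority.
# The hospital and age variables are no longer recoverable from this table.
by_authority = summarize(visits, "health_authority")

print(by_hospital)
print(by_authority)
```

If only the health authority table is distributed, questions about individual hospitals or age groups cannot be answered from it.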
---
# Example <span class = "red">1</span>
---
**<span class = "red">Research</span> Question**
Are racialized people disproportionately victims of police involved deaths in Canada<span class = "red">?</span>
<span class = "red">\- \- \-</span>
**<span class = "red">Data</span> Access**
Are the data generated<span class = "red">?</span>
Are they saved<span class = "red">?</span>
Are they distributed<span class = "red">?</span>
Note:
* [CBC Deadly Force](https://newsinteractives.cbc.ca/features/2020/fatalpoliceencounters/)
* [Tracking (In)Justice](https://trackinginjustice.ca/)
---
### Links
<span>CBC Deadly Force</span> [](https://newsinteractives.cbc.ca/features/2020/fatalpoliceencounters/)
<span>Tracking (In)Justice</span> [](https://trackinginjustice.ca/)
---
# Example <span class = "red">2</span>
---
**<span class = "red">Research</span> Question**
What is the relationship between place of residence, industry of employment, job classification or salary, and cancer diagnoses in Canada<span class = "red">?</span>
<span class = "red">\- \- \-</span>
**<span class = "red">Data</span> Access**
Are the data generated<span class = "red">?</span>
Are they saved<span class = "red">?</span>
Are they distributed<span class = "red">?</span>
Note:
* Employment at time of diagnosis vs time of exposure
* BC Cancer Registry: http://www.bccancer.bc.ca/health-professionals/professional-resources/bc-cancer-registry
* Canadian Cancer Registry: https://www150.statcan.gc.ca/n1/en/surveys/3207
* WHO Global Cancer Observatory: https://gco.iarc.fr/
* Canadian Institute for Health Information: https://www.cihi.ca/en
* Census Data: https://www12.statcan.gc.ca/census-recensement/2021/dp-pd/index-eng.cfm
---
### Links
BC Cancer Registry [](http://www.bccancer.bc.ca/health-professionals/professional-resources/bc-cancer-registry)
Canadian Cancer Registry [](https://www150.statcan.gc.ca/n1/en/surveys/3207)
WHO Global Cancer Observatory [](https://gco.iarc.fr/)
Canadian Institute for Health Information [](https://www.cihi.ca/en)
Census Data [](https://www12.statcan.gc.ca/census-recensement/2021/dp-pd/index-eng.cfm)
---
# 'Types' of <span class = "red">Data</span>
---
<span class = "red">\- \- \-</span>
Administrative
Generated or Automated
Research
<span class = "red">\- \- \-</span>
---
## Administrative
Collected as part of administrative or governance processes.
<span class = "red">\- \- \-</span>
<span class = "red">*eg*</span> Census, enrollment counts, Emergency Department figures
---
## Generated
Collected through some form of automation.
<span class = "red">\- \- \-</span>
<span class = "red">*eg*</span> Traffic cameras (image), weather stations (textual/tabular), biometrics (hardware dependent)
---
## Research
Collected to address a research question.
<span class = "red">\- \- \-</span>
<span class = "red">*eg*</span> Sampling, heterogeneous across studies
---
# Data <span class = "red">Sources</span>
---
## Administrative <span class = "red">Data</span>
* <span>City of Kelowna Open Data Portal</span> [](https://www.kelowna.ca/city-services/city-maps-open-data/open-data-catalogue)
* <span>BC Data Catalogue</span> [](https://catalogue.data.gov.bc.ca/)
* <span>Canada Open Data Catalogue</span> [](https://search.open.canada.ca/opendata/)
    * <span>Includes Statistics Canada</span>
* <span>OECD</span> [](https://data.oecd.org/)
* <span>UN Data</span> [](https://data.un.org/default.aspx)
---
## Research <span class = "red">Data</span>
* <span>Publishers</span> *eg* <span>Dryad</span> [](https://datadryad.org/stash)
    * cleaned subsets, easiest through a publication data availability statement
* <span>Institutions</span> *eg* <span>Borealis</span> [](https://borealisdata.ca/)
    * publication support and lab / institution 'memory'
* <span>National</span> *eg* <span>FRDR / Lunaris</span> [](https://www.lunaris.ca/en)
    * Canadian administrative and research data
---
## Research Data <span class = "red">cont'd...</span>
* <span>Subject</span> *eg* <span>re3data</span> [](https://www.re3data.org/)
    * list of subject specific repositories
* <span>Generic</span> *eg* <span>Zenodo</span> [](https://zenodo.org/) <span>or OSF</span> [](https://osf.io/)
    * generic deposit, very messy!
---
## Other <span class = "red">Sources</span>
* <span>Institutionally purchased data</span> *eg* <span>Abacus</span> [](https://resources.library.ubc.ca/page.php?details=abacus-data-repository&id=1114)
* <span>Data Catalogues</span> *eg* <span>Data is Plural</span> [](https://www.data-is-plural.com/)
---
# Find a <span class = "red">Dataset</span>