owned this note
owned this note
Published
Linked with GitHub
---
title: ECON 225
tags: presentation
slideOptions:
theme: serif
transition: 'fade'
# parallaxBackgroundImage: 'https://s3.amazonaws.com/hakim-static/reveal-js/reveal-parallax-1.jpg'
---
<style>
.reveal ul {
text-align: left;
display: inherit;
font-size: 70%;
list-style: none;
}
.reveal ul li {
margin-left: -2em;
}
.reveal ul li::before {
content: "\2022";
color: #055c9d;
font-weight: bold;
display: inline-block;
width: 1em;
}
.reveal ul li ul {
list-style: none;
}
.reveal blockquote {
font-size: 75%;
box-shadow: none;
color: #055c9d;
}
.hyperlink {
font-size: 60%;
}
.reveal a{
color: #189ab4;
}
</style>
# PUMFs
**Public Use Microdata Files**
Mathew Vis-Dunbar | Mathew.Vis-Dunbar@ubc.ca
ECON 225 | 2022-03-08
---
## Statistics Canada Products
Note:
* [Landing page](https://www150.statcan.gc.ca/n1/en/type/data)
----
## Summaries
* **Tables** summaries, subset of variables
* **Profiles** geographic region
* **Maps** Interactive visuals
<span class = "hyperlink">https://www150.statcan.gc.ca/n1/en/type/data</span>
Note:
Numbers have been crunched for you. The information is readily digestible and easily re-communicated. They usually allow some filtering. Great for a quick reference, or for other quick summations, like changes over time.
They link you to the survey tool that was used, which is great for further inquiry.
Examples:
* [Youth admissions to correctional services, by Indigenous identity and sex](https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3510000701)
* [Average counts of young persons in provincial and territorial correctional services](https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3510000301)
----
## Microdata
> The microdata used by CRDCN researchers come primarily from Statistics Canada Survey Master files.
>
> Increasingly, the Research Data Centres (RDCs) are repositories of administrative records from a variety of sources including tax, employment insurance, social assistance, and hospitalization records.
Note:
It would be nice to have something in between these two to facilitate ease of access to data allowing for unique analyses with more granular access.
* [https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)
* [https://crdcn.ca/publications-data/data/](https://crdcn.ca/publications-data/data/)
----
## Microdata
* Extremely granular, risk of identifying individuals or individual businesses.
* Highly restricted access.
<span class = "hyperlink">[https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)</span>
Note:
It would be nice to have something in between these two to facilitate ease of access to data allowing for unique analyses with more granular access.
* [https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)
* [https://crdcn.ca/publications-data/data/](https://crdcn.ca/publications-data/data/)
----
## PUMFs
> Public use microdata files contain anonymized, non-aggregated data.
Note:
* [https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata](https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata)
----
## PUMFs
* Meticulously cleaned up to help prevent identification of participants.
* May be a subset.
* May be a sample.
Note:
* [https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata](https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata)
----
## Labour Force Survey
> The Labour Force Survey provides estimates of employment and unemployment ... [T]he LFS estimates are the first of the major monthly economic data series to be released.
> This [PUMF] contains non-aggregated data ... for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products.
<span class = "hyperlink">[LFS Microdata](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&SDDS=3701) | [LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)</span>
Note:
* [LFS Microdata](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&SDDS=3701)
* [LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)
----
### 2016 Census
> This Hierarchical File, 2016 Census Public Use Microdata File (PUMF) product provides access to non-aggregated data covering a sample of 1% of the Canadian households ...
>
> This comprehensive file is excellent tool for policy analysts, pollsters, social researchers...
<span class = "hyperlink">[2016 Census PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/98M0002X)</span>
Note:
* [2016 Census PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/98M0002X)
---
## Working with PUMFs
----
## Releases
* PUMFs are expensive to create and take time.
* When will 2021 Census PUMFs be released???
* Some are released regularly (LFS).
* Some are one offs or only produced irregularly.
----
### Licencing
> Statistics Canada Open Licence governs the use of PUMFs.
<span class = "hyperlink">[Statistics Canada Open Licence](https://www.statcan.gc.ca/en/reference/licence)</span>
Note:
* [Statistics Canada Open Licence](https://www.statcan.gc.ca/en/reference/licence)
* It basically allows you to do as you want, so long as you give credit.
----
### Documentation
* PUMFs will have stand alone documentation.
* Usually in a readme and meta file with a dictionary.
* PUMF records will be connected with:
* The source survey.
* Related products (surveys, summaries, analyses etc).
* The survey documentation will go into greater detail.
<span class = "hyperlink">[LSF PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X) | [LFS survey](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&Id=1503965)</span>
Note:
* [LSF PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)
* [LFS survey](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&Id=1503965)
---
## Accessing PUMFs
Two portals
----
## Statistics Cananda
* Easy to navigate, filter and follow connections.
* Not all downloads are available, esp historical
* Downloads are sometimes broken up
* [Data Landing Page](https://www150.statcan.gc.ca/n1/en/type/data)
* [Statistical Program](https://www150.statcan.gc.ca/n1/en/type/surveys)
Note:
* The Data landing page omits census data when filtering
* The Statisical Program page requires knowing that a PUMF exists.
----
## Abacus
> The Data Liberation Initiative (DLI) is a partnership between post-secondary institutions and Statistics Canada with the goal of improving access to data resources.
<span class = "hyperlink">[DLI information](https://www.statcan.gc.ca/en/microdata/dli)</span>
Note:
* [DLI information](https://www.statcan.gc.ca/en/microdata/dli)
----
## Abacus
* [UBC's portal for acquired data](https://resources.library.ubc.ca/page.php?details=abacus-data-repository&id=1114)
* Full access to all PUMFs current and historical
* Not the easiest platform to navigate
Note:
* [DLI information](https://www.statcan.gc.ca/en/microdata/dli)
---
## Finding Data Generally
----
## General Approaches
* Who's responsible for
* generating
* collecting
* aggregating
* disseminating
Note:
* Simply because it's generated doesn't mean it's clean or accessible
----
## Deaths at the Hands of Police
<span class = "hyperlink">[CBC Deadly Force Database](https://newsinteractives.cbc.ca/features/2020/fatalpoliceencounters/) (2000 - 2020)</span>
<span class = "hyperlink">[Tracking (In)Justice](https://trackinginjustice.ca/) (2000 - Present)</span>
----
## Tracking (In)Justice
> The CBC data has been critiqued for lacking a transparent methodology. Additionally, policing scholars have noted that while certain practices in the collection of data by the CBC data meet journalistic standards, they may not meet scholarly research standards.
----
## Lunaris & FRDR
> FRDR provides powerful functionality to search for Canadian research data. This federated search tool aggregates metadata from numerous repositories, including datasets deposited in FRDR’s repository platform.
----
## Lunaris & FRDR
* Lunaris is the discovery layer
* FRDR is the database
* [Repository list](https://www.frdr-dfdr.ca/discover/html/repository-list.html?lang=en)
----
## Other Sources
* Organizations
* Municipal
* Provincial
* Federal
* International
* UN Oragnizations (WHO)
* OECD
* For fun: https://www.data-is-plural.com/