<style>
.reveal ul { text-align: left; display: inherit; font-size: 70%; list-style: none; }
.reveal ul li { margin-left: -2em; }
.reveal ul li::before { content: "\2022"; color: #055c9d; font-weight: bold; display: inline-block; width: 1em; }
.reveal ul li ul { list-style: none; }
.reveal blockquote { font-size: 75%; box-shadow: none; color: #055c9d; }
.hyperlink { font-size: 60%; }
.reveal a{ color: #189ab4; }
</style>

# PUMFs

**Public Use Microdata Files**

Mathew Vis-Dunbar | Mathew.Vis-Dunbar@ubc.ca

ECON 225 | 2022-03-08

---

## Statistics Canada Products

Note:
* [Landing page](https://www150.statcan.gc.ca/n1/en/type/data)

----

## Summaries

* **Tables** summaries, subset of variables
* **Profiles** geographic region
* **Maps** Interactive visuals

<span class = "hyperlink">https://www150.statcan.gc.ca/n1/en/type/data</span>

Note:
Numbers have been crunched for you. The information is readily digestible and easily re-communicated. They usually allow some filtering. Great for a quick reference, or for other quick summations, like changes over time. They link you to the survey tool that was used, which is great for further inquiry.

Examples:
* [Youth admissions to correctional services, by Indigenous identity and sex](https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3510000701)
* [Average counts of young persons in provincial and territorial correctional services](https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=3510000301)

----

## Microdata

> The microdata used by CRDCN researchers come primarily from Statistics Canada Survey Master files.
>
> Increasingly, the Research Data Centres (RDCs) are repositories of administrative records from a variety of sources including tax, employment insurance, social assistance, and hospitalization records.

Note:
It would be nice to have something in between these two to facilitate ease of access to data allowing for unique analyses with more granular access.
* [https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)
* [https://crdcn.ca/publications-data/data/](https://crdcn.ca/publications-data/data/)

----

## Microdata

* Extremely granular, risk of identifying individuals or individual businesses.
* Highly restricted access.

<span class = "hyperlink">[https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)</span>

Note:
It would be nice to have something in between these two to facilitate ease of access to data allowing for unique analyses with more granular access.
* [https://www.statcan.gc.ca/en/microdata/data-centres/data](https://www.statcan.gc.ca/en/microdata/data-centres/data)
* [https://crdcn.ca/publications-data/data/](https://crdcn.ca/publications-data/data/)

----

## PUMFs

> Public use microdata files contain anonymized, non-aggregated data.

Note:
* [https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata](https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata)

----

## PUMFs

* Meticulously cleaned up to help prevent identification of participants.
* May be a subset.
* May be a sample.

Note:
* [https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata](https://www150.statcan.gc.ca/n1/en/type/data?MM=1#publicusemicrodata)

----

## Labour Force Survey

> The Labour Force Survey provides estimates of employment and unemployment ... [T]he LFS estimates are the first of the major monthly economic data series to be released.
> This [PUMF] contains non-aggregated data ... for users who prefer to do their own analysis by focusing on specific subgroups in the population or by cross-classifying variables that are not in our catalogued products.

<span class = "hyperlink">[LFS Microdata](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&SDDS=3701) | [LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)</span>

Note:
* [LFS Microdata](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&SDDS=3701)
* [LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)

----

### 2016 Census

> This Hierarchical File, 2016 Census Public Use Microdata File (PUMF) product provides access to non-aggregated data covering a sample of 1% of the Canadian households ...
>
> This comprehensive file is excellent tool for policy analysts, pollsters, social researchers...

<span class = "hyperlink">[2016 Census PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/98M0002X)</span>

Note:
* [2016 Census PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/98M0002X)

---

## Working with PUMFs

----

## Releases

* PUMFs are expensive to create and take time.
* When will 2021 Census PUMFs be released???
* Some are released regularly (LFS).
* Some are one-offs or only produced irregularly.

----

### Licensing

> Statistics Canada Open Licence governs the use of PUMFs.

<span class = "hyperlink">[Statistics Canada Open Licence](https://www.statcan.gc.ca/en/reference/licence)</span>

Note:
* [Statistics Canada Open Licence](https://www.statcan.gc.ca/en/reference/licence)
* It basically allows you to do as you want, so long as you give credit.

----

### Documentation

* PUMFs will have stand-alone documentation.
  * Usually in a readme and meta file with a dictionary.
* PUMF records will be connected with:
  * The source survey.
  * Related products (surveys, summaries, analyses, etc.).
* The survey documentation will go into greater detail.
<span class = "hyperlink">[LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X) | [LFS survey](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&Id=1503965)</span>

Note:
* [LFS PUMF](https://www150.statcan.gc.ca/n1/en/catalogue/71M0001X)
* [LFS survey](https://www23.statcan.gc.ca/imdb/p2SV.pl?Function=getSurvey&Id=1503965)

---

## Accessing PUMFs

Two portals

----

## Statistics Canada

* Easy to navigate, filter and follow connections.
* Not all downloads are available, esp. historical ones.
* Downloads are sometimes broken up.
* [Data Landing Page](https://www150.statcan.gc.ca/n1/en/type/data)
* [Statistical Program](https://www150.statcan.gc.ca/n1/en/type/surveys)

Note:
* The Data Landing Page omits census data when filtering.
* The Statistical Program page requires knowing that a PUMF exists.

----

## Abacus

> The Data Liberation Initiative (DLI) is a partnership between post-secondary institutions and Statistics Canada with the goal of improving access to data resources.

<span class = "hyperlink">[DLI information](https://www.statcan.gc.ca/en/microdata/dli)</span>

Note:
* [DLI information](https://www.statcan.gc.ca/en/microdata/dli)

----

## Abacus

* [UBC's portal for acquired data](https://resources.library.ubc.ca/page.php?details=abacus-data-repository&id=1114)
* Full access to all PUMFs, current and historical
* Not the easiest platform to navigate

Note:
* [DLI information](https://www.statcan.gc.ca/en/microdata/dli)

---

## Finding Data Generally

----

## General Approaches

* Who's responsible for
  * generating
  * collecting
  * aggregating
  * disseminating

Note:
* Simply because it's generated doesn't mean it's clean or accessible

----

## Deaths at the Hands of Police

<span class = "hyperlink">[CBC Deadly Force Database](https://newsinteractives.cbc.ca/features/2020/fatalpoliceencounters/) (2000 - 2020)</span>

<span class = "hyperlink">[Tracking (In)Justice](https://trackinginjustice.ca/) (2000 - Present)</span>

----

## Tracking (In)Justice

> The CBC data has been critiqued for lacking a transparent methodology. Additionally, policing scholars have noted that while certain practices in the collection of data by the CBC data meet journalistic standards, they may not meet scholarly research standards.

----

## Lunaris & FRDR

> FRDR provides powerful functionality to search for Canadian research data. This federated search tool aggregates metadata from numerous repositories, including datasets deposited in FRDR’s repository platform.

----

## Lunaris & FRDR

* Lunaris is the discovery layer
* FRDR is the database
* [Repository list](https://www.frdr-dfdr.ca/discover/html/repository-list.html?lang=en)

----

## Other Sources

* Organizations
* Municipal
* Provincial
* Federal
* International
  * UN Organizations (WHO)
  * OECD
* For fun: https://www.data-is-plural.com/