---
tags: gps
---
# 2022-GPS-Data-Skills-Course-Intro-to-Data-Management
## Sign in here:
YaohongWang yaw045@ucsd.edu
Hung-Yang (Jason) Chien, hchien@ucsd.edu
John Kim, jok015@ucsd.edu
Jaeyeon Park, jap013@ucsd.edu
Kejun Chen, kec007@ucsd.edu
Meghan Mattioli, mazavala@ucsd.edu
TsuPing Wang, tsw002@ucsd.edu
Sam Cohen, szcohen@ucsd.edu
Nick Heimann, nheimann@ucsd.edu
Emily Davalos, edavalos@ucsd.edu
Salma Shaikh, sshaikh@ucsd.edu
Chenhao Nie. cnie@ucsd.edu
Ziyuan Zhu, ziz063@ucsd.edu
Sora Park, sop006@ucsd.edu
Tino Tirado, ttirado@ucsd.edu
Haoran Jiang, haj005@ucsd.edu
David Reimer, dreimer@ucsd.edu
Junhui Xu, jux008@ucsd.edu
Marissa Myers, maclan@ucsd.edu
Jeffrey Myers, jmyers@ucsd.edu
Wenjun Gong, w7gong@ucsd.edu
Alayna Bone, abone@ucsd.edu
Elise Spencer, Enspencer@ucsd.edu
Yuting Wan y7wan@ucsd.edu
Yue Yu yuy039@ucsd.edu
Emily Irion, eirion@ucsd.edu
Amanda Lee-Low, aleelow@ucsd.edu
Rebecca Howard, r1howard@ucsd.edu
Bing Rethy, brethy@ucsd.edu
Bonnie Devenney, bdevenne@ucsd.edu
Chengan Li, chl030@ucsd.edu
Mariya Nikserest, mniksere@ucsd.edu
Yuki Imura, yimura@ucsd.edu
morgan cohen, m7cohen@ucsd.edu
Yizhuo Liu, yil118@ucsd.edu
Meiyu Su, m2su@ucsd.edu
Chuyu Liu, chl082@ucsd.edu
Nikki Qi, haqi@ucsd.edu
Rawlins, Mackenna, mjrawlins@ucsd.edu
Qihan huang q7huang@ucsd.edu
Sunny Xu, qixu@ucsd.edu
Patricia Resurreccion, paresurr@ucsd.edu
Xinyi Du , x8du@ucsd.edu
Shuting Wang, shw009@ucsd.edu
Brenna Wayne bwayne@ucsd.edu
Yuting Wan y7wan@ucsd.edu
Zhibei Wang, zhw048@ucsd.edu
Broderick Topil, btopil@ucsd.edu
Merik Manzano, mmanzano@ucsd.edu
Austin Brown, aubrown@ucsd.edu
Nicholas Valle, njvalle@ucsd.edu
Vorathip Plengpanit, vplengpa@ucsd.edu
Stevinson Tendon, stendon@ucsd.edu
Dan Bee Lee, dbl001@ucsd.edu
Zizan Wang, ziw011@ucsd.edu
Kelli Maples, kmaples@ucsd.edu
Yuchen Wang, yuw147@ucsd.edu
Hyun Ji Jung, hjjung@ucsd.edu
Collin Boudreaux, cboudreaux@ucsd.edu
Yunxin Liu, yul188@ucsd.edu
Kevin Zhou, kezhou@ucsd.edu
## Collaborative Notes:
# Git lesson Tuesday 2/22
## please sign in
### full name, email address
John Kim, jok015@ucsd.edu
Sora Park, sop006@ucsd.edu
Yuki Imura. yimura@ucsd.edu
Mariya Nikseresht, mniksere@ucsd.edu
Jaeyeon Park, jap013@ucsd.edu
Patricia Resurreccion, paresurr@ucsd.edu
Nick Heimann, nheimann@ucsd.edu
Bonnie Devenney, bdevenne@ucsd.edu
Bing Rethy, brethy@ucsd.edu
Chenhao Nie. cnie@ucsd.edu
Marissa Myers, maclan@ucsd.edu
Rebecca Howard, r1howard@ucsd.edu
Emily Davalos, edavalos@ucsd.edu
Chengan Li, chl030@ucsd.edu
Meiyu Su, m2su@ucsd.edu
morgan cohen, m7cohen@ucsd.edu
Yuchen Wang, yuw147@ucsd.edu
Yizhuo Liu, yil118@ucsd.edu
Wenjun Gong, w7gong@ucsd.edu
Salma Shaikh, sshaikh@ucsd.edu
Zhibei Wang, zhw048@ucsd.edu
Emily Irion, eirion@ucsd.edu
Dan Bee Lee, dbl001@ucsd.edu
Junhui Xu, jux008@ucsd.edu
Sunny Xu, qixu@ucsd.edu
Austin Brown, aubrown@ucsd.edu
Yue Yu yuy039@ucsd.edu
Hyun Ji Jung, hjjung@ucsd.edu
Chuyu Liu, chl082@ucsd.edu
Nicholas Valle, njvalle@ucsd.edu
Vorathip Plengpanit, vplengpa@ucsd.edu
Yuting Wan y7wan@ucsd.edu
Zizan Wang, ziw011@ucsd.edu
Rawlins, Mackenna, mjrawlins@ucsd.edu
Alayna Bone, abone@ucsd.edu
Broderick Topil, btopil@ucsd.edu
Jeffrey Myers, jmyers@ucsd.edu
Ziyuan Zhu,ziz063@ucsd.edu
Yaohong Wang,yaw045@ucsd.edu
Brenna Wayne bwayne@ucsd.edu
Tino Tirado, ttirado@Ucsd.edu
Elise Spencer, Enspencer@ucsd.edu
Shuting Wang, shw009@ucsd.edu
Sam Cohen, szcohen@ucsd.edu
Kelli Maples, kmaples@ucsd.edu
Amanda Lee-Low, aleelow@ucsd.edu
TsuPing Wang, tsw002@ucsd.edu
Collin Boudreauxc, cboudreaux@ucsd.edu
Meghan Mattioli, mazavala@ucsd.edu
43
## Collaborative Notes:
43
# Sign in here
John Kim, jok015@ucsd.edu
Emily Davalos, edavalos@ucsd.edu
Yizhuo Liu, yil118@ucsd.edu
stevinson tendon, stendon@ucsd.edu
Sora Park, sop006@ucsd.edu
Wenjun Gong, w7gong@ucsd.edu
Patricia Resurreccion, paresurr@ucsd.edu
Nick Heimann, nheimann@ucsd.edu
Collin Boudreaux, cboudreaux@ucsd.edu
Sam Cohen, szcohen@ucsd.edu
Bonnie Devenney, bdevenne@ucsd.edu
Zhibei Wang, zhw048@ucsd.edu
Amanda Lee-Low, aleelow@ucsd.edu
Bing Rethy, brethy@ucsd.edu
Chenhao Nie. cnie@ucsd.edu
austin brown, aubrown@ucsd.edu
Rebecca Howard, r1howard@ucsd.edu
Emily Irion, eirion@ucsd.edu
Meiyu Su, m2su@ucsd.edu
Chuyu Liu, chl082@ucsd.edu
yaohong wang, yaw045@ucsd.e du
Brenna Wayne bwayne@ucsd.edu
Jaeyeon Park, jap013@ucsd.edu
Jeffrey Myers, jmyers@ucsd.edu
Marissa Myers, maclan@ucsd.edu
Dan Bee Lee, dbl001@ucsd.edu
Mariya Nikseresht, mniksere@ucsd.edu
alayna bone, abone@ucsd.edu
Nicholas Valle, njvalle@ucsd.edu
Vorathip Plengpanit, vplengpa@ucsd.edu
Sunny Xu, qixu@ucsd.edu
Junhui Xu, jux008@ucsd.edu
Rawlins, Mackenna, mjrawlins@ucsd.edu
Elise Spencer, Enspencer@ucsd.edu
Zizan Wang, ziw011@ucsd.edu
yue yu yuy039@ucsd.edu
Salma Shaikh, sshaikh@ucsd.edu
Meghan Mattioli, mazavala@ucsd.edu
morgan cohen, m7cohen@ucsd.edu
Yuki Imura, yimura@ucsd.edu
Kelli Maples, kmaples@ucsd.edu
Broderick Topil, btopil@ucsd.edu
Hyun Ji Jung, hjjung@ucsd.edu
Kevin Zhou, kezhou@ucsd.edu
TsuPing, Wang, tsw002@ucsd.edu
Shuting Wang, shw009@ucsd.edu
# collabrative notes Tidy data
* Keep Raw data Raw!
* Keep record while cleaning data
* Put all variables in columns !
* put observations in it's own row
* don't combine multiple pieces of info in one cell
* export to format like csv
Take a look at the messy data. Describe how we would clean it up.
Identify what is wrong with this spreadsheet.
Discuss or try the steps you would need to take to clean up the spreadsheet, and to put data all together in one spreadsheet.
https://openrefine.org
library data sets:
https://ucsd.libguides.com/data-statistics
46
# Sign in here: name, email
morgan cohen, m7cohen@ucsd.edu
Jaeyeon Park, jap013@ucsd.edu
John Kim, jok015@ucsd.edu
Yizhuo Liu, yil118@ucsd.edu
zizan Wang, ziw011@ucsd.edu
Yuki Imura, yimura@ucsd.edu
Emily Irion, eirion@ucsd.edu
Rebecca Howard, r1howard@ucsd.edu
Sunny Xu,qixu@ucsd.edu
Mariya Nikseresht, mniksere@ucsd.edu
Bonnie Devenney, bdevenne@ucsd.edu
Nick Heimann, nheimann@ucsd.edu
Zhibei Wang, zhw048@ucsd.edu
Nicholas Valle, njvalle@ucsd.edu
Wenjun Gong, w7gong@ucsd.edu
Stevinson Tendon, stendon@ucsd.edu
Vorathip Plengpanit, vplengpa@ucsd.edu
Patricia Resurreccion, paresurr@ucsd.edu
Rawlins, Mackenna, mjrawlins@ucsd.edu
Salma Shaikh, sshaikh@ucsd.edu
Chenhao Nie. cnie@ucsd.edu
Broderick Topil, btopil@ucsd.edu
Emily Davalos, edavalos@ucsd.edu
Tino Tirado, ttirado@ucsd.edu
Meghan Mattioli, mazavala@ucsd.edu
Jeffrey Myers, jmyers@ucsd.edu
Marissa Myers, maclan@ucsd.edu
Dan Bee Lee, dbl001@ucsd.edu
Chuyu Liu, chl082@ucsd.edu
Yuting Wan y7wan@ucsd.edu
Amanda Lee-Low, aleelow@ucsd.edu
Collin Boudreaux, cboudreaux@ucsd.edu
Alayna Bone, abone@ucsd.edu
Meiyu Su, m2su@ucsd.edu
Bing Rethy, brethy@ucsd.edu
Elise Spencer, Enspencer@ucsd.edu
Kelli Maples, kmaples@ucsd.edu
Brenna Wayne bwayne@ucsd.edu
Hyun Ji Jung, hjjung@ucsd.edu
Sora Park, sop006@ucsd.edu
Kevin Zhou , kezhou@ucsd.edu
Junhui Xu, jux008@ucsd.edu
Sam Cohen, szcohen@ucsd.edu
41
# SQL Notes here:
SQL realtional database
IDs link tables
Primary key
foreign keys
Data download:
https://figshare.com/articles/dataset/Portal_Project_Teaching_Database/1314459
SQL phrases:
SELECT
FROM
41
# 3/3/2022
# SQL Day 2 - Last class for lesson for the Data Managment module.
# Sign In here
John Kim, jok015@ucsd.edu
Nicholas Valle, njvalle@ucsd.edu
Meghan Mattioli, mazavala@ucsd.edu
Wenjun Gong, w7gong@ucsd.edu
Vorathip Plengpanit, vplengpa@ucsd.edu
Rebecca Howard, r1howard@ucsd.edu
Sunny Xu, qixu@ucsd.edu
Sora Park, sop006@ucsd.edu
Broderick Topil, btopil@ucsd.edu
Yizhuo Liu, yil118@ucsd.edu
Jeffrey Myers, jmyers@ucsd.edu
Emily Davalos, edavalos@ucsd.edu
Collin Boudreaux, cboudreaux@ucsd.edu
Marissa Myers, maclan@ucsd.edu
Meiyu Su, m2su@ucsd.edu
Emily Irion, eirion@ucsd.edu
Jaeyeon Park, jap013@ucsd.edu
Yue Yu yuy039@ucsd.edu
Salma Shaikh, sshaikh@ucsd.edu
Bonnie Devenney, bdevenne@ucsd.edu
stevinson tendon, stendon@ucsd.edu
Amanda Lee-Low, aleelow@ucsd.edu
Nick Heimann, nheimann@ucsd.edu
Zhibei Wang, zhw048@ucsd.edu
morgan cohen, m7cohen@ucsd.edu
Sam Cohen, szcohen@ucsd.edu
Junhui Xu, jux008@ucsd.edu
Chuyu Liu, chl082@ucsd.edu
Tino Tirado, ttirado@ucsd.edu
Yuting Wan y7wan@ucsd.edu
Kevin Zhou, kezhou@ucsd.edu
Elise Spencer, Enspencer@ucsd.edu
Hyun Ji Jung, hjjung@ucsd.edu
Mariya Nikseresht, mniksere@ucsd.edu
Kelli Maples, kmaples@ucsd.edu
TsuPing Wang, tsw002@ucsd.edu
Alayna Bone, abone@ucsd.edu
Dan Bee Lee, dbl001@ucsd.edu
Rawlins, Mackenna mjrawlins@ucsd.edu
Yuki Imura, yimura@ucsd.edu
Zizan Wang, ziw011@ucsd.edu
Patricia Resurreccion, paresurr@ucsd.edu
Brenna Wayne bwayne@ucsd.edu
```sql=
select *
from surveys where year=2000;
```
#replace null with "U"
```sql=
select species_id, sex, coalesce(sex, "U") #place U for unknown
from surveys;
```
Creating tables:
Creating a View: #like a subset
```sql=
create TABLE surveys_bookmark AS
select species_id, sex
from surveys
where year=2000;
```
```sql=
create view summer_2000 as
select *
from surveys
where year=2000 and (month >4 and month <10);
select * from summer_2000;
```
```sql=
select sum(weight), count(weight)/count(weight)
from summer_2000
where species_id="PE";
```
````sql
select *
from surveys
join species
on surveys.species_id = species.species_id;
```
#another way to inner join
```sql=
select *
from surveys
join species
using(species_id);
```
```sql=
select surveys.year, surveys.month,surveys.day,species.genus, species.species
from surveys
join species
using(species_id);
```
```sql=
```