# DSCI 310: Class Collaborative Notes
## 2023-04-11
https://github.com/chendaniely/dissertation
## 2023-01-31
Insert your name here (one name/person per line), with your choice of t-shirt (S or 2XL) or water bottle:
Jan (Water bottle!)
Abheet
Jeff
Ray
Jiaying
Miles
Justin W
## 2023-01-31
Insert your name here (one name/person per line):
Jan
Rithika
Angela
Jinghan
Zicheng Zhao (Rocky)
Eric Huang
Kashish Joshipura
Ji
Ethan Pang
Justin Wong
Enoch Cheung
Jiaying Liao
Amy the TA
Tony the TA
So Nakamura
Winnie
## 2023-01-27
### Group Project GitHub Repository URLs
Please paste the entire GitHub URL to your project here (should start with https):
- group 01: https://github.com/jw327/dsci-310-group-01
- group 02: https://github.com/kashish1928/dsci-310-group-02
- group 03: https://github.com/karlie-tr/dsci-310-group-03
- group 04: https://github.com/annabelle-ep/dsci-310-group-04
- group 05: https://github.com/DaliaAh/dsci-310-group-05.git
- group 06:
- group 07: https://github.com/wxw1026/dsci-310-group-07
- group 08: https://github.com/rehan13/dsci-310-group-08-AKRM.git
- group 09: https://github.com/jennalenoble/dsci-310-group-09
- group 10: https://github.com/miniatureseal/dsci-310-group-10-GCC
- group 11: https://github.com/sgskung/dsci-310-group-11
- group 12: https://github.com/calamari99/dsci-310-group-12-dreamteam
- group 13: https://github.com/chris-ckh/dsci-310-group-13
---
- group 14: https://github.com/rhysong/dsci-310-group-14
- group 15:
- group 16: https://github.com/mjbrodie/dsci-310-group-16.git
- group 17: https://github.com/erliuu/dsci-310-group-17
- group 18: https://github.com/Dokkaebi10/dsci-310-group-18
## 2023-01-10
> In the next 10 minutes, I want you to think and write down a non-reproducible or non-auditable workflow you have used before at work, on a personal project, or in course work, that negatively impacted your work somehow (make sure to include this in the story).
1. In one of my data science courses, my teammates and I struggled to load the data from shared Jupyter notebooks when the files were saved locally. This created a lot of issues, as we always had to change the path to the data file before we could load it and run the code.
2. I did not, and accidentally messed up the whole process (code, plots). Fixing it took me almost double the time it took to write the initial project.
3. In a bioinformatics-focused machine learning project that I was doing with a couple of other students, we were evaluating different models such as kNN, SVM, and a few others for classification. We were looking at measures such as accuracy, recall
4. In my research assistant position, we established our analysis based on just one person's run of our data. While writing the analysis section of our paper and trying to report the statistical results, we realized that for some reason my models would produce a bunch of singular fits and our stats didn't exactly match up.
5. While working at a small biotech company, I was tasked with updating code and dependencies for a sequencing analysis pipeline. Although the code was version controlled, it was poorly documented and commented, and did not follow proper conventions. This resulted in significant difficulty in reproducing
6. In DSCI 100, we struggled to find the synchronized progress on each
7. For a personal project, I did not write comments noting which sources I got the data from, so when I returned to the code later, I had to search for where the data originated.
8. I couldn't get the same result after running the Jupyter notebook from my teammates because the versions on our computers were not the same.
9. I copied a few pieces of code that were t... which made it difficult to... I started coding on a Unity project on my Mac before making the project compatib
10. In a personal project, I took an extended break from working on it. Because of that break, I completely forgot where I was and what I was doing, so instead of trying to figure out where I was in the code, I decided to scrap it all and completely restart.
11. I didn't clear temporary files, which changed code behaviour on different runs and fi
12. In my first year, I took a course called CPSC 210, which had an individual project component. Similar to DSCI 310, the project was divided into 3-4 components or phases. For phase 3, I was required to implement JSON files in my project. However, while working on the third phase, I forgot to push the changes to GitHub. Later, I needed to access the changes I had made earlier, which took me a lot of time since I wasn't sure how to access the old commits and had not saved my work often. Thanks to the TAs, I was eventually able to sort
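
Several stories above come down to data paths that only work on one person's computer. One possible way around this, sketched here in Python, is to build paths from the project root instead of typing an absolute path. This is only a sketch under assumptions: the helper names `find_project_root` and `data_path` are hypothetical, and it assumes the project is a git repository, so a `.git` entry marks the root.

```python
from pathlib import Path
from typing import Optional


def find_project_root(start: Path, marker: str = ".git") -> Path:
    """Walk upward from `start` until a directory containing `marker` is found."""
    start = start.resolve()
    for candidate in [start, *start.parents]:
        if (candidate / marker).exists():
            return candidate
    raise FileNotFoundError(f"No directory containing {marker!r} at or above {start}")


def data_path(relative: str, start: Optional[Path] = None, marker: str = ".git") -> Path:
    """Build a path to a file relative to the project root (hypothetical helper)."""
    # Default to the current working directory, e.g. the folder a notebook runs in.
    root = find_project_root(start if start is not None else Path.cwd(), marker)
    return root / relative
```

In a notebook, a call such as `data_path("data/raw.csv")` (a made-up file name) would then resolve to the same file inside each teammate's clone, wherever the repository lives on their machine.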