uc-ospo-logo
COLASC May 25
UC OSPO https://ucospo.net/
What is an OSPO?
Office to manage and support open source software (OSS) at an institution.
Enhances research, teaching, and community engagement through open source.
Who Benefits?
Finding Data
Tim Dennis, Director of Library Data Science Center
Stats 15: Introduction to Data Science, Dr. Gould
These notes: https://hackmd.io/@timdennis/H16PoWqCC
This presentation will cover:
Key concepts and terms used in data scholarship (e.g., DOI, data citation).
Important tools and resources for finding datasets.
Best practices for evaluating data quality.
Tim Dennis changed 8 months agoView mode Like Bookmark
Library Carpentry - Open Refine Exercises
Exercise 1: Download data:
Go to https://raw.githubusercontent.com/data-lessons/library-openrefine/gh-pages/data/doaj-article-sample.csv. Right click anywhere on the screen and choose Save As … The filename is doaj-article-sample.csv and it should be saved to Desktop.
Exercise 2: Create project
Open OpenRefine.
Click Create project then Choose File. Select doaj-article-sample.csv and then click Next. In the Parse text section, ensure separators are commas and first line is being parsed as a header. Then click Create project.
Exercise 2 (Alternate): Create project from web address
Open OpenRefine: typically located here: http://127.0.0.1:3333
Tim Dennis changed 9 months agoView mode Like Bookmark
Collaborative Lesson Development!
slides: https://hackmd.io/@timdennis/HJs_6ysOR
Welcome!
Congratulations 🎉 on your lesson proposal being selected for the Lessons for Librarians in Open Science
Who are we?
Tim Dennis :fleur_de_lis: , Program Director
Zhiyuan Yao :world_map:, Program Coordinator
Tim Dennis changed 10 months agoSlide mode Like Bookmark
Library Data Science Center
Room 21536, Young Research Library
datascience@ucla.edu
Support Request: https://goo.gl/forms/xKsELsoVa1SJaUq72
These Slides: https://hackmd.io/p/Bk0MPnqjX#
Data Science Center
DSC webpage
Tim Dennis changed a year agoSlide mode Like Bookmark
Tidy Tues :heartpulse: Data Week
Who are we?
Tim Dennis, Director Library Data Sci Center (DSC), UCLA
Leigh Phan, Data Scientist, Library Data Sci Center (DSC), UCLA
Reid Otsuji, Data Curation Specialist and Faculty Liaison Librarian for Research Data Curation, UCSD
What is Tidy Tuesday
A weekly data project for the R community. The goal of TidyTuesday is to apply your R skills, get feedback, explore other’s work, and connect with the greater #RStats community! We encourage everyone of all skills to participate!
Tim Dennis changed 3 years agoSlide mode Like Bookmark
UC - Collaborative Carpentries
slide content from: UC Libraries Forum Preso https://bit.ly/UCCollab
Why?
Local workshops have a small pool of instructors and in-person workshop attendance is limited to about ~30 students at a time
In person workshops usually fill up quickly and have large waitlists
Putting on a workshop is very time intensive
Shift to entirely virtual instruction beginning March 2020
This allowed more instructors from other campuses to teach without needing to travel
Tim Dennis changed 3 years agoSlide mode Like Bookmark
Version control w/the Command Line Git and GitHub
BITSS Research Transparency and Reproducibility Training
Tim Dennis (UCLA)
I am the Director of Library Data Science Center at UCLA
I am also a Carpentries instructor
Slides: https://hackmd.io/@timdennis/HJ0a0nuBD
Etherpad: https://pad.carpentries.org/2021-rt2
Tim Dennis changed 4 years agoSlide mode Like Bookmark
Create a Repository
Login to GitHub: https://github.com/
On https://github.com/ click on the + button, top right corner
Name the repository "hello-world"
GitHub will ask if you want to add a README.md, license or a .gitignore file. Do not do any of that for now.
Undoing Commits & Changes
What questions are there from the first part?
Tim Dennis changed 4 years agoView mode Like Bookmark
The Data Science Center's mission is to foster a welcoming research community by developing data literacy and foundational coding skills through consulting and education. We support researchers & students throughout the full data life cycle of activities: planning research projects and data management plans; data acquisition, storage, cleaning, and usage; analysis; publication; curation; and preservation.
We:
Provide support for full data science life-cycle
Build data science infrastructure that promotes sharing, data use and life-cycle management
Provide support for computational reproducibility and discovery
Deliver & contribute to data science instructional and curricula support
Provide support and referrals for researchers needing to work at scale computationally (beyond the laptop)
Tim Dennis changed 4 years agoView mode Like 1 Bookmark
California Libraries Carpentries Meetup
These slides: https://hackmd.io/@timdennis/cali-carps
View mode: https://hackmd.io/xA426D4XQuasjNRqKq7X2A?view
Today:
Contributing to existing lessons
Lesson Development
Carpentries Incubator
Tim Dennis changed 4 years agoSlide mode Like Bookmark
Accessing Databases via Web APIs
What is an API?
API stands for Application Programming Interface
a set of rules and procedures that facilitate interactions between computers and their applications
Note:
We can consider an API a set of rules and procedures that allow computers to interact with applications
Tim Dennis changed 4 years agoSlide mode Like Bookmark
Register for the New York Times API
Steps are here: https://developer.nytimes.com/get-started
But you will be:
Creating an account (if you don't have one already) at https://developer.nytimes.com/
After you are logged in, starting from https://developer.nytimes.com/, navigate to https://developer.nytimes.com/my-apps
Add new app
Add a name for your app, e.g. 'ucla api workshop'
Select Enable by the Article search API
Select generate
Tim Dennis changed 4 years agoView mode Like Bookmark
Intro to Python for Librarians!
slide: https://hackmd.io/@timdennis/lc-python#/
Etherpad
Please use the Etherpad to take notes/share code, ask questions and explore
https://pad.carpentries.org/2021-ucla-python
Why are we here?
Tim Dennis changed 4 years agoSlide mode Like Bookmark
slides: https://hackmd.io/p/SJhSRzUo4#/
Who are we?
Why Git/GitHub?
Code & data sharing and collaboration
Democratic databases: science on GitHub
Scientists are turning to a software–development site to share data and code
Tim Dennis changed 4 years agoSlide mode Like Bookmark
Jargon Busting (20-30 min total, 5 min in breakout room)
This exercise is an opportunity to gain a firmer grasp on the concepts around data, code or software development in libraries.
We'll group you into breakout rooms of 3-4.
Select someone in your room to take notes in the etherpad for your group.
Talk for three minutes on any terms, phrases, or ideas around code, data or technical aspects of development in libraries that you've come across and perhaps feel you should know better.
Each group adds 3 terms, phrases or ideas to the class etherpad.
Feel free to provide definitions or explanations on other groups terms if you are knowledgeable about them. (The idea is we can build a glossary for the class).
The instructors will group the terms on the etherpad and facilitate a discussion about what we will cover today and where you can go for help on those things we won't cover.
Tim Dennis changed 4 years agoSlide mode Like Bookmark