owned this note
owned this note
Published
Linked with GitHub
# Data Science Help Desk
**Most of this content has been moved to https://osf.io/h9vqt/wiki/home/ and https://osf.io/h9vqt/wiki/Weekly%20Office%20Hours/**
This week's writing hour all are invited to collaborate on envisioning and drafting a document that defines the data science help desk
## 2020-11-06
### Definition of the Data Science Help and consulting desk
#### Action Items
##### Phase 0: Get ready
- [ ] Need short descriptive blurb of what we're doing (Enable science in CALS), for whom we're doing it, and how we're doing it
- See [Mission/Goals/Outcomes](#MissionGoalsOutcomes-of-the-Help-Desk) below
- See [Target Audience](#Target-audience-CALS--AES-specific) below
- See [Mechanisms](#Mechanisms) (aka 'funnel') below
- Make clear what we do that nobody else does, and how we augment/complement CCT, ResBaz, UITS, Stats Consulting, Carpentries, etc.
- [ ] Name - is "help desk" the right term?
- consulting desk?
- data science consulting for researchers
- Focused on **computational and data-intensive (CDI) research**
##### Phase 1: Start engaging and planning
- [x] (Ongoing) Engage with CALS data science ambassadors
- [x] (Ongoing) Office hours
- [x] Ensure that somebody from DIAG or CALS Data Science Ambassador is at ResBaz Coffee&Code and/or Hacky Hour (roster/volunteer schedule)
- [x] Add Coffee&Code & Hacky Hour to [Data Science Institute events calendar](https://datascience.arizona.edu/events)
- [x] Tweet every week about this
- [ ] (Ongoing, regularly) Evaluate how the 'funnel' is working, and refine
- [ ] (Ongoing) Keep track of time spent on different projects/'engagements'
- To evaluate relative amount of effort different projects require, to avoid a few of them monopolizing our resources more than the benefits they bring
- Use data to justify more resources (We could help a larger number of promising projects)
- [ ] (Ongoing) Knowledge base: Start curating answers to common problems
- Idea: Can start with an OSF page, and eventually publish using something like ZenDesk
- [x] Plan Workshop/webinar for end of Q1 2021
- [x] CALS/Parker: Present plan
- Define metrics
- Future is uncertain, how we plan to adapt and learn
##### Phase 2: Commit and spread the word
- [x] Add link on https://cals.arizona.edu/industry-community/get-help
-can be done at CCT level
- [ ] Add description to UA ITS campus computing document
- [ ] Add link to [CALS CCT 'Find an Expert' page](https://cct.cals.arizona.edu/support#find-an-expert)
- [ ] Add link to [Data Science Institute resources page](https://datascience.arizona.edu/resources) under UA Resources
- [ ] Add link to [RDI Core Facilities](https://research.arizona.edu/facilities/search)
- Maybe. CyVerse & stats consulting are there.
- [ ] Offer 1 week support via https://research.cals.arizona.edu/cals-research-development-programs
- [ ] (Regularly) CALS Faculty orientation
- [ ] Announcement: Workshop/webinar (end of Q1 2021) (See 'Plan Workshop/webinar' in Phase 1)
##### Phase 3: Spread the word even more widely
- [ ] (Regularly) Start sending stories to [UArizona Research Stories](https://research.arizona.edu/stories)
- [ ] Data Science Institute [mailing lists](https://datascience.arizona.edu/resources)
- [ ] Announce this initiative on: data7-announce@list.arizona.edu
- [ ] Discuss collaborations on: data7-general@list.arizona.edu
- [ ] Advertise workshops & trainings: data7-events@list.arizona.edu
##### Phase 4: Workshop
- [x] Present workshop/webinar (end of Q1 2021)
- Use workshop as way to create new ongoing relationships, while spreading word further
---
#### Mission/Goals/Outcomes of the Help Desk
- **Mission** Enable science in CALS
- Need short descriptive blurb of what we're doing
**Metrics**
- co-authored and acknowledgment in publications
- data publications
- co-pi on grants
- grants that we contributed to writing of
- number of ad-hoc consultations
#### Expectations for acknowledgments etc
We offer many services. Typically shorter engagements don't warrant co-authorship. We will discuss this at the time of defining a scope of work and as it evolves. In these cases, we appreciate if you would acknowledge our help and cite our work, and let us help you and potentially co-author publication of software and data as standalone research objects. It also helps if you let us and others know how we have enabled your research; what we have made possible that you would not have otherwise been able to do (whether due to lack of knowledge, skills, time or other resources)
We use [PLOS definitions of contributions](https://journals.plos.org/plosone/s/authorship#loc-author-contributions) to define expectations for authorship. But developing software and providing data don't require co-authorship if you cite the software / data separately in your paper.
#### Target audience (CALS / AES specific)
* Grad students
* Postdocs
* Faculty
* USDA
#### Outreach
> Copied these to Action Items
> [name=julianp-diag]
- link on https://cals.arizona.edu/industry-community/get-help
- [x] add description to UA ITS campus computing document
- [x] offer 1 week support via https://research.cals.arizona.edu/cals-research-development-programs
- CALS Faculty orientation
- office hours
- monthly donuts
#### Mechanisms
* Mentoring & Apprenticeships
* Workshops
* Webinars
* Grant writing support
* Office hours ... align w/ ResBaz
* Engage with CALS data science ambassador
[RADICAL (Rutgers) - Mentions using GitHub to manage projects (manage expectations) and keeping track of effort](https://cpb-us-w2.wpmucdn.com/sites.udel.edu/dist/6/8980/files/2019/03/Best-Practices-for-Scalable-Scientific-Software-Development-A-RADICAL-Approach.pdf)
Roles and responsibilities from [Computing Success for Scientists](https://docs.google.com/document/d/10thdQaaaS3AuG7S8RSmBGDeuCPMYZGJSspG14WvjOSw/edit):
> **Roles and responsibilities:** each role is not necessarily exclusive of another and an individual may act in multiple roles.
>
> 1. Trainer: provide webinar, walkthrough, tutorial, and video solutions to address how to use available data with computing and visualizing resources; also provides training outreach
> 2. SME: knowledgeable in domain specific areas: including and not restricted to: data capture, data storage, data retention, data classification, data extraction, data transformation, computing workflows, data visualization, attribution, and domain-specific knowledge (such as agriculture, medical, etc).
> 3. Technical: has knowledge of what a technical solution needs as it relates to scientific endeavors; part of teams developing technical solutions to ensure scientific needs and issues are met and appropriately addressed; interprets scientific needs to technical requirements
> 4. Outreach: provides articles and publications; works with groups to develop best practices; works to increase the profile of the organization; identify new areas to investigate for potential for further development
---
## Background research
[Make a list of user types and paths to engagement for Data Science Helpdesk #339](https://github.com/az-digitalag/organization/issues/339)
### Find existing personas & pathways for DIAG & ResBaz
- [x] OSF (@emily)
- ~~GitHub (@emily)~~
- [x] OLS (@emily)
- [x] HackMD (@emily)
- [x] ResBaz Google Docs (@julianp-diag)
- [x] CALS mission & vision (@julianp-diag)
- [x] DIAG mission & vision (@julianp-diag)
### Links
- [CALS Strategic Planning](https://cals.arizona.edu/about/strategic-planning)
- OLS Mentorship [Personas & Pathways](https://docs.google.com/document/d/1NN3w2jWYvPaYVJ1IMONxnclYwjSmMV8eiAHkGreucOI/edit?usp=sharing) assignment
- OL4 [Personas & Pathways](https://docs.google.com/presentation/d/1ZrB9xLscF4hB7iNOEpzACtyKrkIEX_OsBWL2E_JhMqU/edit?usp=sharing) presentation and examples
- [Atmosphere Personas](https://docs.google.com/presentation/d/1DvNmwSXB1moJvxBN_JSMQu8MYrmNTPpK5rySCcl6AOM/edit?usp=sharing)
- [Drone Processing Pipeline - User Stories](https://hackmd.io/@schnaufer/B12Io5O58#User-Stories)
- [Drone Processing Pipeline - User Needs](https://hackmd.io/@schnaufer/B12Io5O58#User-Needs-amp)
- [Computing Success for Scientists - Shnaufer](https://docs.google.com/document/d/10thdQaaaS3AuG7S8RSmBGDeuCPMYZGJSspG14WvjOSw/edit)
- [CDI Best Practices - Xpert Network](https://sites.udel.edu/xpert-cdi/resources/best-practices/)
- [Workshop - CDI Best Practices - Xpert Network](https://sites.udel.edu/xpert-cdi/2019/08/06/ics-workshop/)
### Existing Resources
- [Presentation](https://docs.google.com/document/d/13avznphXRxPt_cc6glOPkXv1YMsp-vVUPZrBfNM90kc/edit?usp=sharing) from Austin Rutherford, former CALS DS Ambassador
- [Data Science Institute](https://datascience.arizona.edu/) provides programs and events, but for getting personalized help, they recommend Coffee & Code and Hacky Hour
- [Research Bazaar Arizona (ResBaz)](https://researchbazaar.arizona.edu/)
- [UA Libraries - Research Data Management Services](https://data.library.arizona.edu/)
- weekly help sessions (Python, R, GIS, Data Mgmt/Research Data Support)
- Data tools for management and publishing
- Software Carpentries
- Data Carpentry
- Mission: “Data Carpentry trains researchers in the core data skills for efficient, shareable, and reproducible research practices.”
- [CCT](https://cct.cals.arizona.edu/)
- their paid [services](https://cct.cals.arizona.edu/sites/cct.cals.arizona.edu/files/downloads/CCT%20Catalog%20of%20Services.pdf)
- IT [Services](https://it.arizona.edu/services)
- specifically those related to [research](https://it.arizona.edu/research)
- [TRIPODS](https://tripods.math.arizona.edu/) - events, workshops, seminars
- [CALS - Find an Expert](https://cals.arizona.edu/find-an-expert)
- [CALS - Industry and Community - Get Help](https://cals.arizona.edu/industry-community/get-help)
- [List](https://extension.arizona.edu/coronavirus-online-learning) of online learning opportunities
- [CALS - Resources for Researchers](https://research.cals.arizona.edu/)
- Pre-Award Proposals & Contracts
- Information & Tools for Researchers
- USDA NIFA Capacity Grants
- CALS Research Development Programs
- Internal UA Funding
- Compliance Offices
- Research Forms and Template
- For Inventors
- Tribal Projects
- Unmanned Aerial Vehicles
- [CALS - Partner With Us](https://cals.arizona.edu/industry-community/partner-with-us)
* [Carpentries](https://datacarpentry.org/lessons/)
* Ecology curriculum
* Genomics curriculum
* Social Sciences curriculum
* Geospatial data curriculum
* [Image Processing curriculum](https://datacarpentry.org/image-processing/)
* [Python for Atm and Ocean Sciences](https://carpentrieslab.github.io/python-aos-lesson/)
### Reference: CALS Strategic Goals
> #### Research
> - Build on **existing strengths** and identify strategic **new investment areas** to **maximize research achievement**.
> - **Optimize CALS research infrastructure** to support the CALS research mission.
> - Define and measure **resource generation** for research.
> - **Expand communication** on research activities.
> - Build tech transfer, **IP development**, external business relations/development.
### Reference: DIAG Mission & Vision
> #### Mission
>
> Provide open software, data, and computing to enable productive and sustainable agriculture.
> The mission of the UA ag data group is to provide scientists and engineers with open software, data, and computing that will allow more efficient discovery and invention so that we can engineer crops and manage sustainable agricultural landscapes that produce food, energy, and ecosystem services.
>
>
> #### Vision
>
> Faster and more collaborative agricultural science and engineering through shared software and data
> Our software will be used and collaboratively developed by researchers at major land grant universities, global agricultural research centers, and industry. We will enable scientists to spend less time engineering bespoke pipelines and collecting redundant data so that they can spend more time developing algorithms, augmenting existing data with strategic data collection, and analyzing data.
### Reference: UX@UA resources
- [Empathy Maps and Persona Templates](https://uxua.arizona.edu/templates)
- [Methods](https://uxua.arizona.edu/methods)
### Reference: ResBazAZ Personas
> #### Mariko
>
> Mariko is a first year computer science PhD student, who obtained an undergraduate degree in engineering in Japan. During her first semester, Mariko struggled to learn several programming concepts due to unclear and confusing instructions written in English, which is her second language. She doesn't feel comfortable to ask her classmates for help either, because they are quite competitive. One day in her computer science course, there is an anouncement made for a hands-on Python meetup for functional programming, which is exactly what she is trying to study. She brings her computer to the meetup and follows along the exercise. Since many people are asking questions, she feels comfortable to ask questions as well, and recieves some help with preparing her virtual environment. At the meetup, she also learns about ResBazAZ's PhTea and Hacky Hour events. Since then, she attends PhTea weekly to work on her homework assignments, and feels safe to ask for help when the English instructions aren't clear. Mariko feels much more confident in her learning when others can help her.
>
>
> #### Peter
>
> Peter is a 5th year PhD student studying Spanish literature, who has never programmed before because he has always struggled with math and assumes programming will be quite hard. For his dissertation, Peter is studying Don Quixote. One day his advisor sends him an interesting link from the Stanford Digital Humanities Lab, about a project showing the different moods associated with various cities in England in 19th century British literature in a beautiful data visualization. Peter has slowly been documenting the moods and themes associated with different Spanish cities in Don Quixote, but the process has been painful and slow. Peter also has no idea how to make a data visualization for his dissertation, like the one in the Stanford paper. While looking for resources around campus, Peter finds the ResBazAZ twitter page, which has several tweets every week inviting people to come get help with coding. He decides to give it a try and come to Hacky Hour, where he is surprised to find that many people are interested in his project. With the help of other friendly researchers, Peter is able to better search through Don Quixote, and also visualize the data. After the visualization is done, he frequently goes to ShutUpAndWrite for focused writing time on his dissertation.
>
>
> #### Rahim
>
> Rahim is an MBA student in Eller who is interested in tech startups, specifically those dealing with AI. For a business class, Rahim is assigned to research workers in his field of interest and he chooses to talk to AI researchers on campus, who would likely go a tech startup. After searching around, Rahim hears about a Tucson Data Science meetup, where the topic with be reinforcement learning. At the meeting, he sees code run in a Jupyter notebook for the first time -- and it looks really cool! He wants to try it as well. At the meetup, Rahim hears the ResBazAZ announcement and thinks that Hacky Hour would be a good place to interview the AI engineers, and perhaps also ask them how he can get started. After the meeting he asks if he can come to Hacky Hour for this, and recieves a warm welcome and invitation to bring his computer and start learning some hands on deep learning! In the end, Rahim completes his assignment of researching AI engineers, and recieves a lot of praise from his professor. Moreover, Rahim has a new interest in learning to program for himself, which he feels will help him to be a better management if he pursues a career in tech startups.