#### Recommendations of the #### Research Data Services Task Force <p></p> <p> </p> Presentation to Cabinet November 6, 2018 Tim Dennis --- ### Research Data Services Task Force Report #### Team: * Tim Dennis, Chair * Claudia Horning, Library Metadata Team * Jamie Jamison, Data Collections Mgr * Lisa McAulay, Head, Digital Library * Jessica Mentesoglu, Head, DIIT Operations and Services * Rikke Ogawa, Director Biomed Library --- ### Methodology We adopted the process of Peer Benchmarking to: * Select peer institutions * Conduct a targeted environmental scan * Rank peers into "Tiers" of excellence --- ### Our Peers * Cornell * New York University * TU-Delft * UC Berkeley * UC San Francisco * University of Michigan * UNC Chapel Hill * University of Southern California * University of Virginia --- ### Benchmarking Criteria * Identity * Community building * Data Management Service * Data Integrity and Reproducible Research * Program Maturity --- ### Identity * **Research Data Services** is part of a coherent, documented, and integrated program supporting full data life cycle. --- ### Community Building * The Research Data Services program includes training, consulting and outreach programs. --- ### Data Management Services * Program includes actual services a researcher can leverage to * share, publish, and store their data in a repository * A locally managed repository service exists * which represents a local investment in software development or purchasing licensed services, or both. --- ### Data Integrity and Reproducible Research * The program offers consulting specifically about * researchers' methods to promote reusable data/software, and * researchers' options for preserving their data --- ### Program Maturity * The program is clearly well established and perceived as reliable * Characteristics include * dedicated staff * proven track record of quality services. --- ### Results: --- | Tier | Synopsis | Description | Institutions | | ---- | -------- | -----------| ------------ | | 1 | Leader |Robust across all bench-marking criteria | NYU, Cornell | | 2 | Excellent |Strong in most benchmarking criteria, very strong in some areas and weaker in others | UNC, UC Berkeley, TU-Delft, UCSF | --- | Tier | Synopsis | Description | Institutions | | ---- | -------- | -----------| ------------ | | 3 | Developing |Mixed assessment | UVA, UMich, USC | | 4 | Lagging |Weak assessment in all or most areas | **_UCLA_** | --- ## UCLA is behind in developing research data services, and we must take targeted action to improve --- ## Recommendations from the [Task Force Report](https://docs.google.com/document/d/18ogKqtKXZE7W7kj8Sxh1e_9tWqBckeXiQvfxhTB9NPo/export?format=pdf) --- ### We focus on moving from Tier 4 to Tier 3 today, because: * We can accomplish this by ourselves * Moving to Tiers 2 and 1 will require campus partnerships * This **first step** is a prerequisite for moving to 2 and 1 --- #### To move from Tier 4 to Tier 3: 7 Concrete Actions 1. Create a cross-library **Research Data Program** 2. Fill **existing provisions** and add **new roles** (recognize the specialized & technical nature of data) 3. Implement a **data sharing/publishing repository** and related data reuse tools 4. Redesign, unify, and heighten the Library's **research data support web representation** --- #### 7 Concrete Actions (cont.) 6. Provide a **data preservation infrastructure** 7. Develop an external-facing **training, consulting, and outreach program** 8. Create an internal-facing **community-of-practice based training program** --- ## Things we are already working on: --- ## Recommendation #1 We established the Library Data Science Center Note: * science of learning from data; * it studies the methods involved in the analysis and processing of data and proposes technology to improve methods in an evidence-based manner. * Help researchers improve their practices using applications from Data Science research * Provide infrastructural access to an array of tools in Data Science * Build community and data science curricula around these emerging practices --- ## Why Data Science and Why in the Library ? * Campus is ready for the Library to lead in this area * Data Science Services is largely a *Blue Ocean* at UCLA --> There are no competitors in the field with the reach and mandate that the Library has * The Library is open to partnering and has wide array of use cases and engagements * These concerns are interdisciplinary in nature that deal with heterogeneous data at scale * Driven by ubiquity of computing, software, and data - necessarily technical --- ## Things we are working on now (in part) *2. Fill existing provisions* *3. Implement a data sharing/publishing repository* *4. Develop an external-facing training, consulting, and outreach program* *7. Create an internal-facing community of practice-based training program* Note: * Some numbers: * We have started a number of efforts * Under-resourced, need capacity building * need more curators * more trainers * more specialized fte with technical capacity --- ## Recommendation \#2 ### Fill existing provisions * Spatial Data Sciences Librarian (SDSL) * active recruitment * Sciences Data Informationist (SDI) * awaiting posting Note: * Search committee meeting on SDSL and we have viable pool, if candidates still interested * SDI in queue to be posted * New jobs - * Data Science Center responsible for data repository and archival infrastructure * plus building pieces of Data Science and Research Data Management infrastructure (clients to a repo for injest, reuse, visualizing tools (gis, 3d objects) * analytic tools (juypter notebooks, R tools)) * Reproducibility Librarian - a focus on best practices in computational repro including standards for fields, instruction and building research tools to help --- ## Recommendation \#3 ### Implement a data sharing & publishing repository Note: * **Dataverse** will support self-service deposit plus sub-dvns (this is non intelligible but leaving it here as a note to you) * **Dataverse** has been implemented with existing DSC archival staff (Jamie and me) * Caveat -- Will need more trained staff to be effective for campus --- ## Recommendation \#6 ### Develop a campus-facing training, consulting, and outreach program --- ## Recommendation \#7 ### Create an internal-facing community of practice-based training program ### Develop a train-the-trainer methodology Note: * to the campus community in order to teach foundational coding and fundamental data skills needed to conduct research, * meet funder/journal mandates, * and support emerging expectations for transparency and reproducibility in research. * for public-facing librarians and staff to develop skills, knowledge, and confidence to work effectively with researchers about their data. * and an experience-based teaching continuum in which actual research consulting and/or reference encounters are used as case studies or practicums. Document expectations for public facing librarians regarding data consultations and hand-off procedures. * Carpentries membership and capacity building. --- ## One Year Horizon ### Near-term things we are poised to address --- ## More work on Recommendation \#1. ### Create an All-Library Research Data Program Note: * a precondition recommendation for rest of our library-oriented recommendation - led by dsc * responsibilities for consulting and outreach in support for the data life cycle * responsible for continuing to educational curriculum * needs to be unified and share communication * meets reg. for operational & ed., self-improvement, shares ticketing --- ## Exemplary All-Library Research Data Programs ### Look like this --> --- ![](https://i.imgur.com/4xtK3SR.png) --- ![](https://i.imgur.com/Z3FXl6I.png) --- ![](https://i.imgur.com/w0vriF8.png) --- ## Research Data Program * A requirement for moving up our Tiers is to have a RDS program * A vehicle for improving the data savviness of the campus and library * Needs to have at least 20% or more of job dedicated to data * Jobs need to have explicit relationship with RDP * Meet regularly for operations and education * Shared communication tools, tracking system, etc. --- ## Recommendations that Need More Time --- ## Recommendation #4 ### Redesign, unify, and heighten the Library's research data support web representation Note: services and information about data life cycle. Current offerings are misleading, outdated, and scattered. --- ### Examples we liked: --- ![](https://i.imgur.com/ipnZThD.png) --- ![](https://i.imgur.com/CgWorEd.png) --- ## And: ![](https://i.imgur.com/lSTnWnx.png) --- ## Needs to: * Be backed-up by real services (IT, mediated, or high touch) * With staff who have computing, software and data skills * And are engaged in a community-of-practice process of self-improvement and growth --- ## Design-wise needs to: * Be dynamic, filterable, and holistic * Allow for inclusion of campus partners * Should be high level (library.ucla.edu/data or research-data.ucla.edu ) * This will probably not be doable in LibGuides, but that could serve as backend --- ## Recommendation #5 ### Provide a data preservation infrastructure Note: * for locally stored research data that includes versioning, replicas and fixity/authenticity checks, including documentation that is maintained and publicly available. --- ### Summary * Research Data Services (RDS) are integral to continued academic and research excellence * Our peers are more advanced in this arena than we are * The Library needs to invest in Research Data Services to achieve the Strategic Plan Goal of being the "Heart of Research" * And we have to _assess_ our services regularly, working with the Assessment for Change Team is necessary
{"metaMigratedAt":"2023-06-14T18:47:32.661Z","metaMigratedFrom":"YAML","title":"UCLA Library Research Data Services Task Force","breaks":true,"slideOptions":"{\"theme\":\"beige\"}","contributors":"[{\"id\":\"1421b929-3417-4b58-b481-d440da17fe5d\",\"add\":14423,\"del\":6774},{\"id\":\"1dd682b5-acc8-40f7-90fd-c70e56951395\",\"add\":5741,\"del\":3126}]"}
    557 views