FOSS Materials/Useful links
Instant Feedback (please complete before you leave class):
Course Homepage
Course Schedule
Reproducibility tour
General notes
In opening breakouts, do your best as a team to try and answer the follwing questions (all related to the tutorial)
What is the point of starting a GitHub repository at the start of the project? What will you be doing with this?
What differences can you name between
a. installing Conda
b. using Conda to install other tools?
What is the difference between using a tool installed with Conda vs. using a tool in a Docker container?
What is the point of copying data to the datastore (end of the section "Computer and project setup with Github, Conda, and the Data Store")
Breakout notes
Team A
a. installing Conda -management system only, can't run anything b. using Conda to install other tools? Conda is a package managment system. It helps you manage workflows etc, but doesnt' actually do the work you want to do.
Team B
What differences can you name between
a. installing Conda
b. using Conda to install other tools?
What is the difference between using a tool installed with Conda vs. using a tool in a Docker container?
What is the point of copying data to the datastore (end of the section "Computer and project setup with Github, Conda, and the Data Store")
Team C
keep track of changes, having and building clear documentation of what you work does (building it as you go), enables collaboration through tracking and push pulling
this is a tool that helps you to install things
b. using Conda to install other tools?
equivalent (? but better than) to pip install, let's you install pacakges and dependencies for python and other tools
Question: when we use docker, are we just talking to a virtual computer that hosts the programs instead of downloading it onto our own computer? We think this is the difference but we're all still a bit confused on what docker is "doing"
Github can't host large files but the datastore can
Jason says scratch returns to system and gets erased, datastore is persistent
Team D
What is the point of starting a GitHub repository at the start of the project? What will you be doing with this?
What differences can you name between
a. installing Conda:
b. using Conda to install other tools?
What is the difference between using a tool installed with Conda vs. using a tool in a Docker container?
What is the point of copying data to the datastore (end of the section "Computer and project setup with Github, Conda, and the Data Store")
Team E
What is the point of starting a GitHub repository at the start of the project? What will you be doing with this?
What differences can you name between
a. installing Conda
b. using Conda to install other tools?
Conda is installed in your computer, whereas when you use conda to install other tools those will be in the conda environment of your choice.
What is the difference between using a tool installed with Conda vs. using a tool in a Docker container?
Conda allows for more modular uses of tools. With Conda you still will have to install some of the dependencies, or make sure they are in the same environment as the tool you will use. Whereas a Docker container will contain everything you need for the intended task, and nothing more.
What is the point of copying data to the datastore (end of the section “Computer and project setup with Github, Conda, and the Data Store”)
Team F
What is the point of starting a GitHub repository at the start of the project? What will you be doing with this? Storing/versioning code and changes over time, including attribution. Backup. Project website. Invite collaborators. Get feedback/issues from the public.
What differences can you name between
a. installing Conda
Installed via an installer, i.e. not a package.
b. using Conda to install other tools?
These are packages managed by conda in a specific environment, which can be used to have multiple versions of packages on the same computer but isolated from each other.
The tool is running in the docker container and has access to those resources that the docker container exposes, vs with conda, the tool has access to the computer directly. How you launch the tool is a different sequence, i.e. conda activate, then run vs docker run.
So you can use (mount) the data easily and efficiently to containers and virtual machines. Allow others to use data and results.
Team G
What is the point of starting a GitHub repository at the start of the project? What will you be doing with this? a. To establish a central repository for all work amd ensure that work can be coordinated effectively without wasting time and energy. b. Creating a repository for our team to work on our capstone.
What differences can you name between
a. installing Conda - Conda is a tool [that lets you efficiently install other tools] You can't use conda to install conda
b. using Conda to install other tools? Conda can be used to install & maintain (update) the tools/packages.
What is the difference between using a tool installed with Conda vs. using a tool in a Docker container? a. Conda installs the tools natively in your operating system/vm, whereas a tool in a docker container exist only within the container. For tools to be used in a container, they must be installed everytime the container is first booted up.
What is the point of copying data to the datastore (end of the section "Computer and project setup with Github, Conda, and the Data Store") a.Allows for the data to be used and shared within Cyverse.