or
or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up
Syntax | Example | Reference | |
---|---|---|---|
# Header | Header | 基本排版 | |
- Unordered List |
|
||
1. Ordered List |
|
||
- [ ] Todo List |
|
||
> Blockquote | Blockquote |
||
**Bold font** | Bold font | ||
*Italics font* | Italics font | ||
~~Strikethrough~~ | |||
19^th^ | 19th | ||
H~2~O | H2O | ||
++Inserted text++ | Inserted text | ||
==Marked text== | Marked text | ||
[link text](https:// "title") | Link | ||
 | Image | ||
`Code` | Code |
在筆記中貼入程式碼 | |
```javascript var i = 0; ``` |
|
||
:smile: | ![]() |
Emoji list | |
{%youtube youtube_id %} | Externals | ||
$L^aT_eX$ | LaTeX | ||
:::info This is a alert area. ::: |
This is a alert area. |
On a scale of 0-10, how likely is it that you would recommend HackMD to your friends, family or business associates?
Please give us some advice and help us improve HackMD.
Do you want to remove this version name and description?
Syncing
xxxxxxxxxx
BioMonth 2021
tags:
bioinformatics
This is the place to ask questions about the workshop content! We use the Zoom chat only for posting links, reporting Zoom problems and such.
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →`
) for thecode blocks
. Otherwise, it's like a Google doc: it allows simultaneous editing.Code of conduct
We strive to follow the Code of Conduct developed by The Carpentries organisation to foster a welcoming environment for everyone. In short:
TO DO before the course
Zoom instructions
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →You can type your questions here. We will answer them, and this document will store the answers for you for later use!
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →General and practical matters
Q0: I have difficulty pasting my questions into HackMD. Do you have some instructions on how to write here?
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →Question 1: What is the schedule for today 16 March?
Q13: I cannot access most of the slides from the course page. These are marked with an earth logo.
Q17: Slides available after the course?
and then type: control/cmd+p (on mac, cmd+p works)
Q18: Course certificate?
Using Puhti
Question 2: How one knows how much memory an action requires?
seff
command: https://docs.csc.fi/computing/performance/Question 3: I failed to set the SHH key to CSC servers.
Answer: Let's take a look at this with you when there's the next hands-on break!
Which machine are you using? Mac or windows…
Question 4: Should I be able to connect to puhti.csc.fi using the same user and password that I use when I login to the CSC web pages? – I have a CSC login password as well as Haka password. I guess CSC login password should work for Puhti?
Question 5: Is it possible to add Graphab software to Puhti? https://sourcesup.renater.fr/www/graphab/en/home.html
Question 6: I have successfully connected via windows Powershell, putty and NoMachine - which one should I use, is there some reason to use one rather than another?
Q7: In putty what do we write in host name?
Q8: Can you demonstrate NoMachine login too?
https://docs.csc.fi/support/tutorials/nomachine-usage/
Q9: Do we need NoMachine (graphical access) during this course?
Q10: Do Puhti and Mahti rsync automatically or the user needs to do it?
Q11: Just to be sure: Is it so that files that are 90 days old are always removed from scratch, OR files that haven't been used in 90 days?
Q12: It says after the log in that scratch cleaning is not yet active (contrary to the slide)… When will this be implemented? (This is crucial since there were some problems with Allas and I still have many files that may not be properly backed up.)
Q14: In the last lines of the disk areas tutorial, what does the $USER part mean?
echo $USER
Q15: In the tutorial, you give permissions for the merged.fasta on the personal folder after copying. Do these permissions get also passed to the copied file in scratch?
Q16: Is it possible to ask for modules/software to be added to the list?
Q19: About the storage bonus excercise, https://github.com/CSCfi/csc-env-eff/blob/master/hands-on/disk-areas/disk-areas-exercise-fastdisks.md, how to send the script to compute node? There is no srun / sbatch specified in the script file, will it run in login node or in the compute one?
sbatch script_name.sh
NOTE: At the moment, Puhti is under unscheduled break; Please do this exercise when Puhti is back to normal.
Although a command is not preceded by
srun
in a batch script, it will still be executed in the compute node. Not usingsrun
may result in different action, though. See the template scripts per application for more use cases.Q20: Request for Relion: We would need it. It would be nice to put it to the list as it is now the most commonly used suite for cryo-EM.
Q21: Is it possible to take a moment to show how you would open a Jupyter Notebook using this sinteractive option?
Q22: Having a "tree: command not found"
Q23: Hi. I wonder that is it possible to accidentally modify files that do not belong to my own project?
Q24: I wonder how I can copy-paste into PuTTY?
Q25: mkdir: missing operand ? what is the problem here?
mkdir my_new_folder
Q26: Also on the EnaDataGet "No WGS set file available for AKYA02000000, format fasta"
pythium_iwayamai.fasta
from the first pythium excercise?enaDataGet ERR1558694 -f fastq
export LC_ALL="en_US.UTF-8"
Q27: What is this
ls
command showing?Q28: How to open Jupyter notebook on Puhti when you dont have ssh keys in place?
On interactive node load module that has jupyter notebook and then launch notebook as below:
If you have SSH keys set-up with Puhti, you can just follow the instructions that are displayed once jupyter is started. In case you use password to login to Puhti follow this ssh-tunneling process:
Point your browser to http://localhost:8888/?token=tokenkey (copy and paste this url from the instructions displayed once jupyter is launched)
If you want to use the server version, we recommend you set up the ssh-keys. Later this year, another setup called Open on Demand will at least partially provide another solution for this.
Q29: Is there any lower limit for billing? For example, if one needs to explore data - thus using less resources - can this be done "for free"?
Q30: #SBATCH -e and #SBATCH -o for error and output?
Q31: Can you explain how file editing goes: 'Copy the example above into a file called my_serial.bash '
nano my_serial.bash
. If you are on an interactive run, first typemodule load nano
and thennano my_serial.bash
. Using nano: https://www.nano-editor.org/dist/latest/cheatsheet.htmlQ32: When seff "perl: fatal: Bad job/step specified: slurm-5236590.out" MY BAD! bad input :)
Q33: I created the command file in nano, run it and it is said 'submitted batch job XXX'. But then, nothring happens? I cannot see the job in the queue either. it cannot run that long! – How to check the output then?
squeue -u $USER
won't show it anymore after it is completed. If you miss the output, you can however check output file (by default written into a file named slurm-XXXXXXX.out where XXXXXXX is a unique number corresponding to the job ID of the job), or use seff command to see what happened.Q34: Is there a difference between file types .bash and .sh?
Q35: Handy trick, Kimmo. Can we have that somewhere in tutorials/hackmd? How to setup the script with sbatch variables with the sinteractive command again?
Q36: I get this error, while trying to run sbatch my_serial.bash:
sbatch: error: AssocMaxSubmitJobLimit
sbatch: error: Batch job submission failed: Job violates accounting/QOS policy (job submit limit, user's size and/or time limits)
Q37: In the interactive jobs tutorial, you recomend using NoMachine for tools with a graphical interface. Would this apply when using a Rstudio as an interactive job? Would it improve the performance?
Q38: I had this experience that based on different sample size each time the mem usage was different, so sometimes I was allocating too much memory and sometimes the job would be stopped because of not enough memory! is there any option to know how much memory would be actually need?
Data management
Using Allas
Question 1: If you do not remember which protocol (swift/s3) you have used, can it be determined from the allas object?
Question 2: What if the bucket doesn't exist in Allas, when using rclone copy?
Question 3: Is there the 5G limit with a-put command?
Q4: Does the connection to allas get interrupted automatically after 8h? So is there a timer so we can check for how long we have been logged on to allas?
Q5: Is there a python/R interface for a-tools?
Q6: Would the other big files (>5G) go to the same segment-bucket as the previously moved ones?
Q6: Can I directly transfer all data in my poject_200XXX direcotry using a-put /scracth/Project_200XXX?
Q7: No autocomplete in Allas? Tricky to navigate!
Q8: What does -P mean in command rclone -P?
Q9: When running backups, is it typically OK to do this from the login node? Do we need to go to compute node?
Q10: About publicity: Allas is not 'eternal' link, so it cannot be used as a way to share data in publications, or can it?
Q11: What does -b mean in command
a-put pythium/* -b xxxx-genomes-ap
?--bucket
, "Define a name of the bucket into which the data is uploaded".a-put -h
helps when wondering what these options meanQ12: How can one save the big result files of sbatch-runs directly to Allas? Should the compression be avoided, if the result files are then later shared with other people in other institutions (published)?
-k
is the trick here, before you launch the batch job. The you can use a-put with options –nc or –asis if you dont want to use the automatic compression or packing.Q12: I wonder if there is a command to empty the console in PuTTY?
clear
Q13: To download data from outside CSC to Allas, how it is done? Just replacing the path with 'pythium'( as the command was 'a-put pythium')?
Q14: Can you just upload file from your computer to Allas via cPouta?
Q15: Is there a way to copy files from Funet FileSender link straight to Allas, i.e. without the need to first copy to your own computer?
Containers
Q1: Are containers executed from the concole?
Question 2: Singularity for Mac? Could not find a command-line version just a “beta desktop version”
Question 3: So I guess for a Mac user the strategy for building hpc compatible singularity images would be to build using docker (have it installed) and convert to singularity on Puhti?
Q4: I have a notable lag inputting text with sinteractive. Is it my connection or a feature of the system?
Q5: I wonder what am I doing wrong. Trying to open the text editor by typing 'nano' in the console but nothing happens…?
module load nano
first!Q6: Follow-up Q to Q5: After creating test.sh, the following error appears:[llindfor@r07c52 cont]$ sbatch test.sh
sbatch: error: Batch job submission failed: Invalid account or account/partition combination specified
Q7: Where should the output come for the batch job test.sh?
Q8: When using R in Puhti, we use the command "module load r-env-singularity" to load it. Is there some connection to the singularity with this?
Q9: Error writing test.sh: Permission denied. Why's that…?
chmod u+x test.sh
and then try againQ10: The
singularity build pytorch_20.03-py3.sif docker://nvcr.io/nvidia/pytorch:20.03-py3
command takes really long time I’ve been onINFO: Creating SIF file...
for almost 10 minutessingularity build alpine.sif docker://library/alpine:latest
Q11: Is it faster ifs we use more cpus/threads? Like:
sinteractive -c 4
Q12: Feedback: I wish that in all tutorials there would be a comment e.g. on the right side of the commands that we are supposed to run in console. Its often difficult to understand what each line is actually doing.
Q13: Do we have sudo privilige inside the singularity container to e.g. modify program settings?
Q14: Can we use the singularity inspect to see what tools are installed inside the container?
Q15: Env?
Q16: I'm seeing lots of warnings?
unset XDG_RUNTIME_DIR
-this will hide the warnings :) We could add this to our tutorial!Q17: Singularity wrapper can be used as a replace to
--bind
, meaning that it binds the host dir with inside the container itself?Q18: So, outside the sinteractive it would be:
export SINGULARITY_CACHEDIR=/scratch/project_200XX
for both cache and tmpdir?Q19: Is there some good community web-page covering nextflow pipelines combined to containers? I mean something other than nf-core?
Q20: At the uncomment step (
// publishDir 'results'
) I can't seem to find that line in fastqc.nf.The only "uncommentable" line is
// publishDir params.outdir
. Butparams.outdir = "results"
has been definied previously, so I think this should work the same wasy, no?Q21: How to to define
<image_path>
in here nextflow run <nextflow_script> -with-singularity <image_path>
How do I know which of the files is the image?shub://IARCbioinfo/nf_coverage_demo:v2.3
Q22: I didn't know we cannot use "nano" in the sinteractive! Wondering how should I open txt files?
module load nano
Q23: Can you explain again the thing about over-writting the param file?
reads
= hardcoded,params.reads
= changeableQ24: cPouta is free for researcher associated with University? And in future it will be good for sensitive data? But if project is over can it be still used e.g. Google cloud or amazon and how much is price (approx). –Do billing units apply in the case of Research in higher education institutions (=one of the free-of-use cases)? –Why do you call them billing units, that's confusing!
More about terms of use etc: https://research.csc.fi/terms-of-use-and-pricing?inheritRedirect=true
Billing units are these things we use to measure the usage of our services: if your use case is one of the free-of-charge use cases (like research in higher education institution here), you just apply for BUs, but they don’t cost really anything.
See the billing units chapter here: https://research.csc.fi/pricing
We call them billing units, because someone is paying for the use: if not the end-user (like in the free-use-cases), then it's the ministry :)
Q25: How should I change the Singularity cache location to my Project directory
Q26:
Q27:
Q28:
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →