Program is in my bit
toolkit, can be installed in a new environment like so if needed:
conda create -y -n bit -c conda-forge -c bioconda -c defaults -c astrobiomike bit=1.8.08
curl -L -o example-bracken-output-1.tsv https://ndownloader.figshare.com/files/24086822
curl -L -o example-bracken-output-2.tsv https://ndownloader.figshare.com/files/24086822
Script is bit-combine-bracken-and-add-lineage
. There is a help menu with -h
. It takes as input (-i
) a file either with 1 column of input file names only, or 2 tab-delimited columns with input file name in column 1 and the desired sample name in column 2, e.g.:
paste <( ls ex*bracken*.tsv ) <( printf "sample-1\nsample-2\n" ) > bracken-sample-name-map.tsv
cat bracken-sample-name-map.tsv
example-bracken-output-1.tsv sample-1
example-bracken-output-2.tsv sample-2
Running the script:
bit-combine-bracken-and-add-lineage -i bracken-sample-name-map.tsv -o Combined-taxonomy.tsv
This has a few things in it, but can be chopped down to what you want:
head Combined-taxonomy.tsv | column -ts $'\t'
domain phylum class order family genus species tax_id tax_level sample-1_num sample-1_frac sample-2_num sample-2_frac
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria monocytogenes 1639 S 399031 0.10780 399031 0.10780
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria grayi 1641 S 204 0.00006 204 0.00006
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria ivanovii 1638 S 555 0.00015 555 0.00015
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria welshimeri 1643 S 92 0.00002 92 0.00002
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria innocua 1642 S 53 0.00001 53 0.00001
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria seeligeri 1640 S 30 0.00001 30 0.00001
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria sp. PSOL-1 1844999 S 28 0.00001 28 0.00001
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Listeria Listeria weihenstephanensis 1006155 S 15 0.00000 15 0.00000
Bacteria Firmicutes Bacilli Bacillales Listeriaceae Brochothrix Brochothrix thermosphacta 2756 S 4 0.00000 4 0.00000
Interactive function plots and tables are here. BRAILLE update (31-Mar-2021) Previous update docs https://hackmd.io/@astrobiomike/BRAILLE-notes-17-Mar-2021 https://hackmd.io/@astrobiomike/BRAILLE-update-24-Feb-2021 https://hackmd.io/@astrobiomike/BRAILLE-update-3-Feb-2021 https://hackmd.io/@astrobiomike/BRAILLE-notes-12-Dec-2020
Nov 24, 2024Overview Metagenomics attempts to sequence all the DNA present in a sample. It can provide a window into the taxonomy and functional potential of a mixed community. There are a ton of things that can be done with metagenomics data as this non-exhaustive overview figure begins to highlight: <a href="https://astrobiomike.github.io/images/metagenomics_overview.png "><img src="https://astrobiomike.github.io/images/metagenomics_overview.png "></a> This page is an introduction to some concepts about one of the things we can try to do with metagenomics data: recovering metagenome-assembled genomes (MAGs). Key concepts
Jul 25, 2024<a href="https://github.com/AstrobioMike/AstrobioMike.github.io/raw/master/images/GToTree-logo-1200px.png "><img src="https://github.com/AstrobioMike/AstrobioMike.github.io/raw/master/images/GToTree-logo-1200px.png "></a>
Jul 22, 2024GUI used was Jetstream2 exosphere: https://jetstream2.exosphere.app/ Summary info The base image created below is publicly available as "STAMPS-2023" and includes: conda v23.5.2 / mamba v1.4.9 jupyterlab v3.6.3 in base conda env an anvio-dev conda environment R v4.3.1 / Rstudio Server (2023.06.1-524) with:BiocManager 1.30.21 remotes 2.4.2
Jul 8, 2024or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up