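Primary / Secondary / Tertiary Analysis: How to Classify?
===
###### tags: `genomics`
###### tags: `bioinformatics`, `genomics`, `primary analysis`, `secondary analysis`, `tertiary analysis`

<br>

[TOC]

<br>

## A short summary of genomic data cleaning

### ==FastQC==
- After the sequencer finishes, it produces a FASTQ file (raw fastq)
    `raw fastq -> FastQC -> updated fastq`
    - Note: FastQC itself only reports quality metrics; the updated FASTQ comes from a separate trimming or filtering step.
- **Purpose:**
    - Exclude low-quality, low-confidence base calls
- **Scope:**
    - Can belong to primary or secondary analysis, depending on whether the sequencing vendor performs it
    - Alternatively, the vendor may deliver both the raw and the updated file

### ==BQSR + ApplyBQSR==
- After sequence assembly (alignment) finishes, it produces a BAM file (raw bam)
    `raw bam -> BQSR + ApplyBQSR -> updated bam`
- **Purpose:**
    - Use known SNP information for recalibration, normalize and readjust the base quality scores, and produce a set of base metrics
- **Scope:**
    - Belongs to secondary analysis. ApplyBQSR apparently is often no longer run these days, since it may actually raise the false-positive rate of variant calls

### ==VCF Filter==
- After the variant caller finishes, it produces a VCF file (raw vcf)
    `raw vcf -> VCF Filter -> updated vcf`
- **Purpose:**
    - Exclude low-quality, low-confidence variant records
- **Scope:**
    - Can belong to secondary or tertiary analysis, depending on whether the secondary-analysis vendor performs it
    - Alternatively, the vendor may deliver both the raw and the updated file

<br>

## Tertiary analysis
- ### [Next Generation Sequencing Data: Tertiary Analysis](https://bioinfoinc.com/next-generation-sequencing-data-tertiary-analysis/)
    > The core of tertiary analysis is what we refer to as ‘interpretation.’ Interpretation involves
    > - the biological classification of observed variants,
    > - determination of the clinical relevance of these variants,
    > - the deemed action-ability of these variants in terms of treatment options,
    > - and extends to the ordering physician in terms of how clinically helpful the results or recommendations are.
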
- ### [[Databricks] Tertiary analysis](https://docs.databricks.com/applications/genomics/tertiary/index.html)
    - [Joint genotyping pipeline](https://docs.databricks.com/applications/genomics/tertiary/joint-genotyping-pipeline.html)
        - [Genotyping (基因型分型, Chinese Wikipedia)](https://zh.wikipedia.org/wiki/%E5%9F%BA%E5%9B%A0%E5%9E%8B%E5%88%86%E5%9E%8B)
            > To trace an individual's genetic father or mother, only ten to twenty genomic regions are needed (such as single-nucleotide polymorphisms, SNPs)
    - [GloWGR: Whole genome regression](https://docs.databricks.com/applications/genomics/tertiary/glowgr.html)
    - [Hail 0.2](https://docs.databricks.com/applications/genomics/tertiary/hail.html)

<br>

## References
- ### [BIOINFORMATICS 101: GENOME ANALYSIS TOOLKIT (GATK) 4](https://medicine.musc.edu/-/sm/medicine/departments/centers/bioinformatics/f/bio101-bioinformatics-gatk4.ashx?la=en)

- ### [所謂「標準分析」的定義並沒有一定的標準 (There is no fixed standard for what "standard analysis" means)](https://medium.com/@chungtsai/9c7c9521059d)
    > A-Tsai (阿才), Sep 22, 2019 · 11 min read

    ![](https://i.imgur.com/OnBinZP.png)
    ![](https://i.imgur.com/fExM0Rx.png)
    - A simple way to tell the levels apart:
        - If the vendor's final deliverable is only FASTQ files, only primary analysis was done;
        - If the deliverable includes VCF files, secondary analysis was certainly done,
        - possibly together with part of the tertiary analysis.
- ### [ClinOme -- a User Friendly Computational Tool to Generate Automated Clinical Reports from Raw NGS Data](http://www.actrec.gov.in/pi-webpages/AmitDutt/Clinome.html)
    ![](https://i.imgur.com/xRDnDfH.png)

<br>

## Keywords
- Primary Analysis (一級分析)
- Secondary Analysis (二級分析)
- Tertiary Analysis (三級分析)
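
<br>

## Sketch: classifying a delivery by file type

The rule of thumb quoted in the references (only FASTQ delivered means primary analysis; VCF delivered means at least secondary analysis) can be written as a small Python sketch. The function name `classify_delivery`, the suffix lists, and the returned labels are illustrative assumptions, not part of any cited tool. File types the rule does not mention (for example BAM) are deliberately reported as `"unknown"` rather than guessed.

```python
from typing import Iterable

# Assumed filename suffixes; adjust to the vendor's naming convention.
FASTQ_SUFFIXES = (".fastq", ".fq", ".fastq.gz", ".fq.gz")
VCF_SUFFIXES = (".vcf", ".vcf.gz")


def classify_delivery(filenames: Iterable[str]) -> str:
    """Return the analysis level implied by a vendor's deliverables.

    Hypothetical helper that encodes only the rule of thumb above:
    - any VCF file      -> "secondary" (possibly plus part of tertiary)
    - FASTQ files only  -> "primary"
    - anything else     -> "unknown" (the rule does not cover it)
    """
    names = [name.lower() for name in filenames]
    if any(name.endswith(VCF_SUFFIXES) for name in names):
        return "secondary"
    if names and all(name.endswith(FASTQ_SUFFIXES) for name in names):
        return "primary"
    return "unknown"


if __name__ == "__main__":
    print(classify_delivery(["sample_R1.fastq.gz", "sample_R2.fastq.gz"]))
    print(classify_delivery(["sample_R1.fastq.gz", "sample.vcf.gz"]))
```

A VCF alone cannot tell you whether variant filtering or any tertiary step was also applied, so `"secondary"` here should be read as "at least secondary".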