DOC_ID : T15-0002

QIIME2_TA module :

DOC_ID : M20-3000
Editor : Anita
Reviewer :Angela

Function :

We use taxonomy classifiers to determine the closest taxonomic affiliation with some degree of confidence or consensus, based on alignment, k-mer frequencies, etc.

This module contains many steps :

Classified by greengene pretraining classifier :
- Generate taxonomy_gg.qza
- tabulate: Interactively explore Metadata in an HTML table (taxonomy_gg.qza ⇒ taxonomy_gg.qzv)
- barplot: Visualize taxonomy with an interactive bar plot (table_dada2.qza and taxonomy_gg.qza ⇒ taxa_gg-bar-plots.qzv)
Classified by silva pretraining classifier :
- Generate taxonomy_silva.qza
- tabulate: Interactively explore Metadata in an HTML table (taxonomy_silva.qza ⇒ taxonomy_silva.qzv)
- barplot: Visualize taxonomy with an interactive bar plot (table_dada2.qza and taxonomy_silva.qza ⇒ taxa_silva-bar-plots.qzv)
Creating a taxonomy tsv table and add taxonomy data to biom file (for FAPROTAX analysis)
- taxonomy_gg.qza ⇒ taxonomy_gg.tsv ⇒ table_with_gg_taxonomy.biom
- taxonomy_silva.qza ⇒ taxonomy_silva.tsv ⇒ table_with_silva_taxonomy.biom

Ref : https://docs.qiime2.org/2021.8/tutorials/overview/#taxonomy-classification-and-taxonomic-analyses

Installation :

All software are included in GA environment.

Note :

►執行分析前請先利用CreateProject.sh創建一個專案資料夾，請參閱Project standard folder structure文件。

►執行模組需確認所屬計算節點(–partition) : 一般節點的使用者建議使用ct56 ; 生醫節點的使用者建議使用ngs48G^註1。

►欲了解模組使用的方式，請執行模組的 -h 指令

#註1 : 欲確認使用者身分，請登入國網中心iService後，選取會員中心/計畫管理/我的計畫，若計畫名稱為”國家生醫數位資料與分析運算雲端服務平台III”即為生醫節點使用者

Description :

Tested environment	GAQM20.0.0.2
Software version	qiime2=2021.8.0=py38h5479fe0_0 biom-format=2.1.10=py38h0b5ebd8_0
Usage(Slurm)	Command in Slurm (Taiwania III)Quick start `sbatch -A $projectID --mail-user=$email --export='projDir='$(pwd)'/,sampleName=manifest_PE.txt,metaData=metadata.tsv' modules/QIIME2_TA.sh` User self-provided classifier `sbatch -A $projectID --mail-user=$email --export='projDir='$(pwd)'/,sampleName=manifest_PE.txt,metaData=metadata.tsv, classifier=userpath/AA-classifier.qza,classifierName=AA' modules/QIIME2_TA.sh`
Usage(Linux console)	Command in linux consoleQuick start `bash modules/QIIME2_TA.sh -p $(pwd) -s manifest_PE.txt -m metadata.tsv` User self-provided classifier `bash modules/QIIME2_TA.sh -p $(pwd) -s manifest_PE.txt -m metadata.tsv -c userpath/AA-classifier.qza -n AA`
#For Slurm operation, please refer to “Basic operation of Taiwania III“

Usage :

The following explains the usage of module parameters :

Parameter	Description	Remark
QIIME2_TA.sh	Module of taxonomy analysis	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
sampleName	欲執行分析的Pair-end manifest file,相關格式要求可參考文件或官網資料格式 : *.txt資料路徑 : 分析專案的資料夾	例如:sampleName=manifest_PE.txt 是讀取存放在projDir裡的manifest_PE.txt檔案
metaData	欲執行分析的樣品元數據檔案,相關格式要求可參考文件或官網資料格式 : .tsv資料路徑 : 分析專案的資料夾►此檔案在建置時要注意每個欄位不可以有單獨sample為一個組別的情形*	例如:metaData=metadata.tsv是讀取存放在projDir裡的metadata.tsv檔案
classifier	視需求可自行提供分類器的檔案,相關格式要求可參考官網資料格式 : .qza資料路徑 : userpath/►此module預設有greengene及silva的taxonomy 分類器*	例如:classifier=userpath/AA-classifier.qza, 會讀取存放在user自行設定的path/資料夾裡的AA-classifier.qza檔案
classifierName	若有自行提供分類器的檔案,請將其命名	例如:classifierName=AA, 生成的檔案會自動冠名taxonomy_AA.qza taxonomy_AA.qzv taxa_AA-bar-plots.qzv table_with_AA_taxonomy.biom

基因體研究的全方位解決對策