DOC_ID : P04-0002
Editor : Anita
Reviewer : Lincat

Background

QIIME 2 is a powerful, extensible, and decentralized microbiome analysis package with a focus on data and analysis transparency. QIIME 2 enables researchers to start an analysis with raw DNA sequence data and finish with publication-quality figures and statistical results.

Learn more

Environment requirement

Hardware : CPU : 10 cores
RAM : 45 Gb
Software :The pipeline is developed based on GA System. All software used in the pipeline are included in pre-defined software environments (GA application collection ). Please install suitable environment before start the pipeline(How to do it!) .

Reference library

Classifier reference BuildNameContentQIIME2v2020.6gg-13-8-99-515-806-nb-classifier.qzasilva-138-99-515-806-nb-classifier.qza

Project folder structure

Project Name                  project.sh
              q2manifest.txt
                 q2metadata.tsv
- Raw
SingleEnd: SampleName_R1.fastq.gz
PairedEnd: SampleName_R1.fastq.gz, SampleName_R2.fastq.gz
- Processed
                rep-seq-dada2.qza/qzv
                table-dada2.qza
                taxonomy.qza/qzv
                feature-table.biom               …
- QC
fastqc: sample_fastqc.html / sample_fastqc.json
qualimap : sample folder
- Log
Summary.log
service_id.ER, OU
- Report
             MultiQC : multiqc_report.html
             core-metrics-results
             Alpha_diversity_indices.qza
             taxa-bar-plots.qzv
             feature-table.tsv             …*:Files prepared by users are marked in Red

Manifest File

Pair-end manifest file :

Description :
User have to create manifest file first. Manifest file described sample identifiers to absolute filepaths of raw reads file (fastq.gz or .fastq). The manifest file should indicates the direction of the reads in each raw read file. The manifest format is Metadata-compatible, user can re-use the manifest file to bootstrap Sample Metadata, too.The following example illustrates a simple fastq manifest file for paired-end read data for four samples. The fastq.gz absolute filepaths may contain environment variables $HOME or $PWD (e.g. $PWD/raw/S1_R1.fastq.gz ).

Ref : https://docs.qiime2.org/2020.6/tutorials/importing/#fastq-manifest-formats

Manifest format 1

sample-id	forward-absolute-filepath	reverse-absolute-filepath
S1	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S1_R1.fastq.gz	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S1_R2.fastq.gz
S2	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S2_R1.fastq.gz	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S2_R2.fastq.gz
S3	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S3_R1.fastq.gz	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S3_R2.fastq.gz
S4	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S4_R1.fastq.gz	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S4_R2.fastq.gz
…	…	…

Manifest format 2

sample-id,absolute-filepath,direction

S1,/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S1_R1.fastq.gz,forward

S1,/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S1_R2.fastq.gz,reverse

S2,/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S2_R1.fastq.gz,forward

S2,/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S2_R2.fastq.gz,reverse

…

►Download demo file: q2manifest.txt

Single-end manifest file :

Just like with fastq.gz, the absolute filepaths in the manifest for any fastq files must be accurate. The following example illustrates a simple fastq manifest file for fastq single-end read data for two samples.

sample-id	absolute-filepath
S1	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S1_R1.fastq
S2	/user_home/GA_bundle/DemoDataSet/QIIME2_Demo/raw/S2_R1.fastq

Metadata :

Description :
Metadata provides the key information of each sample. User should manually define sample metadata in metadata file before beginning a QIIME 2 analysis. The ID column is essential and the first column of metadata file. The ID must be unique and also list in manifest file too. User can all kinds of factor associated with sample ID. These factor would be used in following taxonomical, statistical and diversity analysis. You have to add comment directive to declare column types in your factor. The comment directive must appear directly below the header. The row’s first cell must be #q2:types to indicate the row is a comment directive. Subsequent cells may contain the values categorical or numeric (both case-insensitive). Your metadata file can be tested by online tools in official web site.

Reference: ver. 2020.6 [ref : https://docs.qiime2.org/2020.6/tutorials/metadata/]

ex .

QIIME 2 metadata的檔案格式為.tsv，檔案內第一個欄位”#SampleID”是必須的，其餘欄位可依user自行配置填寫，欄位間需用”Tab”鍵做間隔。

#SampleID	Subject	Pet	age	sex	BarcodeSequence
#q2:types	categorical	categorical	categorical	categorical	categorical
S1	Left upper eyelid	Dog	20-29	male
S2	Right upper eyelid	Dog	20-29	male
S3	Right outer nose	Dog	20-29	male
S4	Left armpit	None	20-29	female
…
S44	Left outer nose	Cat	20-29	female
S45	Right outer nose	Cat	20-29	female

►Download demo file: q2metadata.tsv

The diagram of workflow

QIIME2_DPP_PE : QIIME2_DPP_PE perform Data import, denoise, dereplicate, and filters chimeras of paired-end sequences.(Ref)

QIIME2_TA : We use taxonomy classifiers to determine the closest taxonomic affiliation with some degree of confidence or consensus (which may not be a species name if one cannot be predicted with certainty!), based on alignment, k-mer frequencies, etc. (Ref)

QIIME2_DA : QIIME 2 analysis allows the use of phylogenetic trees for diversity metrics such as Faith’s Phylogenetic Diversity and UniFrac distance as well as feature-based analyses in Gneiss. The tree provides an inherent structure to the data, allowing us to consider an evolutionary relationship between organisms. Alpha diversity measures the level of diversity within individual samples. Beta diversity measures the level of diversity or dissimilarity between samples. (Ref)

PICRUSt2 : PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States) is a software for predicting functional abundances based only on marker gene sequences. (Ref)

FAPROTAX : FAPROTAX is a manually constructed database that maps prokaryotic taxa (e.g. genera or species) to metabolic or other ecologically relevant functions (e.g. nitrification, denitrification or fermentation), based on the literature on cultured representatives. (Ref)

Module list

Module code	Supported References	Node	module Information
QIIME2_DPP_PE.2.0.0.sh		ngs48G	Data preprocessing and mapping
QIIME2_TA.2.0.0.sh	QIIME2_v2020.6	ngs48G	Taxonomy analysis
QIIME2_DA.2.0.0.sh		ngs16G	Diversity analysis
PICRUSt2.2.0.0.sh		ngs48G	Predicte gene abundances
FAPROTAX.2.0.0.sh		ngs16G	annotation gene marker and phenotype

Workflow script configuration

Color theme of Script
Gray(Dark Gray): Script comment
Red (Pale Red): module name
Green (Dark Emerald): Variables defined by use

The workflow script demonstrate genome variant calling pipeline implemented by using QIIME2, PICRUSt2 and FAPROTAX modules. The work script are included several functional section. The detail of each section show below.

1. Parameter setup :

# *****************************************
# ***** Global parameters ******
# *****************************************
# set your email for job message senting
email = user@your.email

# set your project account for submit job
projectID = MST-XXXXXXXX

2. Data preprocessing :

QIIME2_DPP_PE perform Data import, denoise, dereplicate, and filters chimeras of paired-end sequences.

Ref : https://docs.qiime2.org/2020.6/tutorials/overview/

# ****************************************************
# ****** Data preprocessing ******
# ***SampleN_R1,R2 => rep-seq-dada2.qza ***
# **** SampleN_R1,R2 => table-dada2.qza ****
# *** SampleN_R1,R2 => taxonomy.qza/qzv ***
# ****************************************************

# ===== Data preprocessing =====
QIIME2_DPP_PE=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt‘, metaData=“q2metadata.tsv”‘ QIIME2_DPP_PE.2.X.Y.sh)
echo -e $(date)’\tQIIME2_DPP_PE\t’$QIIME2_DPP_PE’\tSubmit’ >> log/Summary.log

# ===== (!!Adjust denoise parameters!!) Data preprocessing =====
QIIME2_DPP_PE=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt‘, metaData=“q2metadata.tsv”,denoise=220:220:0:0:2.0:2.0:2:pooled‘ QIIME2_DPP_PE.2.X.Y.sh)
echo -e $(date)’\tQIIME2_DPP_PE\t’$QIIME2_DPP_PE’\tSubmit’ >> log/Summary.log

Parameter	Description	Remark
QIIME2_DPP_PE.2.X.Y.sh	Module of data preprocessing	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
sampleName	The name of pair-end manifest file, refer to the document or official website for relevant format requirements, please.File format : *.txtFile path : 分析專案的資料夾	例如:sampleName=q2manifest.txt 是讀取存放在projDir資料夾裡的q2manifest.txt檔案
metaData	The name of metadata file, refer to the document or official website for relevant format requirements, please.File format : *.tsvFile path : 分析專案的資料夾	例如:sampleName=q2metadata.tsv 是讀取存放在projDir資料夾裡的q2metadata.tsv檔案
denoise	The parameters of adjust denoise. The setting of related parameters, please refer to the official website.►此參數為選擇性參數, 若無設定此參數, 模組會自動調用預設值 240:240:0:0:2.0:2.0:2:pooled	例如:denoise=220:220:0:0:2.0:2.0:2:pooled 程式會依其設定的denoise參數進行處理

3. Diversity analysis :

QIIME 2 analysis allows the use of phylogenetic trees for diversity metrics such as Faith’s Phylogenetic Diversity and UniFrac distance as well as feature-based analyses in Gneiss. The tree provides an inherent structure to the data, allowing us to consider an evolutionary relationship between organisms.
Alpha diversity measures the level of diversity within individual samples. Beta diversity measures the level of diversity or dissimilarity between samples.

Ref : https://docs.qiime2.org/2020.6/tutorials/overview/

# ****************************************************
# ******* Diversity analysis *******
# ****** rep-seq-dada2.qza => *.qza/ *.qzv ******
# ******** table-dada2.qza => *.qza/ *.qzv ********
# ****************************************************

# ===== Diversity analysis =====
QIIME2_DA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=‘q2manifest.txt’, metaData=‘q2metadata.tsv’‘ QIIME2_DA.2.X.Y.sh)
echo -e $(date)’\tQIIME2_DA\t’$QIIME2_DA’\tSubmit’ >> log/Summary.log

# ****************************************************
# ******* Diversity analysis *******
# ****** rep-seq-dada2.qza => *.qza/ *.qzv ******
# ******** table-dada2.qza => *.qza/ *.qzv ********
# ****************************************************

# ===== (!!!Further analysis the column in metadata!!!) Diversity analysis =====
QIIME2_DA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=‘q2manifest.txt’, metaData=‘q2metadata.tsv’,fieldName=“Subject:sex”‘ QIIME2_DA.2.X.Y.sh)
echo -e $(date)’\tQIIME2_DA\t’$QIIME2_DA’\tSubmit’ >> log/Summary.log

Parameter	Description	Remark
QIIME2_DA.2.X.Y.sh	Module of diversity analysis	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
sampleName	The name of pair-end manifest file, refer to the document or official website for relevant format requirements, please.File format : *.txtFile path : 分析專案的資料夾	例如:sampleName=q2manifest.txt 是讀取存放在projDir資料夾裡的q2manifest.txt檔案
metaData	The name of metadata file, refer to the document or official website for relevant format requirements, please.File format : *.tsvFile path : 分析專案的資料夾	例如:metaData=q2metadata.tsv是讀取存放在projDir資料夾裡的q2metadata.tsv檔案
fieldName	在執行分析時視需求可以進一步分析樣品Metadata檔案內之欄位名稱►此參數為選擇性參數, 若欲分析的欄位多於一個, 請在欄位名稱中以冒號(:)做間隔(e.g. Subject:sex)	例如:fieldName=Subject 會再將Metadata檔案裡的Subject欄位做進一步分析

4. Taxonomic analysis :

We use taxonomy classifiers to determine the closest taxonomic affiliation with some degree of confidence or consensus (which may not be a species name if one cannot be predicted with certainty!), based on alignment, k-mer frequencies, etc.

Ref : https://docs.qiime2.org/2020.6/tutorials/overview/

# ****************************************************
# ******* Taxonomy analysis *******
# ****** rep-seq-dada2.qza => *.qza/ *.qzv ******
# ******** table-dada2.qza => *.qza/ *.qzv ********
# ****************************************************

# =====Taxonomy analysis =====
QIIME2_TA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=‘$(pwd)’/,sampleName=‘q2manifest.txt’, metaData=“q2metadata.tsv”‘ QIIME2_TA.2.X.Y.sh)
echo -e $(date)’\tQIIME2_TA\t’$QIIME2_TA’\tSubmit’ >> log/Summary.log

# =====(!!user self-provided classifier!!) Taxonomy analysis =====
QIIME2_TA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=‘$(pwd)’/,sampleName=‘q2manifest.txt’, metaData=“q2metadata.tsv”,classifier=userpath/AA-classifier.qza,classifierName=AA‘ QIIME2_TA.2.X.Y.sh)
echo -e $(date)’\tQIIME2_TA\t’$QIIME2_TA’\tSubmit’ >> log/Summary.log

Parameter	Description	Remark
QIIME2_TA.2.X.Y.sh	Module of taxonomy analysis	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
sampleName	The name of pair-end manifest file, refer to the document or official website for relevant format requirements, please.File format : *.txtFile path : 分析專案的資料夾	例如:sampleName=q2manifest.txt 是讀取存放在projDir資料夾裡的q2manifest.txt檔案
metaData	The name of metadata file, refer to the document or official website for relevant format requirements, please.File format : *.tsvFile path : 分析專案的資料夾	例如:metaData=q2metadata.tsv是讀取存放在projDir資料夾裡的q2metadata.tsv檔案
classifier	According to the needs, user can provide the classifier file byself, refer to the official website for relevant format requirements, please.File format : .qzaFile path : userpath*	例如:classifier=userpath/AA-classifier.qza, 會讀取存放在user自行設定的path/資料夾裡的AA-classifier.qza檔案
classifierName	The name of user self-provided classifier file	例如:classifierName=AA, 生成的檔案會自動冠名taxonomy_AA.qza taxonomy_AA.qzv taxa_AA-bar-plots.qzv table_with_AA_taxonomy.biom

5. Predicte gene abundances :

PICRUSt2 (Phylogenetic Investigation of Communities by Reconstruction of Unobserved States) is a software for predicting functional abundances based only on marker gene sequences.

Ref : https://github.com/picrust/picrust2/wiki

# ****************************************************
# ******* Predicte gene abundances *******
# ****** feature-table.biom => *.tsv.gz/ *.txt ******
# ****************************************************

# =====Predicte gene abundances =====
PICRUSt2=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v “projDir=‘$(pwd)’/,sampleName=‘dna-sequences.fasta’,tableBiom=‘feature-table.biom’” PICRUSt2.2.X.Y.sh)
echo -e $(date)’\tPICRUSt2\t’$PICRUSt2’\tSubmit’ >> log/Summary.log

Parameter	Description	Remark
PICRUSt2.2.X.Y.sh	Module of predicting functional abundances	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
sampleName	The name of amplicon sequenceFile format : *.fastaFile path : processed/	例如:sampleName=”dna-sequences.fasta” 會將存放在processed/資料夾裡的rep-seqs_dada2.qza之檔案解壓縮後, 讀取內部的dna-sequences.fasta之資料
tableBiom	The name of biom table with the abundance of each amplicon sequence variant in the sample.File format : *.biomFile path : processed/	例如:tableBiom=feature-table.biom 會讀取放在processed/資料夾裡的檔案feature-table.biom

6. Annotation gene marker and phenotype :

FAPROTAX is a manually constructed database that maps prokaryotic taxa (e.g. genera or species) to metabolic or other ecologically relevant functions (e.g. nitrification, denitrification or fermentation), based on the literature on cultured representatives.Functions represented in FAPROTAX focus on marine and lake biogeochemistry, particularly sulfur, nitrogen, hydrogen and carbon cycling, although other functions (e.g. plant pathogeneicity) are also included.

Ref : http://www.loucalab.com/archive/FAPROTAX/lib/php/index.php?section=Instructions

# ****************************************************
# *** annotation gene marker and phenotype ***
# ****** *_taxonomy.biom => *.tsv.gz/ *.txt ******
# ****************************************************

# ===== annotation gene marker and phenotype =====
FPO=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v “projDir=‘$(pwd)’/,inFile=‘table-with-taxonomy.biom’,output=‘Case’” FAPROTAX.2.X.Y.sh)
echo -e $(date)’\tFAPROTAX\t’$FPO’\tSubmit’ >> log/Summary.log

Parameter	Description	Remark
FAPROTAX.2.X.Y.sh	module of annotation gene marker and phenotype	分析的模組需存放在[modules]資料夾中
projDir	分析專案的資料夾路徑（專案資料夾結構說明）	Script需在分析專案的資料夾執行， $(pwd) 會傳回使用者現在所在的路徑
inFile	為樣品中每個擴增子序列變體豐度並加上分類數據的BIOM表File format : *.biomFile path : processed/	例如:File=table-with-taxonomy.biom 會讀取放在processed/資料夾裡的table-with-taxonomy.biom檔案
output	輸出的檔案名稱File format : .tsv / .txtFile path : report/	例如:output=Case 會在report/資料夾生成:1. Case-func_table_with_taxonomy.tsv2. Case_report.txt

Demo Workflow script :

#!/bin/bash

# ************* QIIME2 Microbiota analysis
# ************* Platform Taiwania
# ************* Script version 2.0 (Applies to modules from version 2.0.0)
## [projDir] project <= manifest.txt, metadata.tsv
##   +– [rawDir] raw <= Sample1,2,3_R1/_R2.fastq.gz
##   +– [prcDir] processed => *.qza, *.qzv
##   +– [anaDir] analyzed
##   +– [rptDir] report => *.qzv,*.tsv.gz, *.tsv, *.txt
##   +– [tmpDir] temp
##   +– [logDir] log => Summary.log
##   +– [qcDir] QC

## 1. ##
# *****************************************
# ****** Global parameters ******
# *****************************************

# set your email for job message senting
email = user@your.email
# set your project account for submit job
projectID = MST-XXXXXXXX

# *****************************************
# ************* raw read QC *************
# *****************************************
#===== fastqc module =====
#Use Raw_Read_Quality_Check-Ver2 pipeline

## 2. ##
# *****************************************
# ****** Data preprocessing *****
# *** SampleN_R1,R2 => rep-seq-dada2.qza***
# **** SampleN_R1,R2 => table-dada2.qza****
# *** SampleN_R1,R2 => taxonomy.qza/qzv****
# *****************************************

# ===== Data preprocessing =====
QIIME2_DPP_PE=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=”q2metadata.tsv”‘ QIIME2_DPP_PE.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_DPP_PE\t’$QIIME2_DPP_PE’\tSubmit’ >> log/Summary.log

# ===== (!!Adjust denoise parameters!!) Data preprocessing =====
QIIME2_DPP_PE=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=”q2metadata.tsv”,denoise=220:220:0:0:2.0:2.0:2:pooled’ QIIME2_DPP_PE.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_DPP_PE\t’$QIIME2_DPP_PE’\tSubmit’ >> log/Summary.log

## 3. ##
# *****************************************
# ******* Diversity analysis ********
# *** rep-seq-dada2.qza => *.qza/ *.qzv ***
# **** table-dada2.qza => *.qza/ *.qzv ****
# *****************************************

# ===== Diversity analysis =====
QIIME2_DA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=’q2metadata.tsv” QIIME2_DA.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_DA\t’$QIIME2_DA’\tSubmit’ >> log/Summary.log

# ===== (!!!Further analysis the column in metadata!!!) Diversity analysis =====
QIIME2_DA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=’q2metadata.tsv’,fieldName=”Subject:sex”‘ QIIME2_DA.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_DA\t’$QIIME2_DA’\tSubmit’ >> log/Summary.log

## 4. ##
# *****************************************
# ******* Taxonomy analysis ********
# *** rep-seq-dada2.qza => *.qza/ *.qzv ***
# **** table-dada2.qza => *.qza/ *.qzv ****
# *****************************************

# =====Taxonomy analysis =====
QIIME2_TA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=”q2metadata.tsv”‘ QIIME2_TA.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_TA\t’$QIIME2_TA’\tSubmit’ >> log/Summary.log

# =====(!!user self-provided classifier!!) Taxonomy analysis =====
QIIME2_TA=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v ‘projDir=’$(pwd)‘/,sampleName=’q2manifest.txt’, metaData=”q2metadata.tsv”,classifier=userpath/AA-classifier.qza,classifierName=AA’ QIIME2_TA.2.X.Y.sh)
echo -e $(date)‘\tQIIME2_TA\t’$QIIME2_TA’\tSubmit’ >> log/Summary.log

## 5. ##
# ****************************************
# ***** Predicte gene abundances *****
# **feature-table.biom => *.tsv.gz/*.txt**
# ****************************************

# =====Predicte gene abundances =====
PICRUSt2=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v “projDir=’$(pwd)‘/,sampleName=’dna-sequences.fasta’,tableBiom=’feature-table.biom'” PICRUSt2.2.X.Y.sh)
echo -e $(date)‘\tPICRUSt2\t’$PICRUSt2’\tSubmit’ >> log/Summary.log

## 6. ##
# ****************************************
# * annotation gene marker and phenotype *
# ** *_taxonomy.biom => *.tsv.gz/ *.txt **
# ****************************************

# =====annotation gene marker and phenotype =====
FPO=$(qsub -P $projectID -W group_list=$projectID -m e -M $email -v “projDir=’$(pwd)‘/,inFile=’table-with-taxonomy.biom’,output=’Case'” FAPROTAX.2.X.Y.sh)
echo -e $(date)‘\tFAPROTAX\t’$FPO’\tSubmit’ >> log/Summary.log

Resource

Demo data set : QIIM2 Demo data set (ref)

Report template: Microbiome QIIME2 report Template (ref)

基因體研究的全方位解決對策

基因體研究的全方位解決對策

QIIME2-Microbiota_Analysis

Background

Environment requirement

Reference library

Project folder structure

Manifest File

Pair-end manifest file :

Single-end manifest file :

Metadata :

The diagram of workflow

Module list

Workflow script configuration

1. Parameter setup :

2. Data preprocessing :

3. Diversity analysis :

4. Taxonomic analysis :

5. Predicte gene abundances :

6. Annotation gene marker and phenotype :

Demo Workflow script :

Resource

Leave a comment 取消回覆