DOC_ID : T11-0001
Doc_ID: A08-0001NCBIBioProj33317scikit0.24.1
Editor: Anita
Reviewer:
Description
參考QIIME2 forum上的文章,使用RESCRIPt下載來自NCBI Genbank的序列和分類,並訓練適用於QIIME2分析的分類器。
Source
Download URL : –
File size :
- ncbi-refseqs-unfiltered.qza 198KB
- ncbi-refseqs-taxonomy-unfiltered.qza 19.1KB
Genome assemble version : BioProj33317
Detail information :
使用RESCRIPt下載來自NCBI Genbank的序列和分類,並訓練適用於QIIME2分析的classifiers,請確認已完成qiime2 standard analysis environment的安裝
#Activate standard analysis environment
conda activate qiime2
#移動到/work/使用者帳號資料夾
cd /work/u5777333/
#創建放置參考序列及相應的分類法文件的資料夾
mkdir -p NCBIclassifier/BioProject_33317
#移動至資料夾
cd NCBIclassifier/BioProject_33317
#安裝RESCRIPt
conda install -c conda-forge -c bioconda -c qiime2 -c defaults xmltodict
pip install git+https://github.com/bokulich-lab/RESCRIPt.git
# 使用RESCRIPt下載來自NCBI Genbank的序列和分類資料
qiime rescript get-ncbi-data \
–p-query ‘33317[BioProject]’ \
–o-sequences ncbi-refseqs-unfiltered.qza \
–o-taxonomy ncbi-refseqs-taxonomy-unfiltered.qza
#Filter unusually short 16S rRNA gene sequences
qiime rescript filter-seqs-length-by-taxon \
–i-sequences ncbi-refseqs-unfiltered.qza \
–i-taxonomy ncbi-refseqs-taxonomy-unfiltered.qza \
–p-labels Archaea Bacteria \
–p-min-lens 900 1200 \
–o-filtered-seqs ncbi-refseqs.qza \
–o-discarded-seqs ncbi-refseqs-tooshort.qza
#using the –m-ids-to-keep-file parameter to only retain features found in our filtered sequences file
qiime rescript filter-taxa \
–i-taxonomy ncbi-refseqs-taxonomy-unfiltered.qza \
–m-ids-to-keep-file ncbi-refseqs.qza \
–o-filtered-taxonomy ncbi-refseqs-taxonomy.qza
#利用RESCRIPt來訓練classifiers
qiime rescript evaluate-fit-classifier \
–i-sequences ncbi-refseqs.qza \
–i-taxonomy ncbi-refseqs-taxonomy.qza \
–o-classifier ncbi-refseqs-classifier.qza \
–o-evaluation ncbi-refseqs-classifier-evaluation.qzv \
–o-observed-taxonomy ncbi-refseqs-predicted-taxonomy.qza
Bundle files
File list |
ncbi-refseqs-classifier-evaluation.qzv ncbi-refseqs.qza ncbi-refseqs-tooshort.qza ncbi-refseqs-classifier.qza ncbi-refseqs-taxonomy.qza ncbi-refseqs-unfiltered.qza ncbi-refseqs-predicted-taxonomy.qza ncbi-refseqs-taxonomy-unfiltered.qza |