【4.1.1】qimme2介绍

一、基本介绍

https://docs.qiime2.org

qzv数据的查看

  1. 可以在 https://view.qiime2.org/ 上查看

QIIME2是微生物组分析流程QIIME的全新版,采用Python3全新编写

  1. 更易于安装:QIIME2引入了Miniconda软件包管理器,没有管理员权限也可以轻松安装;同时发布了docker镜像,下载即可运行
  2. 分析流程化:分析流程更加标准化,不让用户盲然下面该做什么;
  3. 可视化增强:QIIME后发制人,超越引用6964次的mothur流程,就是其可视化方面的优势,现可视化结果更加漂亮,且全新采用交互式图形结果,点选可查看细节,更易于分析
  4. 方便合作:项目很少一个组可完成,多人多地结果图表方便共享,适合当下科研合作的需求 可扩展:支持自定义功能并加入分析流程;高手可以自己写包,加入QIIME2的流程中
  5. 分析可重复:全新定义了文件系统,即包括分析数据、也包括分析过程和结果,每一步的结果,均可追溯全部分析过程,方便检查和重复

三、qiime2功能介绍

Usage: qiime [OPTIONS] COMMAND [ARGS]...
Commands:
  info                Display information about current deployment.
  tools               Tools for working with QIIME 2 files.
  dev                 Utilities for developers and advanced users.
  alignment           Plugin for generating and manipulating alignments.
  composition         Plugin for compositional data analysis.
  cutadapt            Plugin for removing adapter sequences, primers, and other unwanted sequence from sequence data.
  dada2               Plugin for sequence quality control with DADA2.
  deblur              Plugin for sequence quality control with Deblur.
  demux               Plugin for demultiplexing & viewing sequence quality.
  diversity           Plugin for exploring community diversity.
  emperor             Plugin for ordination plotting with Emperor.
  feature-classifier  Plugin for taxonomic classification.
  feature-table       Plugin for working with sample by feature tables.
  fragment-insertion  Plugin for extending phylogenies.
  gneiss              Plugin for building compositional models.
  longitudinal        Plugin for paired sample and time series analyses.
  metadata            Plugin for working with Metadata.
  phylogeny           Plugin for generating and manipulating phylogenies.
  quality-control     Plugin for quality control of feature and sequence data.
  quality-filter      Plugin for PHRED-based filtering and trimming.
  sample-classifier   Plugin for machine learning prediction of sample metadata.
  taxa                Plugin for working with feature taxonomy annotations.
  vsearch             Plugin for clustering and dereplicating with vsearch.

3.1 数据导入

Usage: qiime tools import [OPTIONS]
Options:
  --type TEXT             每步分析都会产生qza文件,都会有相应的语义类型,避免用户不合理的分析过程。
  --input-path PATH       输入文件路径,文件格式为sample-id,absolute-filepath,direction
  --output-path ARTIFACT  输出文件:.qza格式的文件名
  --input-format TEXT     输入数据格式:双端测序数据可设置为PairedEndFastqManifestPhred33V2
  --show-importable-types   查看所有的语义类型
  --show-importable-formats 查看所有输入数据格式
  --help                    查看说明

示例

# 导入原始序列
qiime tools import \ 
--type 'SampleData[PairedEndSequencesWithQuality]' \   #成对的带质量序列,要求序列ID与样品编号存在对应关系
--input-path sample_list.txt \ 
--input-format PairedEndFastqManifestPhred33 \ 
--output-path demux-paired-end.qza 
# 导入.biom格式的特征表:转换成qza格式
qiime tools import \ 
--type 'FeatureTable[Frequency]' \     # FeatureTable[Frequency]: 频率,即Feature表(OTU表),为每个样品中对应OTU出现频率的表格
--input-path feature_table.biom \ 
--input-format BIOMV210Format  \ 
--output-path feature_table.qza

3.2 生成OTU表

Usage: qiime dada2 [OPTIONS] COMMAND [ARGS]...
主要作用是去除低质量序列、嵌合体;再生成OTU表,现在叫Feature表,因为不再使用聚类方法,相当于QIIME时代100%相似度的OTU表。
Options:
  --version    Show the version and exit.
  --citations  Show citations and exit.
  --help       Show this message and exit.
Commands:
  denoise-paired  Denoise and dereplicate paired-end sequences
  denoise-pyro    Denoise and dereplicate single-end pyrosequences
  denoise-single  Denoise and dereplicate single-end sequences

示例

qiime dada2 denoise-paired \ 
--i-demultiplexed-seqs demux-paired-end.qza \ 
--o-table feature_table.qza \ 
--o-representative-sequences rep-seqs-dada2.qza \   #输出代表性序列文件
--o-denoising-stats stats-dada2.qza \   #过程统计
--p-trim-left-f 0 \      #表示从forward左端的第0位置开始
--p-trim-left-r 0 \      #表示从reverse左端的第0位置开始
--p-trunc-len-f 250 \    #表示forward总共保留的长度为250
--p-trunc-len-r 250     #表示reverse总共保留的长度为250 

3.3 操作特征表

Usage: qiime feature-table [OPTIONS] COMMAND [ARGS]...
  Description: This is a QIIME 2 plugin supporting operations on sample by
  feature tables, such as filtering, merging, and transforming tables.
Commands:
  core-features       Identify core features in table
  filter-features     Filter features from table
  filter-samples      Filter samples from table
  filter-seqs         Filter features from sequences
  group               Group samples or features by a metadata column
  heatmap             Generate a heatmap representation of a feature table
  merge               Combine multiple tables
  merge-seqs          Combine collections of feature sequences
  merge-taxa          Combine collections of feature taxonomies
  presence-absence    Convert to presence/absence
  rarefy              Rarefy table
  relative-frequency  Convert to relative frequencies
  subsample           Subsample table
  summarize           Summarize table
  tabulate-seqs       View sequence associated with each feature
  transpose           Transpose a feature table.

特征表统计

qiime feature-table summarize \ 
--i-table feature_table.qza \ 
--o-visualization feature_table.qzv
# QIIME2生成的图表结果文件类型,以.qzv为扩展名,末尾的v代表visual

过滤特征表:过滤样本

qiime feature-table filter-samples \ 
--i-table feature_table.qza \ 
--m-metadata-file sample-to-keep.tsv \ 
--o-filtered-table feature_table_filtered.qza

3.4 多样性分析

Usage: qiime diversity [OPTIONS] COMMAND [ARGS]...
  Description: This QIIME 2 plugin supports metrics for calculating and
  exploring community alpha and beta diversity through statistics and
  visualizations in the context of sample metadata.
Commands:
  adonis                     adonis PERMANOVA test for beta group significance
  alpha                      Alpha diversity
  alpha-correlation          Alpha diversity correlation
  alpha-group-significance   Alpha diversity comparisons
  alpha-phylogenetic         Alpha diversity (phylogenetic)
  alpha-rarefaction          Alpha rarefaction curves
  beta                       Beta diversity
  beta-correlation           Beta diversity correlation
  beta-group-significance    Beta diversity group significance
  beta-phylogenetic          Beta diversity (phylogenetic)
  beta-rarefaction           Beta diversity rarefaction
  bioenv                     bioenv
  core-metrics               Core diversity metrics (non-phylogenetic)
  core-metrics-phylogenetic  Core diversity metrics (phylogenetic and non-
                             phylogenetic)
  filter-distance-matrix     Filter samples from a distance matrix.
  mantel                     Apply the Mantel test to two distance matrices
  pcoa                       Principal Coordinate Analysis
  pcoa-biplot                Principal Coordinate Analysis Biplot
  procrustes-analysis        Procrustes Analysis 

核心多样性(无系统发育树)

qiime diversity core-metrics \ 
--i-table feature_table.qza \ 
--p-sampling-depth 138000 \ 
--m-metadata-file sample_metadata.tsv \ 
--output-dir /data/hushy/XJ/data_BFV/qiime2/diversity  #自动创建目录
# alpha多样性observed_otus(OTU数量)指数 observed_otus_vector.qza
# alpha多样性香农熵(考虑物种和丰度)指数 shannon_vector.qza
# alpha多样性evenness(均匀度,考虑物种和丰度)指数 evenness_vector.qza
# beta多样性bray_curtis距离矩阵 bray_curtis_distance_matrix.qza
# beta多样性可视化结果bray_curtis_emperor.qzv,图中Axis1 表示尽可能最大解释数据变化的主坐标成分,Axis2 为解释余下的变化度中占比例最大的主坐标成分,Axis3 等依次类推。
#Richness, Chao1,Shannon三个指数是常用的评估丰富度的指标,数值越高表明样品包含的物种丰富度就越高。
#Jaccard距离(群落差异的定性度量,即只考虑种类,不考虑丰度)
#Bray-Curtis距离(群落差异的定量度量)
#非加权UniFrac距离(包含特征之间的系统发育关系的群落差异定性度量)
#加权UniFrac距离(包含特征之间的系统发育关系的群落差异定量度量)

alpha多样性可视化及组间显著性分析

qiime diversity alpha-group-significance \ 
--i-alpha-diversity shannon_vector.qza \ 
--m-metadata-file sample_metadata.tsv \ 
--o-visualization shannon_vector.qzv
这里是一个广告位,,感兴趣的都可以发邮件聊聊:tiehan@sina.cn
个人公众号,比较懒,很少更新,可以在上面提问题,如果回复不及时,可发邮件给我: tiehan@sina.cn