rAcecaR: Antimicrobial Resistance (AR) Bioinformatics Pipeline¶
🚨 Warning 🚨
The results generated by this pipeline are not CLIA certified and should not be considered diagnostic.
Introduction¶
The ODHL_rAcecaR pipeline is designed for the detection and characterization of antimicrobial resistance (AMR) genes and other relevant genomic features from high-throughput sequencing data. This pipeline processes bacterial whole genome sequencing (WGS) and metagenomic sequencing data to identify antimicrobial resistance determinants, sequence types, and genomic features relevant to public health surveillance and epidemiological investigations.
This workflow is developed and maintained by the Ohio Department of Health Laboratory (ODHL)
and is built using Nextflow
, a robust workflow management system that ensures reproducibility and scalability across HPC clusters, cloud environments, and local machines. The pipeline was built off the work of the CDC Phoenix Pipeline
. The workflow leverages Docker
and Singularity containers
to provide a standardized execution environment.
Pipeline Workflow¶
The ODHL_rAcecaR pipeline consists of the following stages:
1. Read Quality Control (QC)¶
- Tool:
FastQC
- Purpose: Evaluates sequencing read quality for potential issues such as adapter contamination and sequencing errors.
2. Read Trimming and Filtering¶
- Tool:
Fastp
- Purpose: Removes low-quality bases and adapter sequences to improve downstream analysis.
3. Taxonomic Classification¶
- Tool:
Kraken2
-
Purpose: Assigns taxonomic labels to sequencing reads based on a comprehensive reference database.
-
Tool:
Krona
- Purpose: Generates interactive visualizations of taxonomic classification results.
4. Genome Assembly and Quality Assessment¶
- Tool:
SPAdes
-
Purpose: Performs de novo genome assembly from short-read sequencing data.
-
Tool:
QUAST
- Purpose: Assesses the quality and completeness of genome assemblies.
5. Sequence Typing & Resistance Gene Detection¶
- Tool:
MLST
-
Purpose: Identifies sequence types (STs) of bacterial genomes for epidemiological tracking.
-
Tool:
AMRFinderPlus
- Purpose: Detects known antimicrobial resistance (AMR) genes and point mutations associated with resistance.
6. Plasmid & Virulence Gene Identification¶
- Tool:
PlasmidFinder
-
Purpose: Identifies plasmid sequences associated with resistance gene transmission.
-
Tool:
Gamma
- Purpose: Detects horizontally transferred genes associated with virulence and resistance.
7. Phylogenetic Analysis & Similarity Metrics¶
- Tool:
FastANI
-
Purpose: Measures genome-wide similarity between bacterial isolates.
-
Tool:
Mash
- Purpose: Quickly estimates genetic distance between samples for clustering analyses.
Input Requirements¶
The ODHL_rAcecaR pipeline processes paired-end or single-end Illumina sequencing reads in FASTQ format. A sample sheet (CSV/TSV) should be provided with metadata for each sample with other input. View the Getting Started page
and Preparing Files page
for more information.for more information.
Output Summary¶
View the Output page
for more information.
Citations & Acknowledgments¶
This pipeline was developed by the Ohio Department of Health Laboratory (ODHL) to support public health surveillance of antimicrobial resistance (AMR). For contribution and contact information, view the contribution page
.