Skip to content

rAcecaR: Antimicrobial Resistance (AR) Bioinformatics Pipeline

Hits

🚨 Warning 🚨
The results generated by this pipeline are not CLIA certified and should not be considered diagnostic.


Introduction

The ODHL_rAcecaR pipeline is designed for the detection and characterization of antimicrobial resistance (AMR) genes and other relevant genomic features from high-throughput sequencing data. This pipeline processes bacterial whole genome sequencing (WGS) and metagenomic sequencing data to identify antimicrobial resistance determinants, sequence types, and genomic features relevant to public health surveillance and epidemiological investigations.

This workflow is developed and maintained by the Ohio Department of Health Laboratory (ODHL) and is built using Nextflow, a robust workflow management system that ensures reproducibility and scalability across HPC clusters, cloud environments, and local machines. The pipeline was built off the work of the CDC Phoenix Pipeline. The workflow leverages Docker and Singularity containers to provide a standardized execution environment. Nextflow


Pipeline Workflow

The ODHL_rAcecaR pipeline consists of the following stages:

1. Read Quality Control (QC)

  • Tool: FastQC
  • Purpose: Evaluates sequencing read quality for potential issues such as adapter contamination and sequencing errors.

2. Read Trimming and Filtering

  • Tool: Fastp
  • Purpose: Removes low-quality bases and adapter sequences to improve downstream analysis.

3. Taxonomic Classification

  • Tool: Kraken2
  • Purpose: Assigns taxonomic labels to sequencing reads based on a comprehensive reference database.

  • Tool: Krona

  • Purpose: Generates interactive visualizations of taxonomic classification results.

4. Genome Assembly and Quality Assessment

  • Tool: SPAdes
  • Purpose: Performs de novo genome assembly from short-read sequencing data.

  • Tool: QUAST

  • Purpose: Assesses the quality and completeness of genome assemblies.

5. Sequence Typing & Resistance Gene Detection

  • Tool: MLST
  • Purpose: Identifies sequence types (STs) of bacterial genomes for epidemiological tracking.

  • Tool: AMRFinderPlus

  • Purpose: Detects known antimicrobial resistance (AMR) genes and point mutations associated with resistance.

6. Plasmid & Virulence Gene Identification

  • Tool: PlasmidFinder
  • Purpose: Identifies plasmid sequences associated with resistance gene transmission.

  • Tool: Gamma

  • Purpose: Detects horizontally transferred genes associated with virulence and resistance.

7. Phylogenetic Analysis & Similarity Metrics

  • Tool: FastANI
  • Purpose: Measures genome-wide similarity between bacterial isolates.

  • Tool: Mash

  • Purpose: Quickly estimates genetic distance between samples for clustering analyses.

Input Requirements

The ODHL_rAcecaR pipeline processes paired-end or single-end Illumina sequencing reads in FASTQ format. A sample sheet (CSV/TSV) should be provided with metadata for each sample with other input. View the Getting Started page and Preparing Files page for more information.for more information.


Output Summary

View the Output page for more information.


Citations & Acknowledgments

This pipeline was developed by the Ohio Department of Health Laboratory (ODHL) to support public health surveillance of antimicrobial resistance (AMR). For contribution and contact information, view the contribution page.


Last update: 2025-01-29