Linked-read data processing pipeline. Getting you from raw linked reads to assemblies, genotypes, or phased haplotypes. Batteries included 🔋
✨ Now works with TELLseq, stLFR, and non-linked read data! ✨
Detailed installation instructions are described in the documentation.
conda create -n harpy -c bioconda -c conda-forge harpy
conda activate harpypixi global install -c conda-forge -c bioconda harpy
# or locally #
pixi init -c conda-forge -c bioconda projectname && cd projectname && pixi add harpydocker pull quay.io/biocontainers/harpyFind the most recent tag here and replace $TAG with it or use the TAG=$(curl ...) part below to pull the latest version name using the repository's API.
TAG=$(curl -s "https://quay.io/api/v1/repository/biocontainers/harpy/tag/" | cut -d'"' -f6)
apptainer pull docker://quay.io/biocontainers/harpy:$TAGJust call harpy or harpy --help on the command line to get started! If installed via container, then call the containerized-Harpy however you are used to using containers on your system.
harpy module options... args...No data? No problem! Harpy lets you simulate genomic variants from an existing genome and can also create linked-read data from an existing genome! You can see what haplotagging (or other linked read) data and Harpy are like without paying a cent! A simple tutorial on simulating both of these can be found here.
Dimens PV, Franckowiak RP, Iqbal A, Grenier JK, Munn PR, Therkildsen NO. Harpy: a pipeline for processing haplotagging linked-read data. Bioinform Adv. 2025 Jun 5;5(1):vbaf133. doi: 10.1093/bioadv/vbaf133. PMID: 40575478; PMCID: PMC12198493.
