Maptcha

Maptcha addresses the hybrid genome scaffolding problem, which involves combining contigs and long reads to create a more complete and accurate genome assembly. Hybrid scaffolding aims to leverage the high accuracy of contigs with the long-range information provided by long reads.

Maptcha's inputs are a FASTA file of contigs and a FASTA file of long reads, and the output is a FASTA file of scaffolds generated from them.

We have three major phases:

Contig Expansion: Initially, the algorithm extends contigs using long reads that align with the ends of these contigs. This phase also involves detecting and connecting successive pairs of contigs using direct long read links, resulting in the generation of partial scaffolds.
Long Read Island Construction: Not all long reads contribute to the initial partial scaffolds, especially those residing in the gap regions between successive scaffolds in the target genome. In this phase, the algorithm identifies long reads that do not map to any first-generation partial scaffolds. These reads are utilized to construct partial scaffolds corresponding to the "island" regions of long reads, forming the second generation of partial scaffolds.
Link Scaffolds with Bridges: In the final phase, the algorithm aims to bridge the first and second generation scaffolds using long reads that serve as bridges between them. This crucial step produces the final set of scaffolds, providing a comprehensive assembly of the genome.

Citation

If you use Maptcha in your research, please cite:

Bhowmik, O., Rahman, T. & Kalyanaraman, A. Maptcha: an efficient parallel workflow for hybrid genome scaffolding. BMC Bioinformatics 25, 263 (2024). https://doi.org/10.1186/s12859-024-05878-4

Installation Instructions

Requirements: Maptcha has the following dependencies:

C++14 (or greater) compliant compiler
MPI library (MPI-3 compatible)
Python 3 (or greater) compliant compiler

Step-by-Step Guide

Clone the Maptcha Repository:

git clone https://github.com/Oieswarya/Maptcha.git
cd Maptcha

Install necessary dependencies:
```
make install-dependencies
```
Compile the source files and setup directories:
```
make all
```
Check if Maptcha is properly installed:
```
./maptcha.sh -h
```

Usage

Run the maptcha.sh script from the root directory:

./maptcha.sh -c path/to/contigs.fa -lr path/to/longreads.fa [options]

-c, --contigs      Path to the contigs input file
-lr,--longreads    Path to the long reads input file
Options:
-o, --output       Output directory (default: $HOME/Maptcha/Output/)
-t, --threads      Number of threads to use (default: 16)
-n, --nodes        Number of nodes to use (default: 2)
-p, --processes    Number of processes per node (default: 2)
-h, --help         Show this help message

Note: This code has been tested on high-performance cluster (HPC) systems with MPI and OpenMP compatibility and has been tested for both PBS and SLURM job scheduling systems.

For a quick test, you can use the provided test input. Navigate within the Maptcha repository and run the `maptcha.sh` script.

~/Maptcha/maptcha.sh -c ~/Maptcha/TestInput/minia_Coxiellaburnetii_contigs.fa -lr ~/Maptcha/TestInput/CoxiellaBurnetii_longreads.fa

The final scaffolds will be located here: ~/Maptcha/Output/Final/finalAssembly.fa, within the Output folder of the Maptcha directory.

Tips:

On some clusters, you may need to load specific modules before installing dependencies and and then also while running Maptcha.
Ensure that you have the appropriate permissions to execute the job script.

Maptcha utilizes the following tools:

JEM-Mapper: JEM-Mapper GitHub Repository
Hifiasm: Hifiasm GitHub Repository

For more detailed usage and configuration options, please refer to the documentation within each tool's repository:

If you encounter any issues, please feel free to open an issue on the GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 211 Commits
Hifiasm		Hifiasm
JEM-Mapper		JEM-Mapper
TestInput		TestInput
src		src
Makefile		Makefile
README.license.GPL.txt		README.license.GPL.txt
README.md		README.md
maptcha.sh		maptcha.sh
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Maptcha

Citation

Installation Instructions

Step-by-Step Guide

Usage

For a quick test, you can use the provided test input. Navigate within the Maptcha repository and run the `maptcha.sh` script.

About

Releases

Packages

Languages

Oieswarya/Maptcha

Folders and files

Latest commit

History

Repository files navigation

Maptcha

Citation

Installation Instructions

Step-by-Step Guide

Usage

For a quick test, you can use the provided test input. Navigate within the Maptcha repository and run the maptcha.sh script.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

For a quick test, you can use the provided test input. Navigate within the Maptcha repository and run the `maptcha.sh` script.

Packages