Rcorrector

Described in:

Song, L., Florea, L., Rcorrector: Efficient and accurate error correction for Illumina RNA-seq reads. GigaScience. 2015, 4:48.

Rcorrrector includes the program Jellyfish2

What is Rcorrector?

Rcorrector(RNA-seq error CORRECTOR) is a kmer-based error correction method for RNA-seq data.

Rcorrector can also be applied to other type of sequencing data where the read coverage is non-uniform, such as single-cell sequencing.

Install

Clone the GitHub repo, e.g. with git clone https://github.com/mourisl/rcorrector.git
Run make in the repo directory During the make procedure, the script will check whether you have jellyfish2 in $PATH. If not, it will download and compile jellyfish2 from its repository.

Usage

Usage: perl run_rcorrector.pl [OPTIONS]
OPTIONS:
	Required
	-s seq_files: comma separated files for single-end data sets
	-1 seq_files_left: comma separated files for the first mate in the paried-end data sets
	-2 seq_files_right: comma separated files for the second mate in the paired-end data sets
	-i seq_files_interleaved: comma sperated files for interleaved paired-end data sets
	Optional
	-k INT: kmer_length (<=32, default: 23)
	-od STRING: output_file_directory (default: ./)
	-t INT: number of threads to use (default: 1)
	-trim : allow trimming (default: false)
	-maxcorK INT: the maximum number of correction within k-bp window (default: 4)
	-wk FLOAT: the proportion of kmers that are used to estimate weak kmer count threshold, lower for more divergent genome (default: 0.95)
	-ek INT: expected number of kmers; does not affect the correctness of program but affects the memory usage (default: 100000000)
	-stdout: output the corrected reads to stdout (default: not used)
	-verbose: output some correction information to stdout (default: not used)
	-stage INT: start from which stage (default: 0)
		0-start from begining(storing kmers in bloom filter) ;
		1-start from count kmers showed up in bloom filter;
		2-start from dumping kmer counts into a jf_dump file;
		3-start from error correction.

Output

For each input file, Rcorrector will generate the corresponding output file with "*.cor.fq/fa" in the directory specified by "-od".

In the header line for each read, Rcorrector will append some information.

"cor": some bases of the sequence are corrected
"unfixable_error": the errors could not be corrected
"l:INT m:INT h:INT": the lowest, median and highest kmer count of the kmers from the read

Example

We put a small sample data set, you can run them by:

perl run_rcorrector.pl -1 Sample/sample_read1.fq -2 Sample/sample_read2.fq

Contact

[email protected]

Terms of use

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received (LICENSE.txt) a copy of the GNU General Public License along with this program; if not, you can obtain one from http://www.gnu.org/licenses/gpl.txt or by writing to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA

Support

Create a GitHub issue.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
Sample		Sample
ErrorCorrection.cpp		ErrorCorrection.cpp
ErrorCorrection.h		ErrorCorrection.h
File.h		File.h
KmerCode.cpp		KmerCode.cpp
KmerCode.h		KmerCode.h
KmerInfo.h		KmerInfo.h
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
ReadStore.h		ReadStore.h
Reads.h		Reads.h
Store.h		Store.h
main.cpp		main.cpp
run_rcorrector.pl		run_rcorrector.pl
utils.h		utils.h
verify.cpp		verify.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rcorrector

What is Rcorrector?

Install

Usage

Output

Example

Contact

Terms of use

Support

About

Releases

Packages

Languages

License

XuanrZhang/Rcorrector

Folders and files

Latest commit

History

Repository files navigation

Rcorrector

What is Rcorrector?

Install

Usage

Output

Example

Contact

Terms of use

Support

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages