AssignTaxonomy

AssignTaxonomy is an all Julia implementation of the RDP Naive Bayesian Classifier algorithm for assigning taxonomic classifications based on DNA sequences. Most users will only need to use the assign_taxonomy function on a pair of fasta files (one with target sequences, one with a reference database). However, additional functions are provided for reading in reference and target fasta files, for those who prefer to work with Julia data structures (e.g. vectors of DNA sequences, arrays of taxonomic classifications).

Results can be easily converted to DataFrame or saved to CSV

using AssignTaxonomy, CSV, DataFrames

my_results = assign_taxonomy(targets,refs)
df = DataFrame(my_results)
CSV.write("my_results.csv",my_results)

You can also store and reuse log_probabilities from the classifier. Basically, training the model of your reference data once and then re using it on new target data.

using AssignTaxonomy

my_results,my_lp = assign_taxonomy(targets,refs,keep_lp = true)
my_new_results = get_targets(some_other_target_fasta,refs,lp = my_lp)
all_my_results = AssignTaxonomy.classification_result(vcat(values(my_results),values(my_results)))

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/workflows		.github/workflows
build		build
docs		docs
src		src
test		test
.appveyor.yml		.appveyor.yml
.gitignore		.gitignore
5sp_16S.fasta		5sp_16S.fasta
LICENSE		LICENSE
Project.toml		Project.toml
README.md		README.md
my-out.fasta		my-out.fasta
rdp_train_set_16.fa		rdp_train_set_16.fa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AssignTaxonomy

About

Releases 1

Packages

Languages

License

EvoArt/AssignTaxonomy.jl

Folders and files

Latest commit

History

Repository files navigation

AssignTaxonomy

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages