Fixing minor CRAN check issues.
- Experimenting with using
''
instead ofNA
for missing values at the suggestion of Davis Vaughan (issue #212). - Fix failing CRAN tests due to
vctrs
update.
Minor update
- Fixed
taxonomy
printing error - Removed
default.stringsAsFactors
since it is depreciated.
LARGE CHANGES:
The beginning of a complete rewrite of the taxa
package to make the more basic component classes more like base R vectors.
The taxmap
class is not yet reimplemented, but will be similar to the class in the previous versions of taxa.
The old version of taxa
has been incorporated into the metacoder
package until this version of taxa is mature, at which time metacoder
will also use this version.
- Fixed bug in
n_obs
that would cause an error when used on an object with tables with columns named by numbers. - Various minor bug fixes
- Numeric column names in tables in
taxmap
are now supported - Various small bug fixes
- Parsers now correclty handle zero-length inputs (issue #185).
taxonomy_table
optionadd_id_col
now works (issue #191).
- The
parse_tax_data
optionclass_col
now accepts negative column indexes, meaning "all other columns".
- Added the
taxonomy_table
function that converts the information in ataxmap
ortaxonomy
object into a table with taxa as rows and ranks as columns. - Added the
print_tree
function that prints text-based trees oftaxmap
ortaxonomy
objects (issue #173). - Added
get_dataset
function to get a single data set fromtaxmap
objects. Useful for piping with%>%
. filter_taxa
andfilter_obs
can now subset anything that has names, length, and can be subset, not just tables, lists, and vectors. For example,DNAbin
objects from theape
package can now be used intaxmap
objects (issue #178).
- Parsers are somewhat faster and use less RAM (issue #177).
taxmap
andtaxonomy
parsers now treat taxa with the same name and same place in the taxonomy, but different ranks, database IDs, or authorities, as different taxa.filter_obs
can now filter multiple datasets at once if they are the same length (issue #179).select_obs
andarrange_obs
can now work on multiple datasets at once.
- Made the
"taxon_rank"
value for theclass_key
options work withextract_tax_data
. - Fixed bug in
taxmap
print method when printing tables with only a taxon ID column (issue #181).
- Option
target
in many functions renamed todata
to make it more intuitive.
parse_tax_data
can now incorporate rank information which can be accessed byresult$taxon_ranks()
(issue #113).taxmap
print methods now have more information and color (issue #124).- Added
leaves_apply
function that works likesubtaxa_apply
, but on leaves (issue #126). - Functions with a
value
option now return named taxon indexes by default, instead of unnamed taxon indexes (issue #128). lookup_tax_data
andextract_tax_data
can now use "fuzzy" matching when looking up taxon names, so taxon names can be misspelled and still be founds.lookup_tax_data
andextract_tax_data
now only look up unique sequence IDs, improving download speed.filter_obs
now can filter out observations in non-target data sets that are associated with taxa that are removed whendrop_taxa = TRUE
(issue #143). This is done usingfilter_taxa
, so thesupertaxa
,subtaxa
, andreassign_obs
options are now available tofilter_obs
to control how taxon removal is done.lookup_tax_data
andextract_tax_data
now have progress bars instead of printing lots of text when downloading information.mutate_obs
now creates new vector/tables if the data set specified does not exist (issue #121).- Add
filter_taxa
optionkeep_order
that preserves input taxon order. It isTRUE
by default, which changes how it used to work. Set toFALSE
for old behavior. - Using NSE with an ambiguous name (appears in multiple datasets) now produces a warning (issue #153).
- The
simplify
option in many functions is now always handled the same way: If all vectors in a list are names, then unique key-value pairs are returned. Otherwise, names are ignored and unique values are returned. - The
leaves
option now behaves likesubtaxa
, returning all leaves for each taxon. The old behavior can be replicated by setting the newsimplify
option toTRUE
(issue #127).
filter_taxa
now has better error messages for invalid inputs (issue #117).- Fix a bug that caused an error in
filter_taxa
when no taxa pass filter (issue #116). - Fixed a bug in
parse_tax_data
whenclass_key
was not named (issue #131). - Fixed bug in
hierarchy
print method withtaxon_id
class was not used (issue #138). - Fixed bug in
parse_tax_data
when all classification data isNA.
- Fixed bug in
taxmap
print method when printing zero-length lists and vectors (issue #148).
- Fixed a few problems with using duplicated inputs to
subset
(issue #88, issue #89) - Fixed a bug that caused an error when using unnamed vectors (issue #86)
- Fixed a bug that prevents using sequence accession numbers (issue #85)
- Fixed bug in
lookup_tax_data
andextract_tax_data
that caused an error when one of the queries failed too download. - Fixed bug that caused "data" argument of
obs_apply
to not work when passed as a variable (issue #97)
- Added
map_data_
for mapping without using NSE. - Make default dataset for
n_obs
andn_obs_1
and make them available for NSE (issue #91 parse_tax_data
/extract_tax_data
can now parse things likephylum;Nitrosopumilales;order;Nitrosopumilaceae;family;
and split out the rank and taxon names by using multiple matches to theclass_regex
whenclass_sep
is NULL.extract_tax_data
now gives warnings if a regex does not match.- Added
n_supertaxa_1
function to get number of immediate supertaxa (always 1 or 0). - Added
branches
function to go withroots
,leaves
, andstems
. (issue #56) - Added
internodes
andis_internode
functions to go withroots
,leaves
,branches
, andstems
. Useful for removing uninformative taxonomic ranks/taxa. - Started to incorporate ability for
taxon
,taxon_name
,taxon_id
,taxon_rank
, andtaxa
to handleNULL
inputs as first class citizens to handle cases when you have essentially a blank taxon (use case comes fromtaxize
package) #95 #107 - data parsers: Put long, often unused columns last (issue #93)
- When parsing classifications that have per-taxon info add input id column (issue #92)
- New function
classification
as an abstraction to get either hierarchy of taxon indexes, names, or ids (issue #57) - New function
get_data_frame
for bothTaxonomy
andTaxmap
objects that wraps aroundget_data
to coerce into adata.frame
. (issue #58) (PR #105)
- In the output of the taxmap parsing functions like
parse_tax_data
, I moved "taxon_id" and "input_index" columns to front and "input" to rear. Also "tax_data" now comes before "class_data".
- Released to CRAN.