- The
legacy
argument in ID translation functions (UUIDtoBarcode
,UUIDtoUUID
,barcodeToUUID
, andfilenameToBarcode
) has been defunct and removed.
UUIDtoBarcode
ensures that results are ordered based on the input UUIDs.- Include informative error message regarding translation of UUIDs from legacy files.
UUIDtoBarcode
returns barcodes consistent with Genomic Data Commons API update
makeSummarizedExperimentFromGISTIC
andsplitAssays
are now defunct.
makeGRangesListFromExonFiles
example removed from the vignette, the GDClegacy
endpoint has been deprecated. For more information see the GDC API release notes version v3.28.0.
- Use https instead of http in
getFileName
helper. - Warn when column names in assays are not mappable and subsequently dropped in
generateMap
. - Updated
qreduceTCGA
documentation for clarity.
- The
UUIDhistory
function allows users to map old UUIDs to new UUIDs according to the latest data release for UUIDs that were affected and no longer query-able. - The
slides
argument has been added to thefilenameToBarcode
function for translating slide file names into barcodes. Currently, the API returns all barcodes of the associated case ID. - Add sections in the vignette regarding GDC Data Updates and UUID history lookup
- Update examples in package to new GDC Data Release, see vignette.
- Use
AnnotationHub
to download chain file in main vignette. - Slide file names now resolve to a single TCGA barcode in
filenameToBarcode
(Thanks @hermidalc) - Improved error messages and documentation for
makeGRangesListFromExonFiles
UUIDtoBarcode
with thefrom_type = "file_id"
argument now returns the IDs in the proper order when more than oneUUID
is input.- Update
makeGRangesListFromCopyNumber
examples with new names from API e.g., 'associated_entities.entity_submitter_id'
makeSummarizedExperimentFromGISTIC
has been moved toRTCGAToolbox
.splitAssays
now deprecated forTCGAsplitAssays
to avoid conflict withMultiAssayExperiment::splitAssays
- Properly identifies genome annotation (
hg*
) inoncoPrintTCGA
qreduceTCGA
now works with updates toseqlevelsStyle
where genome annotation include patch versions when available
correctBuild
attempts to provide the official name of a particular human genome build to agree with changes inGenomeInfoDb
isCorrect
checks that the build name matches the official name
- Documentation improvements to
simplifyTCGA
- Improvements to
findGRangesCols
to locate ranged columns in aDataFrame
- Fixed a bug in
UUIDtoBarcode
where only the first record was returned (#26, @DarioS) - Fixed a bug in
filenameToBarcode
when multiple inputs were used (#22, @DarioS)
README.md
now includes a cheat sheet for referencemergeColData
andoncoPrintTCGA
sections updated/included in the vignette
translateBuild
more robust to consistent inputstranslateBuild
returns vector output instead of single string as beforemakeSummarizedExperimentFromGISTIC
now has a more open interface with...
input toRTCGAToolbox::getGISTICPeaks
oncoPrintTCGA
now usesseqlevels
from input throughout
oncoPrintTCGA
: Create anoncoPrint
visualization for mutation data- Support
aliquot_ids
as input toUUIDtoBarcode
function - Additional sections in the vignette:
CpGtoRanges
,UUIDtoBarcode
foraliquot_ids
TCGAprimaryTumors
allows users to select all primary tumors for a givencuratedTCGAData
MultiAssayExperiment
object (suggested by @vjcitn)
- Now merging clinical data using both rows and columns in
mergeColData
- Added informative error when query results are empty in
UUIDtoBarcode
- Updates to
makeGRangesListFromExonFiles
to useS4Vectors::splitAsList
(@hpages)
trimColData
added to remove any extra columns from thecolData
slot (thanks to @vjcitn)CpGtoRanges
translates CpG islands to genomic positions using an annotation package andminfi
- Overhaul of the barcode translation services allows accurate translation of identifiers
splitAssays
now separates all assays by sample codes contained therein by default, previous behavior had default values- Documentation for
simplifyTCGA
was modified to include similar operations, such as,symbolsToRanges
,mirToRanges
,CpGtoRanges
, etc. - Vignette includes comprehensive examples of new functionality
getFileNames
renamed togetFileName
TCGAsampleSelect
now allows multiple sample type inputs as thesampleCodes
argumentgetSubtypeMap
updates column names to accurately represent patient identifiers- More robust checks were added to
splitAssays
to ensure valid sample codes in the input and provided as arguments makeGRangesListFromExonFiles
is optimized to usedplyr
when available and fast operations fromIRanges
- Various enhancements to
*toRanges
functions, including re-using underlying common helper function - The internal
weightedmean
function inqreduceTCGA
has been updated for correctness - The
keep
arugment inqreduceTCGA
and related functions was changed tokeep.assay
imputeAssay
added to impute data for MultiAssayExperiment assaysUUIDtoUUID
translation available to translate from file to case IDs- A suite of functions is available to enhance existing MultiAssayExperiment
datasets:
qreduceTCGA
,mirToRanges
,symbolsToRanges
. Thanks to @lwaldron
- Various changes to examples for compatibility with RaggedExperiment
- Bug fix to internal functions for finding GRanges columns
uniformBuilds
cleans up a vector of inconsistently labelled builds by changing the build with the lowest frequency
- The
UUIDtoUUID
function can translate from case to file UUIDs and vice versa imputeAssay
allows missing data imputation using KNN forMultiAssayExperiment
assays
- exported the internal helper,
filenameToBarcode
. See examples
- Minor changes in response to review, avoid switching from logical to numeric index, updated vignette introduction
- Fix examples to updated
GenomicDataCommons
interface - Move
RTCGAToolbox
toSuggests
field in DESCRIPTION - Removed
BiocFileCache
fromImports
field
- Added a group of ID translation helper functions (see ?ID-translation)
- Added a group of helper functions that work with
curatedTCGAData
UUIDtoBarcode
function added thanks to @seandavi- Renamed
makeGRangesListFromTCGA
tomakeGRangesListFromCopyNumber
makeSummarizedExperimentFromGISTIC
is now available to convertRTCGAToolbox
'sFirehoseGISTIC
data class toSummarizedExperiment
- Created a function to merge external
colData
to aMultiAssayExperiment
colData
slot - Revamped vignette documentation
- Improvements to
TCGAbiospec
andTCGAbarcode
- Updated
sampleTypes
andclinicalNames
datasets - Updated DESCRIPTION file with appropriate imports and exports
- Various improvements to
findGRangesCols
generateMap
is now updated to the recentMultiAssayExperiment
API with improved example- Updated
getFileNames
to most recentRTCGAToolbox
API - Various updates to data generating scripts in
data-raw
folder - Format updates to NEWS file
- Added tests
- Package renamed to
TCGAutils
for working with TCGA data
TCGAtranslateID
now works with GDC API
- Code cleaned up
- Added proper import directives
makeGRangesListFromDataFrame
now moved toGenomicRanges
makeSummarizedExperimentFromDataFrame
now moved toSummarizedExperiment
getFileNames
function will obtain filenames used inRTCGAToolbox
- Improved
getFileNames
withxml2
andrvest
dependencies and removes theXML
dependency
TCGAextract
now uses thefindGRangesCols
to automatically detect ranged data columns- Arguments in functions
TCGA*
now renamed to matchMultiAssayExperiment
conventions - Informative errors in
TCGAextract
makeGRangesListFromTCGA
data builds onmakeGRangesListFromDataFrame
makeGRangesListFromDataFrame
andmakeRangedSummarizedExperimentFromDataFrame
will be moving to standard Bioconductor packages soon.tcga
andccle
functions soon to be deprecated.- Upcoming:
TCGAbarcode
will be modified for efficiency
- Add your own identifier parsing function for generating a
sampleMap
ingenerateMap
! - Add proper genome build to ranged based objects.
- Return
SummarizedExperiment
class for certain data types. - Fix genome build bugs
makeRSE
function for creating aRangedSummarizedExperiment
object from a data frame.- Bug fixes to
getRangeNames
including the option to enter a regular expression vector for finding ranged column names. matchClinical
renamed toTCGAmatchClinical
getRangedNames
function will try to extract minimum necessary names for creating ranges (works on a vector of names)- minor bug fixes to
TCGAbiospec
,TCGAextract
,makeGRangesList
- Package renamed to
BiocInterfaces
! TCGA
specific functions now start with the lettersTCGA
- Included: more examples of use of the
TCGAbarcode
function - Updated
makeGRangesList
function to work withtcga
andccle
data parameter functions
- Added a
NEWS.md
file to track changes to the package. - TCGAmisc now a standalone package! (previously in
RTCGAToolbox
) - Provides helper functions for converting raw data into S4 objects (e.g.,
GRangesList
) - Provides functions for creating a MultiAssayExperiment object such as:
generateTCGAmap
cleanExpList