-
Notifications
You must be signed in to change notification settings - Fork 22
Converting TOGA results into different formats
Bogdan Kirilenko edited this page Jul 13, 2023
·
4 revisions
Parent page - Interpreting results
(https://github.com/hillerlab/TOGA/issues/56) TODO: check whether the issue is documented
TOGA saves the resulting annotation into a bed-12 formatted file.
bedToGenePred query_annotation.bed stdout | genePredToGtf stdin query_annotation.gtf
genePredToGtf has some options that could be useful. genePredToGtf - Convert genePred table or file to gtf.
usage:
genePredToGtf database genePredTable output.gtf
If database is 'file' then track is interpreted as a file
rather than a table in database.
options:
-utr - Add 5UTR and 3UTR features
-honorCdsStat - use cdsStartStat/cdsEndStat when defining start/end
codon records
-source=src set source name to use
-addComments - Add comments before each set of transcript records.
allows for easier visual inspection
Note: use a refFlat table or extended genePred table or file to include
the gene_name attribute in the output. This will not work with a refFlat
table dump file. If you are using a genePred file that starts with a numeric
bin column, drop it using the UNIX cut command:
cut -f 2- in.gp | genePredToGtf file stdin out.gp