You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Asking for ability to plot numerical data from an arbitrary column of coordinate sorted, bgzip compressed and tabix indexed data file.
This kind of data is quite common output from various genome wide analysis programs, e.g. differential methylation analysis on individual CpG sites, which have data only on 1/16:th of genome with irregular intervals and has multiple putatively interesting numbers, such as p-value, difference between the two conditions, methylation level on cases and methylation level on controls. This data format is often post-processed in further analysis, so keeping all data in single rows is preferable.
From implementation perspective documentation and alias for gwas format would work for most situations. The syntax for gwas format definition is revealed here. Improved features would be (1) optionally distinct start and end coordinates (currently it's only a single pos) (2) ability to name the columns for the popup display and (3) getting the header defined by the tabix index (as given by tabix -H command/option)
@kpalin I don't understand what is to be done with the file header, if any ("-H"). What information in there would be useful? Are you thinking of a track line? Feel free to zip and provide an example.
Allowing for a track line would be a good idea also but I was thinking about the column names treated the same as names array above. Here's sample_data.zip.
Asking for ability to plot numerical data from an arbitrary column of coordinate sorted, bgzip compressed and tabix indexed data file.
This kind of data is quite common output from various genome wide analysis programs, e.g. differential methylation analysis on individual CpG sites, which have data only on 1/16:th of genome with irregular intervals and has multiple putatively interesting numbers, such as p-value, difference between the two conditions, methylation level on cases and methylation level on controls. This data format is often post-processed in further analysis, so keeping all data in single rows is preferable.
From implementation perspective documentation and alias for
gwas
format would work for most situations. The syntax forgwas
format definition is revealed here. Improved features would be (1) optionally distinct start and end coordinates (currently it's only a single pos) (2) ability to name the columns for the popup display and (3) getting the header defined by the tabix index (as given by tabix -H command/option)Suggestion for track definition syntax would be
Previously discussed in the discussion about pull request #1540
The text was updated successfully, but these errors were encountered: