You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello,
Thank you very much for this wonderful tool and all the great documentation and resources for the community.
For scRNA-seq preprocessing, there is an option to regress out confounding variables in the ScaleDatavars.to.regress parameter. I understand that RunTFIDF should account for confounding variables like sequencing depth.
Looking at a scATAC dataset analyzed in this vignette, I would expect to see greater euchromatin (more open chromatin) in pluripotent stem cells (HSC) compared to background cells.
I ran two different types of DAR analysis: 1. typical wilcoxon test FindMarkers(human.atac, ident.1 = 'HSC') and 2. LR while regressing out nCount_ATACFindMarkers(human.atac, ident.1 = 'HSC', test.use = 'LR',latent.vars = 'nCount_ATAC' )
Indeed, the lattern identifies more genomic features with increased accessibility in HSC suggesting sequencing-depth related confounding effects are still present in the normalized data. I have below two histograms which show the distribution of log2FC between HSC vs background cells.
histogram of avg log2FC using wilcoxon histogram of avg log2FC using LR ~ nCount_ATAC
I need normalized counts that have removed confounding effects like from nCount_ATAC. I'm wondering if it is valid to use the typical NormalizeDataScaleData preprocessing workflow on scATAC dataset to leverage vars.to.regress option? Or would the team have any alternatives?
This discussion was converted from issue #1776 on August 29, 2024 05:13.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Hello,
Thank you very much for this wonderful tool and all the great documentation and resources for the community.
For scRNA-seq preprocessing, there is an option to regress out confounding variables in the
ScaleData
vars.to.regress
parameter. I understand thatRunTFIDF
should account for confounding variables like sequencing depth.Looking at a scATAC dataset analyzed in this vignette, I would expect to see greater euchromatin (more open chromatin) in pluripotent stem cells (HSC) compared to background cells.
I ran two different types of DAR analysis: 1. typical wilcoxon test
FindMarkers(human.atac, ident.1 = 'HSC')
and 2. LR while regressing outnCount_ATAC
FindMarkers(human.atac, ident.1 = 'HSC', test.use = 'LR',latent.vars = 'nCount_ATAC' )
Indeed, the lattern identifies more genomic features with increased accessibility in HSC suggesting sequencing-depth related confounding effects are still present in the normalized data. I have below two histograms which show the distribution of log2FC between HSC vs background cells.
histogram of avg log2FC using wilcoxon
histogram of avg log2FC using LR ~ nCount_ATAC
I need normalized counts that have removed confounding effects like from
nCount_ATAC
. I'm wondering if it is valid to use the typicalNormalizeData
ScaleData
preprocessing workflow on scATAC dataset to leveragevars.to.regress
option? Or would the team have any alternatives?Thank you so much for all your support!
Beta Was this translation helpful? Give feedback.
All reactions