Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keep track of the number of cells with at least one expressed gene in samples and pseudo-bulks #1273

Open
3 tasks
arteymix opened this issue Oct 28, 2024 · 4 comments
Assignees
Labels
single cell Issues related to single-cell data support

Comments

@arteymix
Copy link
Member

arteymix commented Oct 28, 2024

This is important to know how many cells contribute to a given pseudo-bulk or sample.

It is not equal to the number of cells per sample since we do not retain all the genes upon import.

  • number of cells with at least one gene expressed
  • number of genes with at least one cell expressing it
  • number of non-zero cell-by-gene values
@arteymix arteymix self-assigned this Oct 28, 2024
@arteymix arteymix added the single cell Issues related to single-cell data support label Oct 28, 2024
@arteymix
Copy link
Member Author

We have an way of computing this for MEX. It might be better to use the pre-computed value once this is implemented.

@arteymix
Copy link
Member Author

arteymix commented Oct 29, 2024

For MEX, were actually computing the NNZ in the matrix, which is different.

We should also consider keeping that number and also the number of genes with at least one cell expressing it in a sample.

@arteymix
Copy link
Member Author

arteymix commented Oct 29, 2024

If we are going to do this, we need to validate that there are no actual zeroes in the sparse data vector.

Also, for log-transformed data, zero is -inf, this might be something we have to deal with.

arteymix added a commit that referenced this issue Oct 29, 2024
…etrics

Add 3 fields in BioAssay to keep track of the number of cells, design
elements and cell-by-design-element pairs with a non-zero expression
value.

See #1273 for details.
@arteymix
Copy link
Member Author

Ok, the code is there, I just need to implement the logic for filling the fields.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
single cell Issues related to single-cell data support
Projects
None yet
Development

No branches or pull requests

1 participant