Using RNA for MotifSeq #34

Open
epi-gene opened this issue Mar 24, 2020 · 9 comments
@epi-gene

Would it be possible to simulate RNA squiggles and perform a motif search?

@Psy-Fer
Owner

Psy-Fer commented Mar 24, 2020

Hello,

Yes, just use the following flag:
--scrappie_model squiggle_r94_rna

@Psy-Fer Psy-Fer self-assigned this Mar 25, 2020
@Psy-Fer Psy-Fer added help wanted Extra attention is needed question Further information is requested labels Mar 25, 2020
@epi-gene
Author

Hi. I tried the above, but MotifSeq is yielding no results. The hit probability is 0 for all the reads.

@epi-gene
Author

Is there a way to calculate them manually?

@Psy-Fer
Owner

Psy-Fer commented Mar 26, 2020

Could you please give me an example of the output?
I'll look into it. I have some RNA data on hand to double check.

@epi-gene
Author

| fast5 | readID | model | start | end | length | distance_score | model_mean | model_stdev | Z-score | p-value | hit_Probability |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 8e91fa7d-fd33-422b-8dfb-69915b17aa8d.fast5 | 8e91fa7d-fd33-422b-8dfb-69915b17aa8d | last50 | 11839 | 12071 | 232 | 399.438241083274 | 135.4 | 11.465672 | 23.0285883883015 | 1 | 0 |
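
For anyone wanting to check these columns by hand: the Z-score in the row above is reproduced by `(distance_score - model_mean) / model_stdev`. The sketch below recomputes it in Python. The p-value and hit probability formulas (a standard normal CDF and its complement) are only an assumption that happens to match this row; they are not taken from the MotifSeq source, and the function names are made up for illustration.

```python
import math


def z_score(distance_score, model_mean, model_stdev):
    """Standardise a distance score against the background model."""
    return (distance_score - model_mean) / model_stdev


def normal_cdf(z):
    """Standard normal CDF, written with math.erf from the standard library."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Values copied from the output row above.
z = z_score(399.438241083274, 135.4, 11.465672)

# ASSUMPTION: p-value = normal CDF of z, hit probability = 1 - p-value.
# This is consistent with the row (p-value 1, hit_Probability 0) but is
# not confirmed by the tool's code or documentation.
p_value = normal_cdf(z)
hit_probability = 1.0 - p_value

print(f"Z-score: {z:.6f}")
print(f"p-value: {p_value}, hit probability: {hit_probability}")
```

A distance score about 23 standard deviations above the background mean saturates the CDF, so under this assumption any read scored against a poorly matched background model would report a hit probability of 0.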

@epi-gene
Author

You did mention this in the readme:

"The p-values and hit probabilities provided are based on loose modelling of negative background scores for a number of k-mers. It is currently only modelled on R9.4 model, not R10 or RNA"

@epi-gene
Author

@Psy-Fer Would MotifSeq be able to detect multiple motif hits within a single read? Or would it display only the highest-scoring hit?

@Psy-Fer
Owner

Psy-Fer commented Apr 30, 2020

Currently, only the highest scoring hit.

I plan on allowing it to record more than the top hit.

A roundabout way of "hacking" a solution is to mask out the region of the best hit and try again. Take the signal values between the start and end positions and either change them to the mean current, or set them to 99999 so they get filtered out altogether. Obviously that isn't ideal, and I'll try fixing this.
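
A minimal sketch of that masking workaround, assuming the raw signal for a read is already loaded as a plain sequence of numbers. `mask_hit` is a hypothetical helper, not part of MotifSeq, and using the mean of the whole read as the "mean current" is an assumption. The reported row (start 11839, end 12071, length 232) suggests `end - start == length`, so the region is treated as half-open here.

```python
from statistics import fmean


def mask_hit(signal, start, end, mode="mean", sentinel=99999.0):
    """Return a copy of `signal` with the best-hit region [start, end) masked.

    mode="mean"     -> fill the region with the mean of the whole read
    mode="sentinel" -> fill the region with a large value (99999 by default)
                       so that it gets filtered out downstream
    """
    values = [float(x) for x in signal]
    if not 0 <= start < end <= len(values):
        raise ValueError("need 0 <= start < end <= len(signal)")
    if mode == "mean":
        fill = fmean(values)
    elif mode == "sentinel":
        fill = sentinel
    else:
        raise ValueError("mode must be 'mean' or 'sentinel'")
    values[start:end] = [fill] * (end - start)
    return values


# Toy signal standing in for a real read.
signal = [float(i) for i in range(10)]
masked_mean = mask_hit(signal, 2, 5)
masked_sentinel = mask_hit(signal, 2, 5, mode="sentinel")
print(masked_mean)
print(masked_sentinel)
```

The masked signal would then be searched again to surface the next-best hit, repeating until the scores are no longer interesting.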

@epi-gene
Author

Ok. Will try that.
