Fragmented genomes #42

Open
giovannaVeiga opened this issue Nov 8, 2023 · 3 comments

Comments

@giovannaVeiga

Hi everyone,

Do you think I can run make_lastz_chains on fragmented genomes? One has 757 scaffolds and the other has 60,750. I already used RagTag to improve the assembly, and both genomes are already masked. I am using a 32-CPU server to run the analysis, and I am planning to run TOGA too.

Thanks in advance

@MichaelHiller
Collaborator

Yes, TOGA has the ability to join orthologous gene fragments that are split across scaffolds.
The flag to enable this is now on by default.

It is always a bit hard to predict how well this works, but we showed in Fig 4 that this can be quite effective.

@giovannaVeiga
Author

giovannaVeiga commented Nov 16, 2023 via email

@MichaelHiller
Collaborator

Right, information on U12 introns and comprehensive isoform knowledge is more difficult to get for other species. Of course, having them would help.
I wouldn't worry too much about U12 introns, as there are only 500-700 U12 introns in total and some of them are also GT-AG.

But if you include several isoforms per gene, then list which transcripts belong to which gene in the isoform file. Otherwise, TOGA will treat each isoform (transcript) as a separate gene, and the orthology types will be wrong.
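
As a rough illustration of the isoform file described above, here is a minimal Python sketch that writes a gene-to-transcript table and checks that every transcript in the reference annotation has a gene assigned. It assumes the isoform file is a two-column, tab-separated file with the gene ID first and the transcript ID second, and that the transcript ID is the fourth (name) column of a BED12 annotation; please verify both against the TOGA README before relying on it. The function names and the gene and transcript IDs are hypothetical and not part of TOGA.

```python
import csv


def write_isoforms(pairs, out):
    """Write one 'gene_id<TAB>transcript_id' line per transcript.

    Assumption: gene ID first, transcript ID second, no header line.
    """
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    for gene_id, transcript_id in pairs:
        writer.writerow([gene_id, transcript_id])


def missing_from_isoforms(bed12_lines, pairs):
    """Return transcript IDs from BED12 lines that have no gene assignment."""
    mapped = {transcript_id for _, transcript_id in pairs}
    missing = []
    for line in bed12_lines:
        # Skip blank lines and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        # Column 4 of a BED line holds the feature name (here: transcript ID).
        name = line.rstrip("\n").split("\t")[3]
        if name not in mapped:
            missing.append(name)
    return missing


# Hypothetical example: GENE1 has two isoforms, GENE2 has one.
example_pairs = [
    ("GENE1", "GENE1.t1"),
    ("GENE1", "GENE1.t2"),
    ("GENE2", "GENE2.t1"),
]

if __name__ == "__main__":
    with open("isoforms.tsv", "w", newline="") as handle:
        write_isoforms(example_pairs, handle)
```

Any transcript reported by `missing_from_isoforms` has no gene assigned in the table, so it is worth adding it to the isoform file before running TOGA.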
