ProteomicsLFQ memory consumed #432

Open
daichengxin opened this issue Oct 18, 2024 · 12 comments

@daichengxin
Collaborator

daichengxin commented Oct 18, 2024

Description of the bug

The error below is reported when I run the PXD001819 LFQ dataset (about 10 GB of mzML files). It looks like the process is running out of memory, but 120 GB of memory is available, so I'm not sure whether this is expected.

terminate called after throwing an instance of 'std::bad_alloc'
  what():  std::bad_alloc

Command used and terminal output

ProteomicsLFQ \
  -threads 2 \
  -in UPS1_12500amol_R1.mzML UPS1_12500amol_R2.mzML UPS1_12500amol_R3.mzML \
      UPS1_125amol_R1.mzML UPS1_125amol_R2.mzML UPS1_125amol_R3.mzML \
      UPS1_25000amol_R1.mzML UPS1_25000amol_R2.mzML UPS1_25000amol_R3.mzML \
      UPS1_2500amol_R1.mzML UPS1_2500amol_R2.mzML UPS1_2500amol_R3.mzML \
      UPS1_250amol_R1.mzML UPS1_250amol_R2.mzML UPS1_250amol_R3.mzML \
      UPS1_50000amol_R1.mzML UPS1_50000amol_R2.mzML UPS1_50000amol_R3.mzML \
      UPS1_5000amol_R1.mzML UPS1_5000amol_R2.mzML UPS1_5000amol_R3.mzML \
      UPS1_500amol_R1.mzML UPS1_500amol_R2.mzML UPS1_500amol_R3.mzML \
      UPS1_50amol_R1.mzML UPS1_50amol_R2.mzML UPS1_50amol_R3.mzML \
  -ids UPS1_12500amol_R1_comet_feat_perc_pep_filter.idXML UPS1_12500amol_R2_comet_feat_perc_pep_filter.idXML UPS1_12500amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_125amol_R1_comet_feat_perc_pep_filter.idXML UPS1_125amol_R2_comet_feat_perc_pep_filter.idXML UPS1_125amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_25000amol_R1_comet_feat_perc_pep_filter.idXML UPS1_25000amol_R2_comet_feat_perc_pep_filter.idXML UPS1_25000amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_2500amol_R1_comet_feat_perc_pep_filter.idXML UPS1_2500amol_R2_comet_feat_perc_pep_filter.idXML UPS1_2500amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_250amol_R1_comet_feat_perc_pep_filter.idXML UPS1_250amol_R2_comet_feat_perc_pep_filter.idXML UPS1_250amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_50000amol_R1_comet_feat_perc_pep_filter.idXML UPS1_50000amol_R2_comet_feat_perc_pep_filter.idXML UPS1_50000amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_5000amol_R1_comet_feat_perc_pep_filter.idXML UPS1_5000amol_R2_comet_feat_perc_pep_filter.idXML UPS1_5000amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_500amol_R1_comet_feat_perc_pep_filter.idXML UPS1_500amol_R2_comet_feat_perc_pep_filter.idXML UPS1_500amol_R3_comet_feat_perc_pep_filter.idXML \
       UPS1_50amol_R1_comet_feat_perc_pep_filter.idXML UPS1_50amol_R2_comet_feat_perc_pep_filter.idXML UPS1_50amol_R3_comet_feat_perc_pep_filter.idXML \
  -design PXD001819.sdrf_openms_design.tsv \
  -fasta uniprot_yeast_ups_decoy.fasta \
  -protein_inference aggregation \
  -quantification_method feature_intensity \
  -targeted_only false \
  -feature_with_id_min_score 0.10 \
  -feature_without_id_min_score 0.75 \
  -mass_recalibration false \
  -Seeding:intThreshold 1000 \
  -protein_quantification unique_peptides \
  -alignment_order star \
  -psmFDR 0.01 \
  -proteinFDR 0.01 \
  -picked_proteinFDR true \
  -out_cxml PXD001819.sdrf_openms_design_openms.consensusXML \
  -out PXD001819.sdrf_openms_design_openms.mzTab \
  -out_msstats PXD001819.sdrf_openms_design_msstats_in.csv \
  -PeptideQuantification:extract:batch_size 1000 \
  -debug 0 \
  2>&1 | tee proteomicslfq.log

Relevant files

log file: proteomicslfq.log

System information

quantms 1.3.0

daichengxin added the "bug" (Something isn't working) label Oct 18, 2024
@jpfeuffer
Collaborator

I have analysed this dataset many times. Never had issues.

@jpfeuffer
Collaborator

You can trace the memory consumption while it's running.
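
One way to do that, as a minimal sketch rather than anything quantms or OpenMS ships: launch the tool from a small Python wrapper and poll its resident set size from `/proc/<pid>/status`. This assumes a Linux host; the helper names `parse_vm_rss_kib` and `trace_memory` and the 5-second interval are made up for illustration. It only watches the launched process itself, not any child processes.

```python
import subprocess
import sys
import time
from pathlib import Path


def parse_vm_rss_kib(status_text):
    """Return the VmRSS value in KiB from the text of /proc/<pid>/status, or None."""
    for line in status_text.splitlines():
        if line.startswith("VmRSS:"):
            # Line looks like: "VmRSS:     123456 kB"
            return int(line.split()[1])
    return None


def trace_memory(cmd, interval_s=5.0):
    """Run cmd, print its resident memory every interval_s seconds.

    Returns (exit code, peak observed RSS in KiB). Linux only (reads /proc).
    """
    proc = subprocess.Popen(cmd)
    status_file = Path(f"/proc/{proc.pid}/status")
    peak_kib = 0
    while proc.poll() is None:
        try:
            rss_kib = parse_vm_rss_kib(status_file.read_text())
        except OSError:
            rss_kib = None  # process exited between poll() and the read
        if rss_kib is not None:
            peak_kib = max(peak_kib, rss_kib)
            stamp = time.strftime("%H:%M:%S")
            print(f"{stamp}  RSS = {rss_kib / 1024**2:.2f} GiB", flush=True)
        time.sleep(interval_s)
    return proc.returncode, peak_kib


if __name__ == "__main__" and len(sys.argv) > 1:
    # Everything after the script name is the command to run and watch.
    code, peak = trace_memory(sys.argv[1:])
    print(f"exit code {code}, peak RSS {peak / 1024**2:.2f} GiB")
```

Saved as, say, `trace_memory.py`, it would be invoked as `python trace_memory.py ProteomicsLFQ -threads 2 -in ...` with the rest of the original arguments. The timestamps make it possible to line up the memory growth with the stages printed in the tool's own log.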

@ypriverol
Member

Me too, but the latest release, quantms 1.3.0, uses OpenMS 3.2.0. We have other ongoing problems with this version and the mzTab export in ProteinQuantifier. @timosachsenberg Can you help us here?

@timosachsenberg

timosachsenberg commented Oct 18, 2024

The log does not indicate that the export is the problem. Is there a way we can find out where and when this regression was introduced?

@daichengxin
Collaborator Author

I retried with OpenMS 3.2.0 and traced the memory usage. It does exceed the available memory. Why does it run out of memory? The mzML files total only 10 GB. I haven't encountered this before either.

@timosachsenberg

Thanks for checking. This is really suspicious. Can you reproduce it with, e.g., one or two files?

@daichengxin
Collaborator Author

daichengxin commented Oct 18, 2024

I can reproduce this with two files, but a single file works. Test files: https://www.dropbox.com/scl/fi/jgbw0pvnm18cga1kwgy54/proteomicslfq.zip?rlkey=6igoyec9ffztk9p8f4uriukct&st=osldx9cp&dl=0

@timosachsenberg

I can confirm that it uses 400 GB for two small files during feature extraction.
My first guess would be that something in, e.g., the OpenSWATH code has changed.

@timosachsenberg

This is likely related to a difference in the conversion done with ThermoRawFileParser (TRFP).
The file works when converted with ProteoWizard msconvert.

@timosachsenberg

Is this fixed?

@ypriverol
Member

We reverted to the previous version of ThermoRawFileParser, which is not ideal. We will investigate in more detail with the new version of ThermoRawFileParser.

To be honest I have lost track of this.
