Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extending OpenMS for compress support #7560

Open
ypriverol opened this issue Jul 31, 2024 · 6 comments
Open

Extending OpenMS for compress support #7560

ypriverol opened this issue Jul 31, 2024 · 6 comments

Comments

@ypriverol
Copy link
Contributor

Multiple users are using quantms in AWS for FLQ and TMT analysis. One of the feature requests is to decrease the size of the intermediate files, ideXML, and consensusXML. We have been discussing this internally and would be great to have native support in OpenMS tools including the proteomicsLFQ, and the Search Adapters for compressed files. Please provide your feedback before starting the implementation @timosachsenberg.

Related to issue bigbio/quantms#393

@jpfeuffer
Copy link
Contributor

Reading should work already if the file type is registered as a possible filetype for id and cxml.

Writing should also be easy by adding something like this:

https://techoverflow.net/2020/01/13/how-to-gzip-compress-on-the-fly-in-c-using-boostiostreams/

to the writeTo methods of affected xmlhandlers.
We could probably abstract this across all handlers if we let them accept non std ostreams.

Alternatively you can probably just use named pipes on any unix system and compress on the fly right now without any changes in OpenMS.

@ypriverol
Copy link
Contributor Author

@jpfeuffer we have one student working on this issue. I was talking to @timosachsenberg today about this feature and he agreed is a good thing to do. What do you think about the feature? I think in AWS is crucial to have smaller files to transfer because you have to pay for space and bandwidth.

@jpfeuffer
Copy link
Contributor

Yes, useful feature in general. You can try with unix tools first to see how much it helps and how much it affects the runtime.

@timosachsenberg
Copy link
Contributor

I think the mzML handler is one example that already implements this

@jpfeuffer
Copy link
Contributor

Hmm I'm not sure if we support writing gzipped mzml. I did not find any relevant code in a quick search.

@timosachsenberg
Copy link
Contributor

Ah ok then its only in the reader...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants