create zip/tgz on special dataset's endpoint #35

TomasKulhanek · 2018-03-08T11:33:12Z

I’d add some my ideas about zip/tgz of RAW data of repository.
I propose to consider streaming the data instead of ZIP/TGZ them in batch thus a zip/tgz is made on demand
Reason to do that

User can select specific folders/files to be included in the zip stream.
No need to store extremely big zip on server, no need to wait until big archive is made
If you need to save space on repository storage, there can be configured compression on filesystem transparently – there are tools like BTRFS, no need to implement it on level of D6.2 code base
Streaming is simple – made prototype – cgi script which compress the user’s directory on demand and sends stream directly to http session channel.
Such stream can be redirected directly to other service: virtual folder storage.

TomasKulhanek added this to the D6.2 Repository Instance milestone Mar 8, 2018

TomasKulhanek assigned TomasKulhanek, chrishmorris and andreagia Mar 8, 2018

TomasKulhanek added the enhancement label Mar 20, 2018

TomasKulhanek changed the title ~~zip/tgz of dataset's RAW data files~~ create zip/tgz on special dataset's endpoint Mar 20, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

create zip/tgz on special dataset's endpoint #35

create zip/tgz on special dataset's endpoint #35

TomasKulhanek commented Mar 8, 2018

create zip/tgz on special dataset's endpoint #35

create zip/tgz on special dataset's endpoint #35

Comments

TomasKulhanek commented Mar 8, 2018