Indeed, we reach the limits of SingleFile at this stage. Have you looked at what is being done around the WARC format, like https://github.com/webrecorder/pywb? There is also the Web Bundles format (see https://web.dev/web-bundles/), which is supposed to solve this kind of problem, but the implementation is not finished yet AFAIK.
-
This is probably out of scope for this project currently, but I figured maybe there's a solution for this anyway ;)
Basically I'm trying to save some pages whose JavaScript fetches JSON files at runtime. Obviously SingleFile can't really be aware of these fetches, as it would have to parse the JavaScript and figure out where the fetches occur. But I was wondering if there is some way to save these kinds of pages regardless?
An example site where this occurs: https://ncase.me/trust/
One possible approach I can think of might be to "record" fetch requests/responses while refreshing the page and then monkey patch `fetch` and `XMLHttpRequest` to return the recorded responses whenever a matching request is encountered.
Are there any other web archiving tools that attempt to solve this problem?
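To make the record/replay idea a bit more concrete, here is a rough sketch of what I have in mind. This is not anything SingleFile does; every name in it (`FetchLike`, `Recording`, `makeRecordingFetch`, `makeReplayingFetch`) is made up for illustration. It assumes requests can be matched by URL alone and ignores HTTP methods, headers, request bodies, and `XMLHttpRequest`.

```typescript
// Hypothetical sketch: none of these names come from SingleFile or any library.

// A simplified stand-in for the parts of fetch() this sketch needs.
type FetchLike = (url: string) => Promise<{ status: number; text(): Promise<string> }>;

interface RecordedResponse {
  status: number;
  body: string;
}

// Recorded responses keyed by request URL (assumption: URL-only matching).
type Recording = Map<string, RecordedResponse>;

// Wraps a fetch-like function and stores every response it sees.
function makeRecordingFetch(realFetch: FetchLike, recording: Recording): FetchLike {
  return async (url) => {
    const response = await realFetch(url);
    const body = await response.text();
    recording.set(url, { status: response.status, body });
    // Hand back a fresh object, since the original body was consumed above.
    return { status: response.status, text: async () => body };
  };
}

// Returns a fetch-like function that answers only from the recording.
function makeReplayingFetch(recording: Recording): FetchLike {
  return async (url) => {
    const hit = recording.get(url);
    if (hit === undefined) {
      throw new Error(`No recorded response for ${url}`);
    }
    return { status: hit.status, text: async () => hit.body };
  };
}
```

In a saved page, the replaying function would presumably be assigned over `window.fetch` (returning real `Response` objects) before any page script runs, with the recording serialized into the saved file.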