Need Help: Getting this Error: `open_http': 400 BAD REQUEST #307

hupictz opened this issue Oct 3, 2024 · 36 comments

@hupictz

hupictz commented Oct 3, 2024

Getting snapshot pages.../System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:359:in `open_http': 400 BAD REQUEST (OpenURI::HTTPError)
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:737:in `buffer_open'
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:212:in `block in open_loop'
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:210:in `catch'
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:210:in `open_loop'
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:151:in `open_uri'
        from /System/Library/Frameworks/Ruby.framework/Versions/2.3/usr/lib/ruby/2.3.0/open-uri.rb:717:in `open'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader/archive_api.rb:13:in `get_raw_list_from_api'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:92:in `block in get_all_snapshots_to_consider'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `times'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `get_all_snapshots_to_consider'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:105:in `get_file_list_curated'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:164:in `get_file_list_by_timestamp'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:309:in `file_list_by_timestamp'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:192:in `download_files'
        from /Library/Ruby/Gems/2.3.0/gems/wayback_machine_downloader-2.3.1/bin/wayback_machine_downloader:72:in `<top (required)>'
        from /usr/local/bin/wayback_machine_downloader:22:in `load'
        from /usr/local/bin/wayback_machine_downloader:22:in `<main>'

@tokiyui

tokiyui commented Oct 4, 2024

I also found a similar error.

$ wayback_machine_downloader http://www.ogurayui.jp/
Downloading http://www.ogurayui.jp/ to websites/www.ogurayui.jp/ from Wayback Machine archives.

Getting snapshot pages....../usr/share/ruby/open-uri.rb:364:in `open_http': 400 BAD REQUEST (OpenURI::HTTPError)
        from /usr/share/ruby/open-uri.rb:741:in `buffer_open'
        from /usr/share/ruby/open-uri.rb:212:in `block in open_loop'
        from /usr/share/ruby/open-uri.rb:210:in `catch'
        from /usr/share/ruby/open-uri.rb:210:in `open_loop'
        from /usr/share/ruby/open-uri.rb:151:in `open_uri'
        from /usr/share/ruby/open-uri.rb:721:in `open'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader/archive_api.rb:13:in `get_raw_list_from_api'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:92:in `block in get_all_snapshots_to_consider'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `times'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `get_all_snapshots_to_consider'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:105:in `get_file_list_curated'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:164:in `get_file_list_by_timestamp'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:309:in `file_list_by_timestamp'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:192:in `download_files'
        from /home/user/.local/share/gem/ruby/gems/wayback_machine_downloader-2.3.1/bin/wayback_machine_downloader:72:in `<top (required)>'
        from /home/user/bin/wayback_machine_downloader:25:in `load'
        from /home/user/bin/wayback_machine_downloader:25:in `<main>'

@intercoop

Same issue!

@hupictz
Author

hupictz commented Oct 6, 2024

Hi Guys,

I found a solution for this.

Use this updated version, and it works.

Credits to ShiftaDeband.

=========================

  1. Install the required Ruby version.

  2. Download the zip of, or clone, the fork from ShiftaDeband: https://github.com/ShiftaDeband/wayback-machine-downloader.

  3. Navigate to wayback-machine-downloader\bin

  4. On Windows, you can launch PowerShell by Shift + right-clicking the empty space in the folder and selecting Open PowerShell.

  5. From here, run the program with ruby wayback_machine_downloader instead of the plain wayback_machine_downloader command you would use if you had installed it as a gem.

If you previously installed the original, nonfunctional gem, you can also uninstall it with gem uninstall wayback_machine_downloader

Note: I tried this and it's working now (tested on 6th October 2024).

Don't forget to give ShiftaDeband a star.

@hupictz hupictz closed this as completed Oct 6, 2024
@hupictz
Author

hupictz commented Oct 6, 2024

If you need more help - check this screenshot.

[Screenshot: Screen Shot 2024-10-06 at 1 11 20 PM]

@hupictz hupictz reopened this Oct 6, 2024
@intercoop

Thanks hupictz, but it doesn't work for me.

@Jacek216

Jacek216 commented Oct 6, 2024

> If you need more help - check this screenshot.

Hi! It really works, just like in your screenshot. Thanks for the help!
But the site opens in the wrong format, not like on web.archive.org. Do you have the same problem?
[Screenshot: screen]

@Jacek216

Jacek216 commented Oct 6, 2024

> Thanks hupictz, but it doesn't work for me.

Use these commands in Windows PowerShell after installing Ruby and downloading wayback-machine-downloader:
cd downloads
cd wayback-machine-downloader-feature-httpGet
cd bin
ruby wayback_machine_downloader example.com --from 20220331

@intercoop

Thank you very much, it worked!

@Jacek216

Jacek216 commented Oct 6, 2024

> Thank you very much, it worked!

Nice! Does your site open correctly, as shown on web.archive.org? Or do you have the same problem after moving the downloaded files to your remote site, as in my previous screenshot?

@intercoop

> Nice! Does your site open correctly, as shown on web.archive.org? Or do you have the same problem after moving the downloaded files to your remote site, as in my previous screenshot?

It only contains HTML files; the CSS and image files are not downloaded. I can't figure out why this is happening.

@Jacek216

Jacek216 commented Oct 6, 2024

> It only contains HTML files; the CSS and image files are not downloaded. I can't figure out why this is happening.

Yes, I see the same with different sites.

P.S. Everything is okay now; it depends on the site.

@hupictz
Author

hupictz commented Oct 6, 2024

> It only contains HTML files; the CSS and image files are not downloaded. I can't figure out why this is happening.

@intercoop I am able to download all types of files: .png, .css, .js, .gif, etc.

Check this screenshot:

[Screenshot: Screen Shot 2024-10-06 at 7 09 28 PM]

@SofiaEngvall

Here is my error message from Kali Linux:

┌──(kali㉿kali)-[~]
└─$ wayback_machine_downloader https://digistump.com/wiki/digispark -p 500 -c 5
Downloading https://digistump.com/wiki/digispark to websites/digistump.com/ from Wayback Machine archives.

Getting snapshot pages.............../usr/lib/ruby/3.1.0/open-uri.rb:364:in `open_http': 400 BAD REQUEST (OpenURI::HTTPError)
        from /usr/lib/ruby/3.1.0/open-uri.rb:741:in `buffer_open'
        from /usr/lib/ruby/3.1.0/open-uri.rb:212:in `block in open_loop'
        from /usr/lib/ruby/3.1.0/open-uri.rb:210:in `catch'
        from /usr/lib/ruby/3.1.0/open-uri.rb:210:in `open_loop'
        from /usr/lib/ruby/3.1.0/open-uri.rb:151:in `open_uri'
        from /usr/lib/ruby/3.1.0/open-uri.rb:721:in `open'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader/archive_api.rb:13:in `get_raw_list_from_api'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:92:in `block in get_all_snapshots_to_consider'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `times'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `get_all_snapshots_to_consider'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:105:in `get_file_list_curated'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:164:in `get_file_list_by_timestamp'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:309:in `file_list_by_timestamp'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:192:in `download_files'
        from /var/lib/gems/3.1.0/gems/wayback_machine_downloader-2.3.1/bin/wayback_machine_downloader:72:in `<top (required)>'
        from /usr/local/bin/wayback_machine_downloader:25:in `load'
        from /usr/local/bin/wayback_machine_downloader:25:in `<main>'

I'll look at the fork. Thanks!

@hupictz
Author

hupictz commented Oct 7, 2024

> Hi! It really works, just like in your screenshot. Thanks for the help! But the site opens in the wrong format, not like on web.archive.org. Do you have the same problem?

@Jacek216 No, for me it works fine. I am not facing any issues.

@vachi

vachi commented Oct 7, 2024

This also worked for me, thank you @hupictz.

@muhammadbaqirjafari

Facing the same issue.

@hupictz
Author

hupictz commented Oct 19, 2024

@muhammadbaqirjafari Try this one: #307 (comment)

It is working for everyone.

@intercoop

intercoop commented Oct 23, 2024

> @muhammadbaqirjafari Try this one: #307 (comment)
>
> It is working for everyone.

Hello hupictz,

Getting snapshot pages. found 0 snaphots to consider.

No files to download.
Possible reasons:
* Site is not in Wayback Machine Archive.
* To timestamp too much in the past.

Why does this happen when I download sites?

@hupictz
Author

hupictz commented Oct 23, 2024

@intercoop I am also facing the same issue.

There is no issue with the code. The problem is on the Internet Archive side.

My guess is that it's temporarily offline. When it comes back online, it should work.

One of these endpoints has to come back online for it to start working:

https://web.archive.org/cdx/search/xd
or
https://web.archive.org/cdx/search/cdx
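
For anyone who wants to check the endpoint themselves, here is a minimal Ruby sketch. It is not code from the gem or the fork: the helper names cdx_query_url and cdx_online? are hypothetical, and the url, output, and limit query parameters are assumed from the public CDX server documentation.

```ruby
require "net/http"
require "uri"

# One of the two endpoints mentioned in the comment above.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"

# Build a small CDX query for a site (hypothetical helper).
# A low limit keeps the response short; we only care whether the API answers.
def cdx_query_url(site, limit: 5)
  uri = URI(CDX_ENDPOINT)
  uri.query = URI.encode_www_form(url: "#{site}/*", output: "json", limit: limit)
  uri.to_s
end

# Return true when the endpoint answers with a 2xx status (hypothetical helper).
# Any network failure is treated as "offline".
def cdx_online?(site)
  response = Net::HTTP.get_response(URI(cdx_query_url(site)))
  response.is_a?(Net::HTTPSuccess)
rescue StandardError
  false
end

# Print the URL so it can also be pasted into a browser.
puts cdx_query_url("example.com")
```

Calling cdx_online?("example.com") performs the actual request. If it returns false while the website itself loads, the outage is probably on the API side rather than in the downloader.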

@hupictz
Author

hupictz commented Oct 30, 2024

Now the download is working (confirmed on 30th Oct 2024).

If you have any issues, please report them here. I'm happy to help.

@intercoop

> Now the download is working (confirmed on 30th Oct 2024).
>
> If you have any issues, please report them here. I'm happy to help.

Hello hupictz,
Thank you for the reminder! I have one more question. Many times, it is not possible to download all the images, only some of them. Even though the Wayback Machine contains an image, it is not downloaded. What do you think about this problem?

@ahmedqat

> Now the download is working (confirmed on 30th Oct 2024).
>
> If you have any issues, please report them here. I'm happy to help.

Hello, I am trying to download this specific website, which has now been taken down:

https://linc1.xmu.edu.my/doc

The method you outlined works wonders and manages to download the files. However, when I open index.html in a browser, it just loads forever. Any ideas why?

This is the command I used:

ruby wayback_machine_downloader linc1.xmu.edu.my -f 20240816130919

Thanks

@AlexBO222

The script doesn't download URLs whose timestamp carries a suffix like these:
im_ cs_ if_ js_

It apparently treats them as errors and skips them, and for many sites that is about half of the site.

As an example, here are such folders from the domain wellmetmk.ru:
20130328113257im_
20130328113257cs_
20130328113257if_
20130328113257js_

@tobypeschel

> It only contains HTML files; the CSS and image files are not downloaded. I can't figure out why this is happening.

@intercoop

Maybe an issue with timestamps? If you're providing timestamps (with -t and -f) it's possible the CSS and image files are outside the range. Try opening one of the archived images or stylesheets in your browser, and check the timestamps in the URLs (e.g. https://web.archive.org/web/20060111095815/example.com/img/logo.png – is the timestamp 20060111095815 within the range you provided?).
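
The check described above can be sketched in a few lines of Ruby. This is only an illustration: snapshot_timestamp and within_range? are hypothetical helpers, and treating the bounds as inclusive and padding short values is an assumption, not necessarily how the downloader handles -f and -t.

```ruby
# Matches the 14-digit timestamp in a Wayback Machine URL, tolerating an
# optional modifier suffix such as im_ or cs_ directly after the digits.
WAYBACK_TS = %r{/web/(\d{14})(?:[a-z]{2}_)?/}

# Extract the timestamp string, or nil when the URL has none.
def snapshot_timestamp(url)
  match = WAYBACK_TS.match(url)
  match && match[1]
end

# Assumption: bounds are inclusive, and a short bound such as 2006 is padded
# so that it can be compared with the full 14-digit timestamp as a string.
def within_range?(url, from:, to:)
  ts = snapshot_timestamp(url)
  return false if ts.nil?

  ts >= from.to_s.ljust(14, "0") && ts <= to.to_s.ljust(14, "9")
end

url = "https://web.archive.org/web/20060111095815/example.com/img/logo.png"
puts within_range?(url, from: 2005, to: 2006)
puts within_range?(url, from: 20070101, to: 2008)
```

With the example URL from the comment, the first call reports the image as inside a 2005 to 2006 range and the second as outside a range that starts in 2007.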

@hupictz
Author

hupictz commented Nov 1, 2024

> Many times, it is not possible to download all the images, only some of them.

@intercoop I can download all the images without any issues. If you share more details, I will try to help.

@AlexBO222

I opened everything manually, and the files exist. But with the --all option, the script should download files through all 301 redirects. Those redirects lead to files in folders that carry these suffixes. Because of the suffixes, the script thinks the timestamped folder does not match, but it does; it just has a suffix such as im_, js_, or cs_.

@AlexBO222

And even if you do not specify any restrictions, a redirect to such a folder is not downloaded either, because the script does not process these folders and treats them as errors. The script starts by loading a normal folder without any suffix; the pages in it redirect to the suffixed folders, and those are not downloaded.
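
To make the folder naming concrete, here is a small Ruby sketch that separates the timestamp from the suffix in the folder names listed earlier. It is not taken from the downloader: split_snapshot_folder is a hypothetical helper, and the pattern assumes a suffix is always two lowercase letters followed by an underscore, as in the examples.

```ruby
# A snapshot folder name is a 14-digit timestamp, optionally followed by a
# two-letter modifier and an underscore (im_, cs_, if_, js_ in the examples).
SNAPSHOT_FOLDER = /\A(?<timestamp>\d{14})(?<modifier>[a-z]{2}_)?\z/

# Return [timestamp, modifier], with modifier nil for a plain folder,
# or nil when the name does not look like a snapshot folder at all.
def split_snapshot_folder(name)
  m = SNAPSHOT_FOLDER.match(name)
  return nil unless m

  [m[:timestamp], m[:modifier]]
end

folders = %w[
  20130328113257
  20130328113257im_
  20130328113257cs_
  20130328113257if_
  20130328113257js_
]

folders.each do |folder|
  timestamp, modifier = split_snapshot_folder(folder)
  puts "#{folder} -> timestamp=#{timestamp} modifier=#{modifier.inspect}"
end
```

All five names yield the same timestamp, which is the point being made above: a comparison on the timestamp alone would accept the suffixed folders too.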

@milescrawford

I got the latest version from the gem and ran it plainly with no options (wayback_machine_downloader www.example.info), and I am still getting a 400:

Getting snapshot pages../System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:378:in `open_http': 400 BAD REQUEST (OpenURI::HTTPError)
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:756:in `buffer_open'
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:226:in `block in open_loop'
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:224:in `catch'
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:224:in `open_loop'
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:165:in `open_uri'
	from /System/Library/Frameworks/Ruby.framework/Versions/2.6/usr/lib/ruby/2.6.0/open-uri.rb:736:in `open'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader/archive_api.rb:13:in `get_raw_list_from_api'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:92:in `block in get_all_snapshots_to_consider'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `times'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:91:in `get_all_snapshots_to_consider'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:105:in `get_file_list_curated'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:164:in `get_file_list_by_timestamp'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:309:in `file_list_by_timestamp'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/lib/wayback_machine_downloader.rb:192:in `download_files'
	from /Library/Ruby/Gems/2.6.0/gems/wayback_machine_downloader-2.3.1/bin/wayback_machine_downloader:72:in `<top (required)>'
	from /usr/local/bin/wayback_machine_downloader:23:in `load'
	from /usr/local/bin/wayback_machine_downloader:23:in `<main>'

@jacob-willden

@milescrawford, have you tried the solution from @hupictz above that uses the fork from @ShiftaDeband? I got the same error as you when running the latest version of the gem (2.3.1), but when I downloaded the fork and ran it using the instructions from @hupictz, I was able to use it without issue.

Note: If you're on Mac, Linux, or BSD, you can use whatever terminal program you like to run the fork in Ruby instead of PowerShell, which is Windows exclusive.

@42linux

42linux commented Nov 15, 2024

Thank you !

@mackdreye

> Use these commands in Windows PowerShell after installing Ruby and downloading wayback-machine-downloader:
> cd downloads
> cd wayback-machine-downloader-feature-httpGet
> cd bin
> ruby wayback_machine_downloader example.com --from 20220331

Holee crud, this worked! Thanks in perpetuity... this made my life. I had given up on resurrecting my old content... this changes my life...

@gearhead

gearhead commented Nov 25, 2024

I installed this version of wayback-machine-downloader:
https://github.com/ShiftaDeband/wayback-machine-downloader
and it 'works' in that it downloads the site and I no longer get the 400 error, but the resulting main page is blank. I can see all the rest of the content in the folders, but the resulting index.html does nothing. Just a white screen with an empty frame where the 'Wayback Machine' banner was. I was able to edit the index.html file and now can get the main page, but all the active server controls are broken.

The website I am trying to download is entirely ASP-based, so it may be that ASP breaks this tool.

The site I am interested in is this: https://web.archive.org/web/20190220201426/http://www.twinsaabs.com/index.asp
It is built with Active Server Pages, and that seems to be what is messing up the download. Instead of the original *.asp file, I get a directory with files like:

'intro.asp?nsteps=2'
'step1.asp?'
'step1.asp?nsteps=2'
'step1.asp?nsteps=3&nointro=1'
'intro.asp?' 

I have never created a website with active server pages and am not sure how it is supposed to be, but this does not seem right. Could it be that the downloader is interpreting the asp and not downloading the file?
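
One way to look at those names: the Wayback Machine stores what the server sent back, not the ASP source, so each name may simply be one archived response per distinct query string. The Ruby sketch below groups the listed names by page under that assumption; split_saved_name is a hypothetical helper, not part of the downloader.

```ruby
require "uri"

# Split a saved file name into the page and its query parameters.
# "step1.asp?" has an empty query, which becomes an empty hash.
def split_saved_name(name)
  page, query = name.split("?", 2)
  params = query.to_s.empty? ? {} : URI.decode_www_form(query).to_h
  { page: page, params: params }
end

# The file names quoted in the comment above.
names = [
  "intro.asp?nsteps=2",
  "step1.asp?",
  "step1.asp?nsteps=2",
  "step1.asp?nsteps=3&nointro=1",
  "intro.asp?"
]

# Group the variants that belong to the same page.
by_page = names.group_by { |name| split_saved_name(name)[:page] }

by_page.each do |page, variants|
  puts "#{page}: #{variants.length} saved variants"
end
```

If that reading is right, the files are rendered HTML snapshots of intro.asp and step1.asp for different parameters, and the server-side controls cannot work offline because there is no server to run them.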

StrawberryMaster added a commit to StrawberryMaster/wayback-machine-downloader that referenced this issue Dec 3, 2024
including extra config settings, a proper rate limit, and a logger. Fixes: hartator#307 hartator#291 hartator#281 hartator#269 and probably others too
@vmackey

vmackey commented Dec 5, 2024

Hello, I have the 400 BAD REQUEST issue, and the method from @hupictz above (running the ShiftaDeband fork with ruby wayback_machine_downloader) didn't work for me. When I run ruby wayback_machine_downloader, PowerShell doesn't recognize "ruby" as a command, even though Ruby is installed along with the gem.

And if I use Ruby instead of PowerShell, from the bin folder of ShiftaDeband's wayback_machine_downloader, I get the 400 bad request error.

I also tried this method:

> Use these commands in Windows PowerShell after installing Ruby and downloading wayback-machine-downloader:
> cd downloads
> cd wayback-machine-downloader-feature-httpGet
> cd bin
> ruby wayback_machine_downloader example.com --from 20220331

But it doesn't work. When I use cd downloads, PowerShell doesn't recognize the path. From where am I supposed to use this command?

@giorsgeks

giorsgeks commented Dec 8, 2024

It works!!! Thanks!!! I am using it on Windows to try to restore a WordPress site.
Changing to the directory in Downloads did not help, so I just replaced the original gem files with the extracted ones.
It took a long time to find where the downloaded files went...
No matter what I pass to --directory, it saves everything to
C:\Users\username\Downloads\wayback-machine-downloader-feature-httpGet\bin...
Downloading without a file type filter produced a zillion index.html files and no images.
With --only wp-uploads it downloads the images fine. Thanks again!

@tobypeschel

@vmackey First of all, it sounds like Ruby isn't on your system PATH. If you run the Ruby installer, does it give you an option like "add Ruby executables to your PATH"? If so, make sure that's selected.

Or if you can't do that, add Ruby to your PATH manually:

  1. Open the Start Menu and search for "Environment Variables".
  2. Click "Edit the system environment variables".
  3. Under the "Advanced" tab, click "Environment Variables".
  4. Find "Path" in the System Variables section and click "Edit".
  5. Add the path to the Ruby bin folder (e.g. C:\RubyXX\bin or wherever).
  6. Save changes and restart/open PowerShell.

> When I use cd downloads, PowerShell doesn't recognize the path. From where am I supposed to use this command?

I think you can ignore the cd downloads part; it's just assuming you've previously placed it in the Downloads folder, I believe.

You want to be inside wayback-machine-downloader-feature-httpGet/bin and running ruby wayback_machine_downloader.

@giorsgeks

giorsgeks commented Dec 8, 2024

Actually, it is:
[Image: image]
Now I am happily downloading images. Maybe I forgot to add /bin; I will check later.
...
bin was there.
I don't know why a lot of WordPress pictures are stored under https://i0.wp.com/yoursite.com/wp-content/uploads/
The pictures can be downloaded easily by providing the full path. My command now looks like this:
C:\Users\username\Downloads\wayback-machine-downloader-feature-httpGet\bin>wayback_machine_downloader https://i0.wp.com/yoursite.com/wp-content/uploads/ --directory yoursitepictures
