Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't scrape images from page #99

Open
dineshbvadhia opened this issue Feb 13, 2017 · 5 comments
Open

Doesn't scrape images from page #99

dineshbvadhia opened this issue Feb 13, 2017 · 5 comments

Comments

@dineshbvadhia
Copy link

Installed ImageScraper with pip and pointed it to https://www.wikiart.org/en/recently-added-artworks and the response was:

C:\Users\Think\VM\aml-1.6\dev>image-scraper -s C:\Users\Think\VM\watest\images https://www.wikiart.org/en/recently-added-artworks

ImageScraper

Requesting page....
Sorry, no images found.

Do you know what the problem is?

@sananth12
Copy link
Owner

ImageScraper does not scrape images if they are injected into the html runtime. It looks like that's whats happening in the mentioned website

@sananth12
Copy link
Owner

@dineshbvadhia have you tried using the --injected option?

@dineshbvadhia
Copy link
Author

Had to install selenium which is not listed in requirements.txt but still not working.

C:\Users\Think\VM>image-scraper -s C:\Users\Think\VM\watest https://www.wikiart.org/en/recently-added-artworks --injected

ImageScraper

Requesting page....

Traceback (most recent call last):
File "c:\users\think\anaconda3\lib\site-packages\selenium\webdriver\common\service.py", line 74, in start
stdout=self.log_file, stderr=self.log_file)
File "c:\users\think\anaconda3\lib\subprocess.py", line 947, in init
restore_signals, start_new_session)
File "c:\users\think\anaconda3\lib\subprocess.py", line 1224, in _execute_child
startupinfo)
FileNotFoundError: [WinError 2] The system cannot find the file specified

@ssundarraj
Copy link
Collaborator

The documentation is not very clear about this. I think you have to have PhantomJS installed and in your PATH.

@nevertoday
Copy link

ImageScraper does not scrape images if they are injected into the html runtime. It looks like that's whats happening in the mentioned website

but so many webpage are ajax or other mode to display image.
so soso hope , this need will be support.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants