- Add
PolipusCrawler#add_to_queue
to add a page back to the queue #24 - Introduce new block
PolipusCrawler#on_page_error
which runs when there was an error (Page#error
). For example a connectivity error. See/examples/error_handling.rb
#15 - Add
Page#success?
which returns true if HTTP code is something in between 200 and 206. - Polipus supports now
robots.txt
directives. Set the option:obey_robots_txt
totrue
. See/examples/robots_txt_handling.rb
#30 - Add support for GZIP and deflate compressed HTTP requests #26
- Minor improvements to code style