GitHub - Cosmin-Hodor/scrapper: Scrap data off the web with Rust!

A basic scrapper written in Rust

This code uses the hyper and regex crates to scrape links from a web page. The code first creates an instance of the Client struct from the hyper crate, which is used to make HTTP requests, then prompts the user to enter a URL, reads the URL from the user input using stdin, and stores it in the url variable. The newline character is trimmed from the end of the URL string using the trim method, then uses the get method from the Client instance to make a GET request to the specified URL.

The response from the server is stored in the response variable. The body of the response is read into the body variable, which is a string.

The code then uses the regex crate to define a regular expression pattern that matches links in the HTML body. The Regex::new function is used to compile the pattern, which looks for elements with a href attribute. The captures_iter method is used to iterate over all the links that match the pattern in the body string. For each match, the link is extracted from the first capture group and printed to the console using println!.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
main.rs		main.rs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A basic scrapper written in Rust

About

Releases

Packages

Languages

Cosmin-Hodor/scrapper

Folders and files

Latest commit

History

Repository files navigation

A basic scrapper written in Rust

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages