LinkedIn Job Extractor - Job data extraction tool from the web using Scrapy #375 #388

Closed
wants to merge 1 commit into from

Conversation

Niraj1608
Contributor

Related Issues or Bug

  • This PR addresses problems with scraping job postings from LinkedIn, in particular rate limiting and IP blocking.

Fixes: #335

Proposed Changes

  • Implemented the Octoparse proxy middleware to manage requests and avoid IP bans.
  • Optimized the CSS selectors for improved accuracy in extracting job details.
  • Enhanced error handling and logging for better monitoring of scraping performance.
  • Added support for dynamic content scraping.
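For reviewers, the rate-limit and error-handling changes listed above could look roughly like the following Scrapy settings. This is only a sketch: the PR description does not show the project's actual `settings.py`, so every value below is an illustrative assumption. The setting names themselves are standard Scrapy settings.

```python
# settings.py (illustrative values; not taken from this PR)

# Space out requests so the crawl is less likely to hit rate limits.
DOWNLOAD_DELAY = 2.0
CONCURRENT_REQUESTS_PER_DOMAIN = 1

# Let Scrapy adapt the delay to observed server latency.
AUTOTHROTTLE_ENABLED = True
AUTOTHROTTLE_START_DELAY = 2.0
AUTOTHROTTLE_MAX_DELAY = 30.0

# Retry responses that typically signal throttling or transient failures.
RETRY_ENABLED = True
RETRY_TIMES = 3
RETRY_HTTP_CODES = [429, 500, 502, 503, 504]

# Keep logs informative enough to monitor scraping performance.
LOG_LEVEL = "INFO"
```

Retrying on 429 only helps when combined with a delay or throttle, which is why both groups of settings appear together here.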

Additional Info

  • The Octoparse proxy middleware helps in rotating IPs and reduces the chances of getting blocked by LinkedIn during scraping.
  • This implementation allows for more efficient and reliable data extraction from LinkedIn job postings.
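To illustrate the IP-rotation idea, here is a minimal sketch of a round-robin proxy downloader middleware. It is not the middleware from this PR, whose code is not shown in the description. The class name `RotatingProxyMiddleware` and the `PROXY_LIST` setting are hypothetical; the only Scrapy behaviour relied on is that a downloader middleware may set `request.meta["proxy"]` in `process_request`.

```python
import itertools
import logging

logger = logging.getLogger(__name__)


class RotatingProxyMiddleware:
    """Hypothetical downloader middleware that assigns proxies round-robin."""

    def __init__(self, proxies):
        if not proxies:
            raise ValueError("at least one proxy URL is required")
        # cycle() repeats the proxy list endlessly, one proxy per request.
        self._pool = itertools.cycle(proxies)

    @classmethod
    def from_crawler(cls, crawler):
        # PROXY_LIST is an assumed, project-specific setting name.
        return cls(crawler.settings.getlist("PROXY_LIST"))

    def process_request(self, request, spider):
        proxy = next(self._pool)
        # Scrapy's built-in HttpProxyMiddleware reads this meta key.
        request.meta["proxy"] = proxy
        logger.debug("Using proxy %s for %s", proxy, request.url)
        return None  # continue normal request processing
```

A middleware like this would be enabled through the `DOWNLOADER_MIDDLEWARES` setting, using whatever module path the project gives it. Round-robin is the simplest policy; dropping proxies that repeatedly fail would be a natural next step.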

Screenshots

Screenshot 2024-10-16 194309


👋 Thank you for opening this pull request! We appreciate your contribution to improving this project. Your PR is under review, and we'll get back to you shortly.
Don't forget to mention the issue you solved!

To help move the process along, please tag @UppuluriKalyani, @Neilblaze, and @SaiNivedh26 for a faster review!

Development

Successfully merging this pull request may close these issues.

Feature request - Add hand gesture recognition project
1 participant