[Feature Request] Add concurrency with proxies #62

Open
phoenixpereira opened this issue Dec 8, 2024 · 0 comments
Comments

@phoenixpereira
Member

Overview

Add concurrent scraping through a pool of proxies to the scraper.

Details

  • Fetch a list of candidate proxies, test them, and save the working ones to a text file.
  • Scrape course data concurrently through the working proxies.
  • Add retry logic: after an HTTP 429 (Too Many Requests) response, try other proxies before falling back to a 60-second wait.
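
A rough sketch of how these three steps could fit together, using only the Python standard library. Everything named here is hypothetical and not taken from the scraper's code: `fetch(url, proxy)` stands in for whatever HTTP call the scraper makes and is assumed to return a `(status_code, body)` tuple and raise `OSError` on connection failure; `max_workers` and `max_rounds` are illustrative defaults.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable

# Hypothetical fetcher type: fetch(url, proxy) -> (status_code, body).
Fetch = Callable[[str, str], tuple[int, str]]


def filter_working(proxies: list[str], fetch: Fetch, test_url: str) -> list[str]:
    """Return the proxies through which test_url answers with HTTP 200."""
    def works(proxy: str) -> bool:
        try:
            return fetch(test_url, proxy)[0] == 200
        except OSError:  # dead or unreachable proxy
            return False

    with ThreadPoolExecutor(max_workers=8) as pool:
        flags = list(pool.map(works, proxies))
    return [p for p, ok in zip(proxies, flags) if ok]


def save_proxies(path: str, proxies: list[str]) -> None:
    """Write one proxy per line to a text file."""
    Path(path).write_text("\n".join(proxies) + "\n", encoding="utf-8")


def fetch_with_retry(url: str, proxies: list[str], fetch: Fetch,
                     wait_seconds: float = 60.0, max_rounds: int = 3,
                     sleep: Callable[[float], None] = time.sleep) -> str:
    """Try every proxy in turn; wait only after all of them were rate limited."""
    for attempt in range(max_rounds):
        for proxy in proxies:
            try:
                status, body = fetch(url, proxy)
            except OSError:
                continue  # proxy failed, move on to the next one
            if status == 200:
                return body
            if status != 429:
                raise RuntimeError(f"unexpected status {status} for {url}")
            # 429: rate limited on this proxy, try the next one
        if attempt < max_rounds - 1:
            sleep(wait_seconds)
    raise RuntimeError(f"all proxies exhausted for {url}")


def scrape_all(urls: list[str], proxies: list[str], fetch: Fetch,
               max_workers: int = 8) -> dict[str, str]:
    """Fetch all URLs concurrently, each with the proxy-rotation retry above."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        bodies = pool.map(lambda u: fetch_with_retry(u, proxies, fetch), urls)
        return dict(zip(urls, bodies))
```

Passing `fetch` and `sleep` in as parameters keeps the rotation and back-off logic testable without network access; a real implementation would also need to decide whether each worker should start from a different proxy rather than always from the first.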

Why would this feature be useful?

This will significantly reduce scraping time and make it practical to run the scrape automatically every day on GitHub Actions, which in turn makes it possible to report how full each class currently is.

Additional Notes

No response

@phoenixpereira added the enhancement (New feature or request) and dependencies (Pull requests that update a dependency file) labels Dec 8, 2024
@phoenixpereira self-assigned this Dec 8, 2024
@phoenixpereira moved this to In Progress in Courses API Dec 8, 2024
Labels
dependencies Pull requests that update a dependency file enhancement New feature or request
Projects
Status: In Progress
Development

No branches or pull requests

1 participant