429 Client Error: Too Many Requests for url: https://archive.md/ #32
Comments
Confirmed. Same here. Except the domain atm is
Looking at the response that comes back, it appears the request is being caught by a CAPTCHA. I'm unsure what the proper way to deal with this is.
I was getting 429s also. The command-line program comes bundled with a
I think once you hit that CAPTCHA you're pretty much stuck. I've tried the obvious things with browser automation libraries like Playwright, such as clicking the CAPTCHA checkbox, to no avail. Like the comment above said, choosing a real user-agent probably helps. Besides that, I think their rate limiting is largely IP-based, so distributing your requests across IPs may help. I have also noticed that they throttle me far more aggressively when I use Cloudflare 1.1.1.1 or Warp. This may apply to other VPNs as well, but I haven't tested that personally. It would be nice to wrap this project in a queuing system that treats submissions as jobs and retries them automatically, to make it more robust and act like a "service". I haven't seen anyone doing exactly that yet. I have also noticed that, while infrequent, some pages seem to crash the archive.is archiver and never succeed in being archived. Not sure if there's a way to report that to them.
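For what it's worth, here is a rough sketch of the queue-with-retries idea from the comment above. It is not part of this project: `run_queue`, `RateLimited`, and the `submit` callable are all hypothetical names, and `submit` stands in for whatever capture call you actually use (it is assumed to raise `RateLimited` when the server answers 429). The delays are guesses, not values the site documents.

```python
import time
from collections import deque


class RateLimited(Exception):
    """Hypothetical: raised by the submit callable on a 429 response."""


def run_queue(urls, submit, max_attempts=4, base_delay=30.0, sleep=time.sleep):
    """Submit each URL as a job, retrying rate-limited jobs with backoff.

    Returns (archived, failed): a dict of url -> result, and a list of
    URLs that were still rate limited after max_attempts tries.
    """
    pending = deque((url, 1) for url in urls)
    archived = {}
    failed = []
    while pending:
        url, attempt = pending.popleft()
        try:
            archived[url] = submit(url)
        except RateLimited:
            if attempt >= max_attempts:
                failed.append(url)
                continue
            # Exponential backoff: base_delay, 2x, 4x, ... (assumed values).
            sleep(base_delay * 2 ** (attempt - 1))
            # Re-queue at the back so other jobs get a turn first.
            pending.append((url, attempt + 1))
    return archived, failed
```

Passing `sleep` in as a parameter keeps the loop easy to test without real waiting. Since the throttling looks IP-based, a single worker with long delays is probably a safer starting point than running jobs in parallel.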
I saw how another, similar package solved it: they downloaded a clearance cookie after manually entering the site once. This cookie was then stored as a system variable. Maybe that could be a solution here as well?
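A minimal sketch of what that could look like, combined with the real user-agent suggestion from earlier in the thread. Everything here is an assumption: the variable name `ARCHIVE_CLEARANCE_COOKIE` is made up, the cookie value is whatever `name=value` string you copy from your browser after passing the CAPTCHA by hand, and I have not verified that the site accepts it. It only builds the request and does not send anything.

```python
import os
import urllib.request

# Hypothetical environment variable holding the cookie string copied
# from a browser session, e.g. "name=value".
COOKIE_ENV_VAR = "ARCHIVE_CLEARANCE_COOKIE"

# Example browser-like User-Agent; ideally use the one from the browser
# that obtained the cookie.
BROWSER_USER_AGENT = (
    "Mozilla/5.0 (X11; Linux x86_64; rv:109.0) Gecko/20100101 Firefox/115.0"
)


def build_request(url, environ=os.environ):
    """Return a urllib Request carrying the User-Agent and, if set, the cookie."""
    headers = {"User-Agent": BROWSER_USER_AGENT}
    cookie = environ.get(COOKIE_ENV_VAR)
    if cookie:
        headers["Cookie"] = cookie
    return urllib.request.Request(url, headers=headers)
```

The resulting object can be passed to `urllib.request.urlopen`. The same two headers could be set on whatever HTTP client the package uses internally, if it exposes a way to do that.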
What package was that, @alybel?
This has never worked for me; I always get a 429 error.
Running from within Python:
Running command-line: