Regarding feat: implement unthrottled concurrency using task queue #141

Open
wumpus opened this issue Jun 9, 2024 · 8 comments

Comments


wumpus commented Jun 9, 2024

Can you stop attacking the Common Crawl CDX API?

Owner

lc commented Jun 9, 2024

I’m not? This is an open source tool to find archived URLs for a given domain…

Author

wumpus commented Jun 9, 2024

Yes, and because it isn't throttled, use of this package harms the target, which is me.

Author

wumpus commented Jun 11, 2024

Any progress? I was hoping for rate limiting, honoring 503 and 429 status codes, and exponential backoff.

And not just "unthrottled concurrency".
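For illustration, here is a minimal sketch of what those three behaviors could look like together. It is written in Python rather than in this project's own code, and every name in it (`PoliteClient`, `backoff_delay`, the `fetch` callable, the default delays) is a hypothetical chosen for the example, not something taken from gau or from the CDX API documentation.

```python
import random
import time
from typing import Callable, Mapping, Optional, Tuple

# Hypothetical response shape: (status code, headers, body).
Response = Tuple[int, Mapping[str, str], bytes]

# Status codes that mean "slow down and try again later".
RETRYABLE = {429, 503}


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 60.0,
                  retry_after: Optional[str] = None) -> float:
    """Seconds to wait before retrying after failed attempt `attempt` (0-based)."""
    # Prefer the server's Retry-After hint when it is a plain number of seconds.
    if retry_after is not None and retry_after.strip().isdigit():
        return float(retry_after.strip())
    # Otherwise double the delay on every attempt, up to `cap`.
    return min(cap, base * (2 ** attempt))


class PoliteClient:
    """Wraps a fetch callable with a request-rate limit and retry with backoff."""

    def __init__(self, fetch: Callable[[str], Response],
                 min_interval: float = 1.0, max_retries: int = 5,
                 sleep: Callable[[float], None] = time.sleep,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.fetch = fetch
        self.min_interval = min_interval  # assumed spacing between requests
        self.max_retries = max_retries
        self.sleep = sleep
        self.clock = clock
        self._next_allowed = 0.0

    def _throttle(self) -> None:
        # Rate limiting: wait until at least min_interval has passed
        # since the previous request was allowed to start.
        now = self.clock()
        wait = self._next_allowed - now
        if wait > 0:
            self.sleep(wait)
        self._next_allowed = max(now, self._next_allowed) + self.min_interval

    def get(self, url: str) -> Response:
        for attempt in range(self.max_retries + 1):
            self._throttle()
            status, headers, body = self.fetch(url)
            if status not in RETRYABLE:
                return status, headers, body
            if attempt == self.max_retries:
                break
            # Header lookup is case-sensitive here; real HTTP libraries
            # usually provide case-insensitive headers.
            delay = backoff_delay(attempt, retry_after=headers.get("Retry-After"))
            # Add up to 10% jitter so many clients do not retry in lockstep.
            self.sleep(delay + random.uniform(0, delay * 0.1))
        raise RuntimeError(f"giving up on {url} after {self.max_retries} retries")
```

The `fetch` callable is injected so the retry logic stays independent of any particular HTTP library. The important part is that all workers share one throttle and that a 429 or 503 pauses the client instead of triggering an immediate retry.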

Owner

lc commented Jun 11, 2024

It’s open source, so PRs are welcome.

It is going to be a busy month with some life changes for me, so I will put this on my TODO list. Unfortunately, it will likely not get done until late June or early July.

lc closed this as completed Jun 11, 2024
lc reopened this Jun 11, 2024
Owner

lc commented Jun 11, 2024

Accidentally closed this when commenting.

lc self-assigned this Jun 12, 2024
lc added the "bug" (Something isn't working) label Jun 12, 2024
Author

wumpus commented Jun 13, 2024

Thanks for adding this to your TODO list, I appreciate it!

Here's an example of making a single query in Athena that's much more efficient than gau: https://positive.security/blog/ransack-data-exfiltration#common-crawl

Owner

lc commented Jun 13, 2024

Thanks for the reference, and sorry about the slowness to implement. Getting hitched!

Author

wumpus commented Jun 16, 2024

Congratulations!
