Capture stats about hosts on status stream post fetching #504

Open
jnioche opened this issue Oct 6, 2017 · 3 comments

@jnioche
Contributor

jnioche commented Oct 6, 2017

This could be used to notify external components of host-related information such as:

This would be used by the FetcherBolt, for instance, to implement throttling at the spout level, and users would not have to modify the Fetcher code to do so.

@owenrh

@jnioche
Contributor Author

jnioche commented Feb 23, 2018

The fetch times are stored in the metadata - the same could be done with the robots delay. Instead of having yet another stream, we could have a custom bolt intercepting the URLs from the FetcherBolt on the status stream and doing whatever it wants based on the content of the metadata.
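As a rough sketch of what such an intercepting bolt might do with each tuple, the plain-Java class below accumulates fetch times per host from a URL and its metadata. It deliberately avoids the Storm API so that it stays self-contained. The class name `HostStatsCollector`, the use of a `Map<String, String>` in place of the crawler's metadata object, and the key `fetch.loadingTime` are assumptions made for illustration, not something confirmed in this thread.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper: the per-tuple logic a custom bolt on the status stream might run.
class HostStatsCollector {
    // Assumed metadata key holding the fetch duration in milliseconds.
    static final String FETCH_TIME_KEY = "fetch.loadingTime";

    // Running totals for one host.
    static final class HostStats {
        long fetches;
        long totalFetchMillis;

        double averageFetchMillis() {
            return fetches == 0 ? 0.0 : (double) totalFetchMillis / fetches;
        }
    }

    private final Map<String, HostStats> statsByHost = new HashMap<>();

    // Records one fetched URL; entries without a usable host or fetch time are ignored.
    void record(String url, Map<String, String> metadata) {
        String host;
        try {
            host = URI.create(url).getHost();
        } catch (IllegalArgumentException e) {
            return; // malformed URL
        }
        String raw = metadata.get(FETCH_TIME_KEY);
        if (host == null || raw == null) {
            return;
        }
        long millis;
        try {
            millis = Long.parseLong(raw.trim());
        } catch (NumberFormatException e) {
            return; // value is not a number
        }
        HostStats stats = statsByHost.computeIfAbsent(
                host.toLowerCase(Locale.ROOT), h -> new HostStats());
        stats.fetches++;
        stats.totalFetchMillis += millis;
    }

    // Returns the totals for a host, or null if nothing was recorded for it.
    HostStats statsFor(String host) {
        return statsByHost.get(host.toLowerCase(Locale.ROOT));
    }
}
```

In a real topology the `record` call would sit inside the bolt's `execute` method, and a robots delay stored under another metadata key could be collected the same way.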

@jnioche
Contributor Author

jnioche commented May 22, 2019

Having a windowing bolt for that purpose would make it easier to aggregate counts per minute.
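To make the per-minute aggregation concrete, here is a minimal, dependency-free sketch of the counting that a one-minute tumbling window would produce. It does not use Storm's windowing API; the class `PerMinuteHostCounter` and its method names are hypothetical and only illustrate the bucketing.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: counts events per host in one-minute tumbling windows.
class PerMinuteHostCounter {
    private static final long MINUTE_MILLIS = 60_000L;

    // host -> (window start in epoch millis -> number of events in that minute)
    private final Map<String, TreeMap<Long, Integer>> counts = new HashMap<>();

    // Rounds a timestamp down to the start of its one-minute window.
    private static long windowStart(long epochMillis) {
        return Math.floorDiv(epochMillis, MINUTE_MILLIS) * MINUTE_MILLIS;
    }

    // Registers one event (e.g. one fetched URL) for the host at the given time.
    void record(String host, long epochMillis) {
        counts.computeIfAbsent(host, h -> new TreeMap<>())
              .merge(windowStart(epochMillis), 1, Integer::sum);
    }

    // Returns the count for the minute containing the given time, or 0 if none.
    int countFor(String host, long epochMillis) {
        TreeMap<Long, Integer> perMinute = counts.get(host);
        if (perMinute == null) {
            return 0;
        }
        return perMinute.getOrDefault(windowStart(epochMillis), 0);
    }
}
```

With a windowing bolt, the framework would take care of grouping tuples by time, so only the per-host counting inside each window would remain to be written.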

jnioche changed the title from "Custom HostInfo stream?" to "Capture stats about hosts on status stream post fetching" on May 22, 2019
@jnioche
Contributor Author

jnioche commented May 22, 2019

We could also rely on the state framework provided by Storm to send the stats to a pluggable storage, e.g. Redis.

https://storm.apache.org/releases/1.2.2/Windowing.html
https://storm.apache.org/releases/1.2.2/State-checkpointing.html
