The fetch times are stored in the metadata - the same could be done with the robots delay. Instead of having yet another stream, we could have a custom bolt intercepting the URLs from the FetcherBolt on the status stream and doing whatever it wants based on the content of the metadata.
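To make the interception idea a bit more concrete, here is a rough, dependency-free sketch of the logic such a bolt might run for each URL it sees on the status stream. Everything in it is an assumption for illustration: the class name `HostInfoInterceptor`, the metadata keys `fetch.time.ms` and `robots.crawl.delay.ms`, and the policy of keeping the larger of the two values are all hypothetical, and the Storm wiring (tuple handling, acking) is left out on purpose.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: holds the per-host logic a custom bolt could run
// for every (url, metadata) pair received on the status stream.
class HostInfoInterceptor {
    // Assumed metadata keys, not the names used by the actual FetcherBolt.
    static final String FETCH_TIME_KEY = "fetch.time.ms";
    static final String ROBOTS_DELAY_KEY = "robots.crawl.delay.ms";

    // Latest suggested delay per host, in milliseconds.
    private final Map<String, Long> delayByHost = new HashMap<>();

    // Called once per URL; reads the metadata and updates the host entry.
    public void onStatus(String url, Map<String, String> metadata) {
        String host;
        try {
            host = URI.create(url).getHost();
        } catch (IllegalArgumentException e) {
            return; // malformed URL, nothing to record
        }
        if (host == null) {
            return;
        }
        long fetchTime = parse(metadata.get(FETCH_TIME_KEY));
        long robotsDelay = parse(metadata.get(ROBOTS_DELAY_KEY));
        // Illustrative policy only: keep the larger of the two values.
        long delay = Math.max(fetchTime, robotsDelay);
        if (delay > 0) {
            delayByHost.put(host, delay);
        }
    }

    // What an external component (e.g. a spout) could query; 0 if unknown.
    public long delayFor(String host) {
        return delayByHost.getOrDefault(host, 0L);
    }

    // Missing or non-numeric values are treated as 0.
    private static long parse(String value) {
        if (value == null) {
            return 0L;
        }
        try {
            return Long.parseLong(value.trim());
        } catch (NumberFormatException e) {
            return 0L;
        }
    }
}
```

A real bolt would call something like `onStatus` from its `execute` method and forward or publish the result however the external component expects; that part depends on the topology and is not shown here.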
This could be used to notify external components of host-related information such as:
This would be used by the FetcherBolt, for instance, to implement throttling at the spout level and users would not have to modify the Fetcher code for doing so.
@owenrh