You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
)
that less than 20 seconds to await for free slot.
When message goes from libp2p connection to libp2p node process with full buffer it blocked on message add to queue for about 20 seconds. after 5 seconds left python code decide to retry and reconnect to node process.
In TAC demo two messages were generated every two seconds, one message has not existing address as a target, so it caused constant delay for 20 seconds.
Overall logic and design of p2p node is good, but particular case showed weak parts of the implementation.
Ways to solve the particular issue:
Add queue state warning logs in libp2p_node to help find out similar issue in the future.
Add a specific benchmarks for similar cases.
adjust timeouts and time limits: like bigger ACN timeout on the python side and lower address lookup timeout will help to fix this issue without any extra code/logic update.
python part will be blocked until empty envelope slot appears.
add acn wait response that will be sent in case queue is full, to pause envelope sending from the python side till free slot appears.
try to parallel processing of the messages from queue if possible (depends on libp2p code, envelopes order):
in case slow envelopes put it into special queue and process individually, cause most of the time it awaits for response from remote node.
introduce several sending channels: like workers that consume queue and perform some tasks in parallel: 5 workers to send 5 envelopes in a time, if some blocked other will keep working. sure, all the workers can be blocked, but this solution can decrease probability of total blocking
probably create per target address queues that will block envelopes to specific address only
another point:
find out the way to notify p2p connection, that specific address is not reachable, to avoid sending messages to this address and probably notify user about it
The text was updated successfully, but these errors were encountered:
libp2p_node out queue overflow issue:
The case:
During TAC demo checks was found that internal queue (
agents-aea/packages/fetchai/connections/p2p_libp2p/libp2p_node/aea/api.go
Line 82 in 41ea492
is running full and freed quite slowly, cause look up for unknown address (that does not present in the network) has timeout 20 seconds (
agents-aea/packages/fetchai/connections/p2p_libp2p/libp2p_node/dht/dhtpeer/dhtpeer.go
Line 90 in 41ea492
Queue goes full and freed for one slot in 20 seconds.
default ACN ack timeout for libp2p_node connection is 5 seconds (
agents-aea/packages/fetchai/connections/p2p_libp2p/connection.py
Line 129 in 41ea492
that less than 20 seconds to await for free slot.
When message goes from libp2p connection to libp2p node process with full buffer it blocked on message add to queue for about 20 seconds. after 5 seconds left python code decide to retry and reconnect to node process.
In TAC demo two messages were generated every two seconds, one message has not existing address as a target, so it caused constant delay for 20 seconds.
Overall logic and design of p2p node is good, but particular case showed weak parts of the implementation.
Ways to solve the particular issue:
Add queue state warning logs in libp2p_node to help find out similar issue in the future.
Add a specific benchmarks for similar cases.
python part will be blocked until empty envelope slot appears.
another point:
find out the way to notify p2p connection, that specific address is not reachable, to avoid sending messages to this address and probably notify user about it
The text was updated successfully, but these errors were encountered: