Too many pings and one client always disconnects #4300

ajulyav · 2024-10-07T13:43:34Z

Describe the bug

grpc._channel._MultiThreadedRendezvous: <_MultiThreadedRendezvous of RPC that terminated with:
	status = StatusCode.UNAVAILABLE
	details = "Too many pings"
	debug_error_string = "UNKNOWN:Error received from peer ipv4:192.168.229.99:5040 {grpc_message:"Too many pings", grpc_status:14, created_time:"2024-10-07T15:40:46.164225255+02:00"}"
>

I've got my grpc server settings as:

        ("grpc.http2.max_pings_without_data", 0),
        # Is it permissible to send keepalive pings from the client without
        # any outstanding streams. More explanation here:
        # https://github.com/adap/flower/pull/2197
        ("grpc.keepalive_permit_without_calls", 0),

but it does not help though

later, i added up two options:

        ("grpc.http2.max_ping_strikes", 0),
        ("grpc.http2.min_ping_interval_without_data_ms", 10)

it allowed me escape the initial error, but then I have:

    raise GrpcBridgeClosed()
flwr.server.superlink.fleet.grpc_bidi.grpc_bridge.GrpcBridgeClosed

Steps/Code to Reproduce

I use basic FedAvg strategy except that i send additional round of evaluation on each client during aggregate_fit
EvaluateRes = client_proxy.evaluate(ins = evaluate_ins, timeout = None, group_id=rnd) . Sometimes when rerun the clients and server, the error happens after 1 successful round, so it is not always happens the same moment.

Expected Results

Client stays alive

Actual Results

Client disconnects

The text was updated successfully, but these errors were encountered:

oabuhamdan · 2024-10-21T16:37:32Z

Did you come to a solution?

ajulyav · 2024-10-24T10:13:16Z

Did you come to a solution?

Hello, I am still encountering this problem, and it occurs quite randomly. A few things have helped me reduce the frequency of this issue:

Run the server and clients on the same machine, so you can use "localhost" as the server address.
If you're using loops that send messages to clients, try replacing the loop with non-loop code.

WilliamLindskog · 2025-01-28T21:36:44Z

Hi @ajulyav,

Thanks for raising this. Are you still experiencing this issue?

ajulyav added the bug Something isn't working label Oct 7, 2024

ajulyav closed this as completed Oct 24, 2024

ajulyav reopened this Oct 24, 2024

ajulyav closed this as completed Oct 24, 2024

ajulyav reopened this Oct 24, 2024

WilliamLindskog added stale If issue/PR hasn't been updated within 3 weeks. part: communication Issues/PRs that affect federated communication e.g. gRPC. labels Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Too many pings and one client always disconnects #4300

Too many pings and one client always disconnects #4300

ajulyav commented Oct 7, 2024 •

edited

Loading

oabuhamdan commented Oct 21, 2024

ajulyav commented Oct 24, 2024

WilliamLindskog commented Jan 28, 2025

Too many pings and one client always disconnects #4300

Too many pings and one client always disconnects #4300

Comments

ajulyav commented Oct 7, 2024 • edited Loading

Describe the bug

Steps/Code to Reproduce

Expected Results

Actual Results

oabuhamdan commented Oct 21, 2024

ajulyav commented Oct 24, 2024

WilliamLindskog commented Jan 28, 2025

ajulyav commented Oct 7, 2024 •

edited

Loading