net: ip: Fix the warning in the data path #93282
Conversation
Instead of warning for every packet, warn only once and let the user debug the underlying cause. Partially fixes zephyrproject-rtos#49845.
Signed-off-by: Chaitanya Tata <[email protected]>
@@ -262,8 +262,8 @@ static bool net_if_tx(struct net_if *iface, struct net_pkt *pkt)
 	status = net_if_l2(iface)->send(iface, pkt);
 	net_if_tx_unlock(iface);
 	if (status < 0) {
-		NET_WARN("iface %d pkt %p send failure status %d",
-			 net_if_get_by_iface(iface), pkt, status);
+		NET_WARN_ONCE("iface %d pkt %p send failure status %d",
From the linked issue I can see how warning on every packet is a usability problem (as mentioned in the LOG_WRN_ONCE PR), but I also don't think that only ever outputting a single warning is great either.
The two annoyances I see are:
- you have no idea whether it's just a transient failure or whether all packets are failing
- if you only attach to the logs after the first occurrence, you have no idea there is a problem at all
Couldn't the initial issue be resolved at the zperf level by handling packet send errors?
> The two annoyances I see are:
> - you have no idea whether it's just a transient failure or whether all packets are failing
> - if you only attach to the logs after the first occurrence, you have no idea there is a problem at all
I understand that a single print might not help, but do we really want to debug data path issues using prints? IMHO, we should be using statistics to convey the seriousness of the issue. If that is still not acceptable, then I propose we pull in another Linux feature, printk_ratelimited (which I am still not keen on, in favour of printk_once); this way at least we don't bombard the log and the user can control the rate. WDYT?
I can only speak for myself, but if a deployed device is not getting data through to the cloud, I'm much more likely to be looking at serial logs than to sit there polling a stats object (somehow?) and checking to see if an error counter is going up. Even if it is going up, it doesn't really provide any reasoning as to why it's going up.
A rate-limited output would be fine from my perspective, but is obviously more work.
> likely to be looking at serial logs than to sit there polling a stats object (somehow?) and checking to see if an error counter is going up.

Well, I almost always use those shell commands to look at drops :). Traffic running async + shell to keep dumping stats is my go-to debug for data path issues, rather than looking at a flood of prints.
> A rate limited output would be fine from my perspective, but is obviously more work.

Yes, it's a proper feature that needs to be implemented.
> Put more succinctly, shouldn't the problem at the driver layer that causes the failures and the application layer that continuously keeps trying to send be fixed, rather than making the log output less useful?

Absolutely, the entire pipeline is responsible, as you say (and IIRC we had the same discussion about the lack of a stop/start data path in Zephyr), but the specific problem this PR addresses is that bombarding with prints (zperf pumping at 50M) doesn't help; you lose any control over the shell and cannot even type.
Can we do something like only printing a warning if at least 1 second has passed since the last warning?
Yeah, the rate-limiting discussion is in the comment above.