Poor ethernet latency

Hi,

I have an AML-S905X-CC currently running Debian 12 with Linux 6.1.92. The network latency and ping response time are generally poor. I know it is a 100Mbps port, but my really slow Raspberry Pi Model B performs more than twice as well latency-wise on its 100Mbps port.

The ping latency is on the order of 1.0ms to 1Gbps servers on my 1Gbps LAN, whereas the RPiB manages 0.3 to 0.35ms.

The RPiB has a configuration option, “smsc95xx.turbo_mode=0”, that improves latency, but I’ve not seen any way of reducing latency on the Le Potato board.
It means the slow and ancient RPiB is a far better GPS Stratum 1 reference than the allegedly much more performant Libre Computer board.

Anyone with any ideas or configurations for improving this?

From an AML-S805X-AC (same chip as AML-S905X-CC in a different package), to a gigabit device on the same switch. Never seen a generic kernel get 0.3ms.

debian-12-aml-s805x-ac:~$ ping debian-12-aml-a311d-cc -A -c 20
PING debian-12-aml-a311d-cc (192.168.12.173) 56(84) bytes of data.
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=1 ttl=64 time=0.740 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=2 ttl=64 time=0.687 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=3 ttl=64 time=0.679 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=4 ttl=64 time=0.671 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=5 ttl=64 time=0.674 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=6 ttl=64 time=0.486 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=7 ttl=64 time=0.695 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=8 ttl=64 time=0.512 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=9 ttl=64 time=0.677 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=10 ttl=64 time=0.686 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=11 ttl=64 time=0.683 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=12 ttl=64 time=0.664 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=13 ttl=64 time=0.414 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=14 ttl=64 time=0.678 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=15 ttl=64 time=0.667 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=16 ttl=64 time=0.665 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=17 ttl=64 time=0.678 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=18 ttl=64 time=0.682 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=19 ttl=64 time=0.678 ms
64 bytes from 192.168.12.173: icmp_seq=20 ttl=64 time=0.676 ms

--- debian-12-aml-a311d-cc ping statistics ---
20 packets transmitted, 20 received, 0% packet loss, time 49ms
rtt min/avg/max/mdev = 0.414/0.649/0.740/0.078 ms, ipg/ewma 2.590/0.661 ms

From gigabit to gigabit

debian-12-aml-s905d3-cc:~$ ping debian-12-aml-a311d-cc -A -c 20
PING debian-12-aml-a311d-cc.lan (192.168.12.173) 56(84) bytes of data.
64 bytes from debian-12-aml-a311d-cc.lan (192.168.12.173): icmp_seq=1 ttl=64 time=0.736 ms
64 bytes from debian-12-aml-a311d-cc.lan (192.168.12.173): icmp_seq=2 ttl=64 time=0.669 ms
64 bytes from debian-12-aml-a311d-cc.lan (192.168.12.173): icmp_seq=3 ttl=64 time=0.683 ms
64 bytes from debian-12-aml-a311d-cc.lan (192.168.12.173): icmp_seq=4 ttl=64 time=0.627 ms
64 bytes from debian-12-aml-a311d-cc.lan (192.168.12.173): icmp_seq=5 ttl=64 time=0.634 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=6 ttl=64 time=0.519 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=7 ttl=64 time=0.402 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=8 ttl=64 time=0.619 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=9 ttl=64 time=0.620 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=10 ttl=64 time=0.608 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=11 ttl=64 time=0.603 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=12 ttl=64 time=0.598 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=13 ttl=64 time=0.608 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=14 ttl=64 time=0.598 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=15 ttl=64 time=0.611 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=16 ttl=64 time=0.615 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=17 ttl=64 time=0.604 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=18 ttl=64 time=0.616 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=19 ttl=64 time=0.474 ms
64 bytes from debian-12-aml-a311d-cc (192.168.12.173): icmp_seq=20 ttl=64 time=0.611 ms

--- debian-12-aml-a311d-cc.lan ping statistics ---
20 packets transmitted, 20 received, 0% packet loss, time 88ms
rtt min/avg/max/mdev = 0.402/0.602/0.736/0.068 ms, ipg/ewma 4.655/0.600 ms

There are a few important things that affect network latency and trade off against throughput: CPU frequency, packet offloading, DMA, and interrupt handling. But getting 0.75ms is typical. As you’ve stated, yours only gets 1.0ms.
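For what it’s worth, a couple of the knobs listed above (packet offloading in particular) can be inspected and adjusted from userspace with ethtool. A sketch, assuming the interface is named end0; whether disabling each offload actually helps latency is workload-dependent:

```shell
# Show the current state of a few offloads that batch packets (good for
# throughput, bad for latency), then turn them off. Requires root.
ethtool -k end0 | grep -E '^(generic-receive-offload|generic-segmentation-offload|tcp-segmentation-offload)'
ethtool -K end0 gro off gso off tso off
```

Settings made this way do not survive a reboot, so they need to be reapplied from a boot script or the interface configuration.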

There are a number of things that may affect ethernet latency. I have to disagree that 0.75 ms is normal, unless maybe the AML chips with stock config (in their Bookworm image) are that bad. Can’t recall when I last used one straight stock…

First, summarizing a few quick pings:

GHz to GHz (apu3s):
rtt min/avg/max/mdev = 0.169/0.189/0.219/0.019 ms

GHz to 100MHz (apu3 to Beagle Bone Black):
rtt min/avg/max/mdev = 0.328/0.385/0.572/0.063 ms

GHz to 100MHz (apu3 to frite):
rtt min/avg/max/mdev = 0.238/0.277/0.319/0.023 ms

These are all somewhat tweaked for NTP performance (so latency over throughput). For the frite network interface:
ethtool -C end0 rx-usecs 25 tx-usecs 1 tx-frames 0

The frite NIC/driver has some quirks: 25 is the smallest rx delay setting it will accept, and it reports the value as 24 afterwards.

After setting frite's rx-usecs to 384, it did get worse:
rtt min/avg/max/mdev = 0.625/0.650/0.730/0.031 ms

And I seem to recall that the default setting was weirdly large, but I'm not going to reboot just to check the number. So I guess this matters after all, at least for the 805/905 chips. tx-usecs didn't change pings, either incoming or outgoing...
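Since ethtool settings are lost on reboot, here is a sketch of how the coalescing settings above might be made persistent under Debian’s traditional ifupdown configuration (the stanza and addressing method here are hypothetical; only the ethtool line matches what I actually set):

```shell
# /etc/network/interfaces fragment (hypothetical example):
# reapply the low-latency coalescing settings whenever end0 comes up.
auto end0
iface end0 inet dhcp
    post-up ethtool -C end0 rx-usecs 25 tx-usecs 1 tx-frames 0
```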

EEE (Energy-Efficient Ethernet) can also delay things. IIRC the frite doesn’t accept setting it to off, but the switches involved here do have it disabled. Oh yeah, the frite rejects even --show-eee, oh well.
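On hardware whose driver does support it, checking and disabling EEE looks like this (interface name eth0 assumed; expect a brief link renegotiation when changing it):

```shell
# Query Energy-Efficient Ethernet status, then disable it so the PHY
# doesn't add wake-up delays to idle links. Requires root.
ethtool --show-eee eth0
ethtool --set-eee eth0 eee off
```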

What has probably had the biggest effect on NTP performance (after reducing the coalesce delays as much as possible; see the next posting for more) has been disabling the deeper idle state and, for the BBB and frite, choosing the performance (non-)scaling cpufreq governor. For the frite:

#!/bin/sh

# disable low power states to improve interrupt response time (for chrony, mostly)
for c in 0 1 2 3
do
	echo 1 >/sys/bus/cpu/devices/cpu$c/cpuidle/state1/disable
done

# there is only one cpufreq governor to bind them, one to run them...
echo performance >/sys/bus/cpu/devices/cpu0/cpufreq/scaling_governor


exit 0

This doesn’t seem to change the power draw much if at all for the frite, at least when it’s lightly loaded.
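A quick way to double-check that the script above took effect, reading back the same sysfs paths it writes (a sketch; exact CPU count varies by board):

```shell
# Governor should read "performance"; each CPU's state1 disable flag
# should read 1 after the script has run.
cat /sys/bus/cpu/devices/cpu0/cpufreq/scaling_governor
grep . /sys/bus/cpu/devices/cpu[0-3]/cpuidle/state1/disable
```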

With all this (and more that I’ve tried along the way), I find the frite (and the sweetp) make less good NTP servers than the old single-core BBB. The apu3 leaves both of them in the dust, but those have been EOL for a couple of years now. This is mostly in the context of using them with a GPS PPS source on a local ethernet. Talking to anything out there in the Interwebs is doing great if it’s only a few msec off, of course.

Just for the record, I am not a timenut: no rubidium or cesium references in the building, nope. :wink:

Well, I had reason to reboot the frite this morning, and with this discussion fresh in my mind I even remembered to comment out the settings in the interfaces config file (traditional Debian manual network config). And the stock settings are… clearly not concerned with low latency:

# ethtool -c end0
Coalesce parameters for end0:
   ...
rx-usecs: 246
rx-frames: 0
  ...
tx-usecs: 1000
tx-frames: 25

Ping from GHz to frite, then vice-versa:
rtt min/avg/max/mdev = 0.478/0.502/0.550/0.022 ms
rtt min/avg/max/mdev = 0.475/0.494/0.546/0.019 ms

And NTP offsets go from ~25us to ~135us, as expected for an increase in receive delay of ~220us (the calculated offset changes by half the added delay; it’s a long story). [Oops, I was thinking of this paper, with error analysis in sections 6 and later.] And that’s why the outdated Beagle Bone Black makes a better NTP stratum 1 reference than the AML boards.
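The “half the added delay” part can be sketched numerically: NTP estimates the offset as ((T2-T1)+(T3-T4))/2, so a delay added to only one direction (here the receive timestamp T2) shifts the estimate by half that amount. The timestamps below are made-up round numbers; the 220us matches the coalescing increase above:

```shell
#!/bin/sh
# NTP offset estimate = ((T2-T1) + (T3-T4)) / 2, values in microseconds.
# Adding D to the receive timestamp T2 shifts the estimate by D/2.
T1=0; T2=300; T3=400; T4=700; D=220
base=$(( ((T2 - T1) + (T3 - T4)) / 2 ))
skew=$(( ((T2 + D - T1) + (T3 - T4)) / 2 ))
echo "offset without added delay: ${base} us"   # 0 us
echo "offset with added rx delay: ${skew} us"   # 110 us
```

A 110us shift on top of a ~25us baseline lands right at the ~135us observed above.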

I’m willing to believe that similarly latency-hostile defaults on the a311d account for the other ~200us of ping latency, and could probably be mitigated using ethtool as I described previously.

Everything is a trade-off between stability, throughput, and latency. The Linux driver for the PHY and MAC sets the delay and data calibration to auto by default.

These must be manually calibrated, along with the interrupt handling by the CPU/Linux/IRQ/AHB priority subsystems. The comparison with a single-core BeagleBone is not apt, because multi-core systems running modern Linux are inherently more complex than low-complexity devices running simpler versions of Linux. Modern devices are optimized for throughput, not latency.

You must tune for your application and cannot expect the default configuration to fit all of your needs. This is why we provide all of the code in open-source format.

You can also use this datasheet for ideas, as the IP is very similar for the most part: https://ww1.microchip.com/downloads/aemDocuments/documents/OTH/ProductDocuments/DataSheets/LAN83C185-Data-Sheet-DS00002808A.pdf

And you wonder why I’m less than impressed with your documentation? A datasheet for IP that is “very similar for the most part”… and it’s for the PHY, which is the wrong part of the network hardware for the coalescing issue.

“Everything is a trade-off between stability, throughput, and latency” is like an AI’s description of, well, most anything: technically correct but hardly relevant in context. Oh, for the record, the BBB is running Debian Bookworm, not some archaic, simpler version of Linux.

Anyway, I’m satisfied that your Spuds are simply not the answer to my NTP plans (1), so I’m certainly not going to try to fix your device’s driver code, if it is indeed just the code limiting the range of the rx-coalescing setting for no good reason rather than a baked-in hardware problem. That shouldn’t interfere with using one as a networked print server, for example.

Let me be clear about the default setting: you have your opinion of it, but THAT WAS NEVER WHAT I USED EXCEPT FOR THE TESTING ABOVE. I don’t expect general-purpose systems to be properly set up out of the box; they never are.

(1) You can make a pretty decent local NTP server out of any of these little Spuds and a GPS with PPS. Certainly better than the FC-NTP-MINI that someone in China has been churning out for years now. But if there’s another good time source that doesn’t have the 25 (or 135 by default) usec offset, they can work together to confuse the NTP clients; that was what first led me to recognize the built-in offset error in these boards.

We don’t make the SoCs. You’re barking up the wrong tree.

But this is the tree that designed them, no?

Woof!