Re: [Intel-wired-lan] [PATCH iwl-next v4] igb: Retrieve Tx timestamp from BH workqueue

From: Miroslav Lichvar

Date: Mon Mar 09 2026 - 12:08:47 EST


On Mon, Mar 09, 2026 at 01:37:54AM -0700, Jacob Keller wrote:
> On 3/4/2026 1:35 AM, Miroslav Lichvar wrote:
> > With the patch:
> > 150000 15000 5.11% 0.00% 0.00% 94.88% +4426 +460642 +640884 83746
> > 157500 15750 11.54% 0.00% 0.26% 88.20% +14434 +543656 +738355 30349
> > 165375 16384 15.61% 0.00% 0.31% 84.08% +35822 +515304 +833859 25596
> > 173643 16384 19.58% 0.00% 0.37% 80.05% +20762 +568962 +900100 28118
> > 182325 16384 23.46% 0.00% 0.42% 76.13% +41829 +547974 +804170 27890
> > 191441 16384 27.23% 0.00% 0.46% 72.31% +15182 +557920 +798212 28868
> > 201013 16384 30.51% 0.00% 0.49% 69.00% +15980 +560764 +805576 29979
> > 211063 16384 0.06% 0.00% 0.00% 99.94% +12668 +80487 +410555 62182
> > 221616 16384 2.94% 0.00% 0.05% 97.00% +21587 +342769 +517566 23359
> > 232696 16384 6.94% 0.00% 0.10% 92.96% +16581 +336068 +484574 18453
> > 244330 16384 11.45% 0.00% 0.14% 88.41% +23608 +345023 +564130 19177
> >
>
>
> With the fix, we have a higher lost percentage, which sounds bad to
> me..? And we have a higher response time (which also sounds bad??) and
> we have a much worse standard deviation across all the values from low
> to high rate.
>
> I guess I just don't understand what these numbers mean and why its
> "better" with the patch. Perhaps its the naming? Or perhaps "xleave" is
> bad, and this is showing that with the patch we get less of that? But
> that looks like it gets consistently lower as the rate and number of
> clients goes up.

xleave is 100% when the server responds to all requests and all
responses are in the expected (interleaved) mode. That is the ideal
result.

The patch doesn't seem to change the absolute maximum rate of
responses, unlike with the previous versions, but at some lower rates
up to 30% responses are missing, apparently due to the kernel being
able to fetch HW timestamps from the NIC at a higher rate, i.e. the
driver is trading network traffic for HW timestamps.

> > At 211063 requests per second and higher the performance looks the
> > same. But at the lower rates there is a clear drop. The higher
> > mean response time (difference between server TX and RX timestamps)
> > indicates more of the provided TX timestamps are hardware timestamps
> > and the chrony server timestamp statistics confirm that.
> >
>
>
> So you're saying a higher mean response time is.. better? What is it
> really measuring then? Oh. I see. it has a higher response time because
> it takes longer to get a Tx timestamp, but the provided timestamp is
> higher quality. While previously it was using software timestamps so it
> could reply faster (since it takes less time to get the software
> timestamp back out) but the quality is lower?
>
> Ok. That makes a bit more sense...

A transmit HW timestamp is taken later than SW timestamp, so there
is a larger difference between the TX and RX timestamps. ntpperf can
also measure the accuracy of the TX timestamp directly, but it's a
more difficult test to set up as the HW clocks need be synchronized to
each other.

--
Miroslav Lichvar