Re: perf tools:Is there any tools to found out the max latency by irq or cpu idle

From: Paul Clarke
Date: Mon Apr 15 2019 - 08:53:09 EST


On 4/13/19 1:01 AM, Linhaifeng wrote:
> Sorry, the value 131081408 is just for example. Actually the result is like this:
> sqrt 2019-04-10 23:53:50: 43968
> sqrt 2019-04-10 23:53:51: 44060
> sqrt 2019-04-10 23:53:52: 49012
> sqrt 2019-04-10 23:53:53: 38172
> sqrt 2019-04-10 23:53:54: 131081408
> sqrt 2019-04-10 23:53:55: 43600
> sqrt 2019-04-10 23:53:56: 46704
> sqrt 2019-04-10 23:53:57: 46880
> sqrt 2019-04-10 23:53:58: 44332
> ââ
> sqrt 2019-04-10 02:17:15: 136345876
> ââ
> sqrt 2019-04-10 04:40:35: 136335644
> ââ

Given the periodicity, I'm wondering if it could be System Management Interrupts (SMIs)?

If so, some BIOSes have settings to mitigate, otherwise, there's not much that can be done, per my understanding.

PC

> -----Original Message-----
> From: William Cohen [mailto:wcohen@xxxxxxxxxx]
> Sent: 2019å4æ12æ 22:06
> To: Linhaifeng <haifeng.lin@xxxxxxxxxx>; linux-perf-users@xxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx
> Subject: Re: perf tools:Is there any tools to found out the max latency by irq or cpu idle
>
> On 4/11/19 8:57 PM, Linhaifeng wrote:
>> Hi,
>> I have a single thread application like this:
>>
>> While (1) {
>> ÂÂÂ start = rdtsc();
>> ÂÂÂ sqrt (1024);ÂÂÂÂÂ
>> ããend = rdtsc();
>> ããcycles = end â start;
>> ããprintf("cycles: %d-%02d-%02d %02d:%02d:%02d: %lu\n",
>> ãããããã1900+timeinfo->tm_year, 1+timeinfo->tm_mon, timeinfo->tm_mday, timeinfo->tm_hour, timeinfo->tm_min, timeinfo->tm_sec,
>> ããããããcycles);
>> }
>> It print the cycles of sqrt every second and run with taskset âc 1 ./sqrt.
>> The result of test is:
>>
>> sqrt 2019-04-10 23:53:50: 43968
>> sqrt 2019-04-10 23:53:51: 44060
>> sqrt 2019-04-10 23:53:52: 49012
>> sqrt 2019-04-10 23:53:53: 38172
>> sqrt 2019-04-10 23:53:54: 131081408
>> sqrt 2019-04-10 23:53:55: 43600
>> sqrt 2019-04-10 23:53:56: 46704
>> sqrt 2019-04-10 23:53:57: 46880
>> sqrt 2019-04-10 23:53:58: 44332
>> ââ
>> sqrt 2019-04-10 02:17:15: 131081408
>> ââ
>> sqrt 2019-04-10 04:40:35: 131081408
>> ââ
>>
>> Every 2hour23min there would be a large cycles. I use perf sched not found any sched_switch events.
>
> Hi,
>
> The fact that it is the same value 131081408 every 2 hours 23 minutes looks suspect. One would expect some variation in the counts. It looks like there is some rollover or overflow issue. It would be helpful to print out the start and end values to see what is going on with the raw rdstc values. Maybe the thread is being moved between processors and the TSC are out of sync. What particular processor model was this running on? Was this running on physical hardware or inside a kvm guest?
>
> According to the Intel 64 and IA-32 Architecture Software Devloper's Manual Volume 3 (325384-sdm-vol-3abcd.pdf):
>
> The RDTSC instruction reads the time-stamp counter and is guaranteed to return a monotonically increasing unique value whenever executed, except for a 64-bit counter wraparound. Intel guarantees that the time-stamp counter will not wraparound within 10 years after being reset. The period for counter wrap is longer for Pentium 4, Intel Xeon, P6 family, and Pentium processors.
>
> -Will Cohen
>
>
>>
>> L2GW_2680:/home/fsp/zn # perf sched record -C 6-11 -o perf.sched ^C[
>> perf record: Woken up 64 times to write data ] [ perf record: Captured
>> and wrote 204.878 MB perf.sched (1911189 samples) ]
>>
>> L2GW_2680:/home/fsp/zn # perf sched latency -i perf.sched
>>
>> ----------------------------------------------------------------------
>> -------------------------------------------
>>  Task | Runtime ms | Switches | Average delay ms
>> | Maximum delay ms | Maximum delay atÂÂÂÂÂÂ |
>> ----------------------------------------------------------------------
>> -------------------------------------------
>> ----------------------------------------------------------------------
>> -------------------------------------------
>> Â TOTAL:ÂÂÂÂÂÂÂÂÂÂÂÂÂÂÂ |ÂÂÂÂÂ 0.000 ms | ÂÂÂÂÂÂÂ0 |
>> ---------------------------------------------------
>>
>>
>>
>> Is there any other tools of perf to found out the max latency by irq or cpu idle ?
>>
>>
>