From: Pete Heist <pete@heistp.net>
To: Cake List <cake@lists.bufferbloat.net>
Subject: [Cake] lockup with multiple cake instances on 3.16.7
Date: Wed, 30 Jan 2019 09:58:50 +0100 [thread overview]
Message-ID: <15FB76CC-44B2-496B-80EC-8D00AD2AF9B7@heistp.net> (raw)
[-- Attachment #1: Type: text/plain, Size: 1097 bytes --]
I’m seeing a lockup with cake on 3.16.7 with 64-bit Intel when adding more than 4 cake instances. I’m using cake and tc-adv from head. Does anyone have an idea what this could be? It can be reproduced as follows:
-----
# Either htb of hfsc can be used, but it doesn’t happen with fq_codel or sfq.
# Also, when magic number “5” below is changed to “4”, no lockup occurs.
tc qdisc add dev eth0 root handle 1: htb default 1
for i in {1..5}; do
tc class add dev eth0 parent 1: classid 1:$i htb rate 100mbit ceil 100mbit
tc qdisc add dev eth0 parent 1:$i cake
done
# lockup occurs on execution of this command
tc qdisc show dev eth0
-----
When the lockup occurs, all network activity stops and the serial console freezes. Eventually the machine either reboots itself or needs to be power cycled. I caught the attached output to the console. I do not see it on kernel 3.10.107 with mipsle, or any other later kernels that I have, which are all 4.x and later.
Otherwise, it looks like we’ll have to upgrade these older kernels at FreeNet to use cake… :)
[-- Attachment #2: cake_lockup.txt --]
[-- Type: text/plain, Size: 10857 bytes --]
[192765.830059] INFO: rcu_sched detected stalls on CPUs/tasks: {} (detected by 0, t=60002 )
[192765.841407] INFO: Stall ended before state dump start
[192825.841319] INFO: rcu_sched self-detected stall on CPU[192825.842328] INFO: rcu_sched d)
[192825.842345] sending NMI to all CPUs:
[192825.842352] NMI backtrace for cpu 0
[192825.842363] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G O 3.16.7-ckt9-voyage #1
[192825.842366] Hardware name: PC Engines APU/APU, BIOS 4.0 09/08/2014
[192825.842372] task: c1433878 ti: c1426000 task.ti: c1426000
[192825.842378] EIP: 0060:[<c1020035>] EFLAGS: 00200006 CPU: 0
[192825.842396] EIP is at arch_trigger_all_cpu_backtrace+0x66/0x95
[192825.842400] EAX: 00000800 EBX: c143e940 ECX: c13b6e5a EDX: 00000800
[192825.842404] ESI: f616f72c EDI: 000013bb EBP: c1427e4c ESP: c1427e44
[192825.842409] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[192825.842413] CR0: 8005003b CR2: b7764750 CR3: 3544f000 CR4: 00000790
[192825.842414] Stack:
[192825.842417] c13d0cec c143e940 c1427e74 c1064a26 00000001 00200083 c143e940 00000000
[192825.842430] 00000001 c1433878 00000000 00000000 c1427e88 c103203b f616f8e0 f326674a
[192825.842444] 0000af63 c1427e90 c106c2b4 c1427ea8 c106c4a8 c1427f28 f616f8e0 f616f514
[192825.842457] Call Trace:
[192825.842462] [<c1064a26>] rcu_check_callbacks+0x373/0x41c
[192825.842478] [<c103203b>] update_process_times+0x2d/0x50
[192825.842490] [<c106c2b4>] tick_sched_handle+0x30/0x32
[192825.842503] [<c106c4a8>] tick_sched_timer+0x31/0x55
[192825.842511] [<c103fcca>] __run_hrtimer.isra.24+0x3e/0x94
[192825.842524] [<c1040417>] hrtimer_interrupt+0xeb/0x1d9
[192825.842534] [<c101e781>] local_apic_timer_interrupt+0x41/0x46
[192825.842543] [<c101ea9c>] smp_apic_timer_interrupt+0x2e/0x3d
[192825.842552] [<c131a6dd>] apic_timer_interrupt+0x2d/0x40
[192825.842564] [<c1202441>] ? acpi_safe_halt+0x17/0x1a
[192825.842575] [<c120297f>] acpi_idle_enter_c1+0x4e/0x74
[192825.842583] [<c126adc6>] cpuidle_enter_state+0x2c/0x96
[192825.842595] [<c126aee8>] cpuidle_enter+0xf/0x12
[192825.842603] [<c1051e54>] cpu_startup_entry+0x135/0x1e1
[192825.842616] [<c1313161>] rest_init+0x5d/0x5f
[192825.842625] [<c1462a40>] start_kernel+0x359/0x360
[192825.842635] [<c14622af>] i386_start_kernel+0x79/0x7d
[192825.842644] Code: ba 54 6e 3b c1 74 05 ba ec 0c 3d c1 52 68 5a 6e 3b c1 e8 98 5a 2f 00 d
[192825.842742] NMI backtrace for cpu 1
[192825.842751] CPU: 1 PID: 25469 Comm: tc Tainted: G O 3.16.7-ckt9-voyage #1
[192825.842753] Hardware name: PC Engines APU/APU, BIOS 4.0 09/08/2014
[192825.842757] task: f5e20470 ti: f3ea6000 task.ti: f3ea6000
[192825.842762] EIP: 0060:[<c1221236>] EFLAGS: 00000002 CPU: 1
[192825.842773] EIP is at io_serial_in+0xd/0x12
[192825.842777] EAX: c14f5100 EBX: c14f51ec ECX: 00000000 EDX: 000003fd
[192825.842780] ESI: 0000270b EDI: 00000020 EBP: f3ea7a44 ESP: f3ea7a44
[192825.842784] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
[192825.842787] CR0: 80050033 CR2: 080c6b74 CR3: 35d89000 CR4: 00000790
[192825.842788] Stack:
[192825.842790] f3ea7a58 c1221650 c14f51ec 00000031 c12216a6 f3ea7a68 c12216b9 c14e25a1
[192825.842802] c14f51ec f3ea7a80 c121e838 c14e25ce c14f51ec 00000001 c122123b f3ea7aa8
[192825.842812] c12224c6 c12216a6 0000003a c14e2594 00000005 00000046 c144dec4 c1222422
[192825.842823] Call Trace:
[192825.842827] [<c1221650>] wait_for_xmitr+0x19/0x6f
[192825.842836] [<c12216a6>] ? wait_for_xmitr+0x6f/0x6f
[192825.842843] [<c12216b9>] serial8250_console_putchar+0x13/0x20
[192825.842851] [<c121e838>] uart_console_write+0x2b/0x36
[192825.842857] [<c122123b>] ? io_serial_in+0x12/0x12
[192825.842863] [<c12224c6>] serial8250_console_write+0xa4/0xe4
[192825.842870] [<c12216a6>] ? wait_for_xmitr+0x6f/0x6f
[192825.842876] [<c1222422>] ? serial8250_modem_status+0x9c/0x9c
[192825.842883] [<c105bb11>] call_console_drivers.constprop.24+0x63/0x6d
[192825.842892] [<c105c769>] console_unlock+0x13a/0x371
[192825.842900] [<c105ce55>] vprintk_emit+0x348/0x354
[192825.842907] [<c1315ad0>] printk+0x17/0x19
[192825.842915] [<c10647ad>] rcu_check_callbacks+0xfa/0x41c
[192825.842924] [<c103203b>] update_process_times+0x2d/0x50
[192825.842931] [<c106c2b4>] tick_sched_handle+0x30/0x32
[192825.842939] [<c106c4a8>] tick_sched_timer+0x31/0x55
[192825.842946] [<c103fcca>] __run_hrtimer.isra.24+0x3e/0x94
[192825.842954] [<c1040417>] hrtimer_interrupt+0xeb/0x1d9
[192825.842962] [<c101e781>] local_apic_timer_interrupt+0x41/0x46
[192825.842970] [<c101ea9c>] smp_apic_timer_interrupt+0x2e/0x3d
[192825.842977] [<c131a6dd>] apic_timer_interrupt+0x2d/0x40
[192825.842985] [<c1319ab5>] ? _raw_spin_lock_bh+0x27/0x2b
[192825.842991] [<c128519e>] gnet_stats_start_copy_compat+0x27/0x8d
[192825.842999] [<c129fd72>] tc_fill_qdisc+0x195/0x225
[192825.843007] [<c129fe91>] tc_dump_qdisc_root+0x8f/0xc4
[192825.843014] [<c129ff45>] tc_dump_qdisc+0x7f/0xd6
[192825.843020] [<c12a51bb>] netlink_dump+0xb4/0x201
[192825.843027] [<c12a5488>] netlink_recvmsg+0x180/0x28b
[192825.843035] [<c1279652>] sock_recvmsg+0x59/0x72
[192825.843042] [<c11a5b1a>] ? _copy_from_user+0x29/0x3f
[192825.843049] [<c128334b>] ? verify_iovec+0x4b/0x80
[192825.843055] [<c12795f9>] ? sock_sendmsg_nosec+0x6a/0x6a
[192825.843060] [<c1279ee6>] ___sys_recvmsg.part.27+0xd1/0x160
[192825.843067] [<c12795f9>] ? sock_sendmsg_nosec+0x6a/0x6a
[192825.843073] [<c119c781>] ? cpumask_any_but+0x21/0x30
[192825.843080] [<c108a655>] ? lru_cache_add+0x8/0xa
[192825.843088] [<c109f2fd>] ? page_add_new_anon_rmap+0x77/0xb5
[192825.843096] [<c109a143>] ? handle_mm_fault+0x2ba/0x674
[192825.843103] [<c11a5b1a>] ? _copy_from_user+0x29/0x3f
[192825.843109] [<c127b2d0>] __sys_recvmsg+0x3b/0x59
[192825.843116] [<c127b68e>] SyS_socketcall+0x186/0x1b2
[192825.843123] [<c1002911>] ? smp_threshold_interrupt+0x5/0x5
[192825.843130] [<c1024ee8>] ? vmalloc_sync_all+0x106/0x106
[192825.843138] [<c1319f5e>] syscall_call+0x7/0x7
[192825.843145] Code: 0f b6 48 35 d3 e2 03 50 08 89 1a 5b 5d c3 55 0f b6 48 35 89 e5 d3 e2 3
[192826.399601] { 1} (t=60561 jiffies g=1443066 c=1443065 q=5051)
[192826.405893] sending NMI to all CPUs:
[192826.409560] NMI backtrace for cpu 1
[192826.413149] CPU: 1 PID: 25469 Comm: tc Tainted: G O 3.16.7-ckt9-voyage #1
[192826.420981] Hardware name: PC Engines APU/APU, BIOS 4.0 09/08/2014
[192826.427247] task: f5e20470 ti: f3ea6000 task.ti: f3ea6000
[192826.432732] EIP: 0060:[<c102003b>] EFLAGS: 00000006 CPU: 1
[192826.438309] EIP is at arch_trigger_all_cpu_backtrace+0x6c/0x95
[192826.444234] EAX: 00000800 EBX: 00002710 ECX: c13b6e5a EDX: 00000800
[192826.450585] ESI: f617b72c EDI: 000013bb EBP: f3ea7b28 ESP: f3ea7b24
[192826.456940] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
[192826.462435] CR0: 80050033 CR2: 080c6b74 CR3: 35d89000 CR4: 00000790
[192826.468794] Stack:
[192826.470899] c143e940 f3ea7b64 c1064832 c13bf23b 0000ec91 001604fa 001604f9 000013bb
[192826.478933] 00000001 f5e20470 c143e940 00000001 0b7911e7 f5e20470 00000000 00000001
[192826.486966] f3ea7b78 c103203b f617b8e0 f31720a2 0000af63 f3ea7b80 c106c2b4 f3ea7b98
[192826.495003] Call Trace:
[192826.497549] [<c1064832>] rcu_check_callbacks+0x17f/0x41c
[192826.503041] [<c103203b>] update_process_times+0x2d/0x50
[192826.508450] [<c106c2b4>] tick_sched_handle+0x30/0x32
[192826.513597] [<c106c4a8>] tick_sched_timer+0x31/0x55
[192826.518659] [<c103fcca>] __run_hrtimer.isra.24+0x3e/0x94
[192826.524152] [<c1040417>] hrtimer_interrupt+0xeb/0x1d9
[192826.529388] [<c101e781>] local_apic_timer_interrupt+0x41/0x46
[192826.535315] [<c101ea9c>] smp_apic_timer_interrupt+0x2e/0x3d
[192826.541070] [<c131a6dd>] apic_timer_interrupt+0x2d/0x40
[192826.546478] [<c1319ab5>] ? _raw_spin_lock_bh+0x27/0x2b
[192826.551800] [<c128519e>] gnet_stats_start_copy_compat+0x27/0x8d
[192826.557902] [<c129fd72>] tc_fill_qdisc+0x195/0x225
[192826.562875] [<c129fe91>] tc_dump_qdisc_root+0x8f/0xc4
[192826.568109] [<c129ff45>] tc_dump_qdisc+0x7f/0xd6
[192826.572912] [<c12a51bb>] netlink_dump+0xb4/0x201
[192826.577713] [<c12a5488>] netlink_recvmsg+0x180/0x28b
[192826.582862] [<c1279652>] sock_recvmsg+0x59/0x72
[192826.587576] [<c11a5b1a>] ? _copy_from_user+0x29/0x3f
[192826.592721] [<c128334b>] ? verify_iovec+0x4b/0x80
[192826.597610] [<c12795f9>] ? sock_sendmsg_nosec+0x6a/0x6a
[192826.603020] [<c1279ee6>] ___sys_recvmsg.part.27+0xd1/0x160
[192826.608686] [<c12795f9>] ? sock_sendmsg_nosec+0x6a/0x6a
[192826.614096] [<c119c781>] ? cpumask_any_but+0x21/0x30
[192826.619247] [<c108a655>] ? lru_cache_add+0x8/0xa
[192826.624046] [<c109f2fd>] ? page_add_new_anon_rmap+0x77/0xb5
[192826.629800] [<c109a143>] ? handle_mm_fault+0x2ba/0x674
[192826.635121] [<c11a5b1a>] ? _copy_from_user+0x29/0x3f
[192826.640270] [<c127b2d0>] __sys_recvmsg+0x3b/0x59
[192826.645069] [<c127b68e>] SyS_socketcall+0x186/0x1b2
[192826.650130] [<c1002911>] ? smp_threshold_interrupt+0x5/0x5
[192826.655801] [<c1024ee8>] ? vmalloc_sync_all+0x106/0x106
[192826.661208] [<c1319f5e>] syscall_call+0x7/0x7
[192826.665743] Code: 05 ba ec 0c 3d c1 52 68 5a 6e 3b c1 e8 98 5a 2f 00 8b 0d dc b4 43 c1 8
[192826.686274] NMI backtrace for cpu 0
[192826.689875] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G O 3.16.7-ckt9-voyage #1
[192826.697969] Hardware name: PC Engines APU/APU, BIOS 4.0 09/08/2014
[192826.704246] task: c1433878 ti: c1426000 task.ti: c1426000
[192826.709741] EIP: 0060:[<c1202441>] EFLAGS: 00200246 CPU: 0
[192826.715328] EIP is at acpi_safe_halt+0x17/0x1a
[192826.719869] EAX: 00000000 EBX: f5cec464 ECX: 00000001 EDX: 00000001
[192826.726232] ESI: 00000001 EDI: f5cec400 EBP: c1427f64 ESP: c1427f64
[192826.732592] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[192826.738087] CR0: 8005003b CR2: b7764750 CR3: 35007000 CR4: 00000790
[192826.744445] Stack:
[192826.746553] c1427f78 c120297f c1449560 f5ef2800 c14495bc c1427f98 c126adc6 00000001
[192826.754587] 14f241e3 0000af64 c1449560 c1427fec 00000001 c1427fa4 c126aee8 f5ef2800
[192826.762618] c1427fcc c1051e54 c1426000 00037191 c1427fec c1427fec c1449560 00037191
[192826.770655] Call Trace:
[192826.773214] [<c120297f>] acpi_idle_enter_c1+0x4e/0x74
[192826.778457] [<c126adc6>] cpuidle_enter_state+0x2c/0x96
[192826.783783] [<c126aee8>] cpuidle_enter+0xf/0x12
[192826.788502] [<c1051e54>] cpu_startup_entry+0x135/0x1e1
[192826.793828] [<c1313161>] rest_init+0x5d/0x5f
[192826.798284] [<c1462a40>] start_kernel+0x359/0x360
[192826.803170] [<c14622af>] i386_start_kernel+0x79/0x7d
[192826.808312] Code: f8 43 4f c1 89 e5 b8 0f 00 00 00 e8 4e 1b ff ff 31 c0 5d c3 64 a1 e8 f
[192826.829309] INFO: NMI handler (arch_trigger_all_cpu_backtrace_handler) took too long tos
[-- Attachment #3: cake_lockup2.txt --]
[-- Type: text/plain, Size: 2004 bytes --]
[ 1035.517370] ------------[ cut here ]------------
[ 1035.522085] WARNING: CPU: 1 PID: 0 at net/sched/sch_generic.c:264 dev_watchdog+0xf4/0x18)
[ 1035.530493] NETDEV WATCHDOG: eth0 (r8169): transmit queue 0 timed out
[ 1035.536938] Modules linked in: em_meta cls_basic sch_cake(O) sch_hfsc xt_ACCOUNT(O) ipt_n
[ 1035.588357] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G O 3.16.7-ckt9-voyage #1
[ 1035.596366] Hardware name: PC Engines APU/APU, BIOS 4.0 09/08/2014
[ 1035.602555] 00000000 00000000 f5c99f30 c13167e9 c13f2100 f5c99f48 c102a7dd c129e917
[ 1035.610537] f5fe1000 00000000 0009f2e0 f5c99f60 c102a858 00000009 f5c99f58 c13f2100
[ 1035.618527] f5c99f74 f5c99f90 c129e917 c13f2139 00000108 c13f2100 f5fe1000 f80f9a93
[ 1035.626499] Call Trace:
[ 1035.628975] [<c13167e9>] dump_stack+0x41/0x52
[ 1035.633442] [<c102a7dd>] warn_slowpath_common+0x5c/0x73
[ 1035.638770] [<c129e917>] ? dev_watchdog+0xf4/0x186
[ 1035.643666] [<c102a858>] warn_slowpath_fmt+0x26/0x2a
[ 1035.648737] [<c129e917>] dev_watchdog+0xf4/0x186
[ 1035.653465] [<c129e823>] ? pfifo_fast_dequeue+0xaa/0xaa
[ 1035.658799] [<c103160e>] call_timer_fn.isra.30+0x12/0x61
[ 1035.664213] [<c103187f>] run_timer_softirq+0x12b/0x171
[ 1035.669460] [<c102d689>] __do_softirq+0x91/0x175
[ 1035.674184] [<c102d5f8>] ? __hrtimer_tasklet_trampoline+0x1a/0x1a
[ 1035.680378] [<c10033c3>] do_softirq_own_stack+0x1d/0x23
[ 1035.685702] <IRQ> [<c102d8a9>] irq_exit+0x34/0x75
[ 1035.690642] [<c101eaa1>] smp_apic_timer_interrupt+0x33/0x3d
[ 1035.696318] [<c131a6dd>] apic_timer_interrupt+0x2d/0x40
[ 1035.701648] [<c1202441>] ? acpi_safe_halt+0x17/0x1a
[ 1035.706630] [<c120297f>] acpi_idle_enter_c1+0x4e/0x74
[ 1035.711787] [<c126adc6>] cpuidle_enter_state+0x2c/0x96
[ 1035.717029] [<c126aee8>] cpuidle_enter+0xf/0x12
[ 1035.721666] [<c1051e54>] cpu_startup_entry+0x135/0x1e1
[ 1035.726909] [<c101d553>] start_secondary+0x1a6/0x1ab
[ 1035.731978] ---[ end trace 915dc201bbcccd42 ]---
next reply other threads:[~2019-01-30 8:58 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-01-30 8:58 Pete Heist [this message]
2019-01-30 22:59 ` Toke Høiland-Jørgensen
2019-01-31 7:15 ` Pete Heist
2019-01-31 14:53 ` Toke Høiland-Jørgensen
2019-01-31 20:25 ` Pete Heist
2019-01-31 23:09 ` Toke Høiland-Jørgensen
2019-01-31 23:18 ` Pete Heist
2019-01-31 23:10 ` Pete Heist
2019-01-31 23:18 ` Toke Høiland-Jørgensen
2019-01-31 23:23 ` Pete Heist
2019-01-31 23:31 ` Toke Høiland-Jørgensen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
List information: https://lists.bufferbloat.net/postorius/lists/cake.lists.bufferbloat.net/
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=15FB76CC-44B2-496B-80EC-8D00AD2AF9B7@heistp.net \
--to=pete@heistp.net \
--cc=cake@lists.bufferbloat.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox