-----Original Message-----
From: netdev-owner@xxxxxxxxxxxxxxx [mailto:netdev-
owner@xxxxxxxxxxxxxxx] On Behalf Of Tommi Rantala
Sent: Monday, November 13, 2017 11:23
To: Jon Maloy <jon.maloy@xxxxxxxxxxxx>; Ying Xue
<ying.xue@xxxxxxxxxxxxx>; David S. Miller <davem@xxxxxxxxxxxxx>;
netdev@xxxxxxxxxxxxxxx; tipc-discussion@xxxxxxxxxxxxxxxxxxxxx; linux-
kernel@xxxxxxxxxxxxxxx
Subject: tipc_udp_send_msg oops in 4.4 when setting link tolerance
Hi,
I always get an instant TIPC oops in 4.4, when I try to set the link tolerance
(with LINKNAME != "broadcast-link"):
$ tipc link set tolerance 1000 link $LINKNAME
Any idea what's going on? Some tipc patch missing in 4.4?
In 4.9 the "tipc" command executes just fine, but I've seen a few times that
later some random process crashes with "BUG: Bad page state". KASAN does
not report anything before it happens.
4.14 is OK, could not reproduce these problems with it.
tipc_udp_send_msg+0x102/0x4f0
matches to:
tipc_udp_send_msg at linux-stable/net/tipc/udp_media.c:172
static int tipc_udp_send_msg(struct net *net, struct sk_buff *skb,
struct tipc_bearer *b,
struct tipc_media_addr *dest) {
int ttl, err = 0;
struct udp_bearer *ub;
struct udp_media_addr *dst = (struct udp_media_addr *)&dest->value;
struct udp_media_addr *src = (struct udp_media_addr *)&b-
addr.value;struct rtable *rt;
if (skb_headroom(skb) < UDP_MIN_HEADROOM) {
err = pskb_expand_head(skb, UDP_MIN_HEADROOM, 0,
GFP_ATOMIC);
if (err)
goto tx_error;
}
skb_set_inner_protocol(skb, htons(ETH_P_TIPC));
ub = rcu_dereference_rtnl(b->media_ptr);
if (!ub) {
err = -ENODEV;
goto tx_error;
}
if (dst->proto == htons(ETH_P_IP)) { <------ HERE
[ 111.423647]
==========================================================
========
[ 111.424826] BUG: KASAN: null-ptr-deref on address (null)
[ 111.425538] Read of size 2 by task tipc/2643 [ 111.426215] CPU: 3 PID: 2643
Comm: tipc Not tainted 4.4.97-pc64 #1 [ 111.428081] 0000000000000000
ffff880026327478 ffffffff8248005e
0000000000000002
[ 111.429476] ffff880047ad5ac0 ffff8800263274f8 ffffffff8227f5af
0000000265711040
[ 111.430728] 0000000000000000 0000000000000297 ffffffffa0387fd2
02090220ffffffff [ 111.432051] Call Trace:
[ 111.432472] [<ffffffff8248005e>] dump_stack+0x86/0xc8 [ 111.433208]
[<ffffffff8227f5af>] kasan_report.part.2+0x41f/0x520 [ 111.434040]
[<ffffffffa0387fd2>] ? tipc_udp_send_msg+0x102/0x4f0 [tipc] [ 111.434908]
[<ffffffff8227f965>] kasan_report+0x25/0x30 [ 111.435647]
[<ffffffff8227e3a6>] __asan_load2+0x66/0x70 [ 111.436391]
[<ffffffffa0387fd2>] tipc_udp_send_msg+0x102/0x4f0 [tipc] [ 111.437334]
[<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [ 111.438301]
[<ffffffff8227edfd>] ? kasan_slab_alloc+0xd/0x10 [ 111.439328]
[<ffffffff8227e04c>] ?
__kmalloc_node_track_caller+0xac/0x230
[ 111.440493] [<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [
111.441479] [<ffffffffa0387ed0>] ? tipc_udp_disable+0xe0/0xe0 [tipc] [
111.442628] [<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [ 111.443598]
[<ffffffff8227ef52>] ? kasan_krealloc+0x62/0x80 [ 111.444610]
[<ffffffff8227ebf8>] ? memset+0x28/0x30 [ 111.445539] [<ffffffff8275fab3>]
? __alloc_skb+0x2b3/0x310 [ 111.446560] [<ffffffff8275f800>] ?
skb_complete_tx_timestamp+0x110/0x110
[ 111.447695] [<ffffffff82147a16>] ? __module_text_address+0x16/0xa0 [
111.448735] [<ffffffff8275e3fb>] ? skb_put+0x8b/0xd0 [ 111.449608]
[<ffffffff8227ec76>] ? memcpy+0x36/0x40 [ 111.450524]
[<ffffffffa03665e8>] ?
tipc_link_build_proto_msg+0x398/0x4c0 [tipc] [ 111.451946]
[<ffffffffa0364920>] tipc_bearer_xmit_skb+0xa0/0xb0 [tipc] [ 111.453078]
[<ffffffffa036a60b>] tipc_link_proto_xmit+0x11b/0x160 [tipc] [ 111.454218]
[<ffffffffa036a4f0>] ?
tipc_link_build_reset_msg+0x50/0x50 [tipc] [ 111.455542]
[<ffffffffa036c5be>] tipc_nl_link_set+0x1ee/0x3b0 [tipc] [ 111.456659]
[<ffffffffa036c3d0>] ? tipc_nl_parse_link_prop+0xd0/0xd0 [tipc] [
111.457831] [<ffffffff82190a29>] ? is_ftrace_trampoline+0x59/0x90 [
111.458884] [<ffffffff820b15a5>] ? __kernel_text_address+0x65/0x80 [
111.459931] [<ffffffff824ba386>] ? nla_parse+0xb6/0x140 [ 111.460892]
[<ffffffff827d20ee>] genl_family_rcv_msg+0x37e/0x5e0 [ 111.461948]
[<ffffffffa0380005>] ? set_orig_addr.isra.53+0xe5/0x120 [tipc] [ 111.463107]
[<ffffffff827d1d70>] ? genl_rcv+0x40/0x40 [ 111.463987]
[<ffffffff82278864>] ? alloc_debug_processing+0x154/0x180
[ 111.465048] [<ffffffff8227a39d>] ? ___slab_alloc+0x43d/0x460 [
111.465986] [<ffffffff82278864>] ? alloc_debug_processing+0x154/0x180
[ 111.467045] [<ffffffff827cde5c>] ? netlink_lookup+0x19c/0x220 [
111.468067] [<ffffffff827d2428>] genl_rcv_msg+0xd8/0x110 [ 111.468994]
[<ffffffff827d143b>] netlink_rcv_skb+0x14b/0x180 [ 111.469939]
[<ffffffff827d2350>] ? genl_family_rcv_msg+0x5e0/0x5e0 [ 111.470954]
[<ffffffff827d1d58>] genl_rcv+0x28/0x40 [ 111.471798] [<ffffffff827d0a27>]
netlink_unicast+0x2e7/0x3a0 [ 111.472806] [<ffffffff827d0740>] ?
netlink_attachskb+0x330/0x330 [ 111.473845] [<ffffffff8249b731>] ?
copy_from_iter+0xf1/0x3b0 [ 111.474847] [<ffffffff827d0f8d>]
netlink_sendmsg+0x4ad/0x620 [ 111.475788] [<ffffffff827d0ae0>] ?
netlink_unicast+0x3a0/0x3a0 [ 111.476793] [<ffffffff822c0683>] ?
__fdget+0x13/0x20 [ 111.477723] [<ffffffff82751575>] ?
sockfd_lookup_light+0x95/0xb0 [ 111.478773] [<ffffffff827538fc>]
SYSC_sendto+0x1bc/0x290 [ 111.479659] [<ffffffff82753740>] ?
sock_write_iter+0x200/0x200 [ 111.480692] [<ffffffff822c0683>] ?
__fdget+0x13/0x20 [ 111.481559] [<ffffffff82751575>] ?
sockfd_lookup_light+0x95/0xb0 [ 111.482591] [<ffffffff827caf71>] ?
netlink_getname+0xb1/0x110 [ 111.483570] [<ffffffff82750b0c>] ?
move_addr_to_user+0x5c/0x70 [ 111.484539] [<ffffffff82751706>] ?
SYSC_getsockname+0x176/0x190 [ 111.485540] [<ffffffff82751590>] ?
sockfd_lookup_light+0xb0/0xb0 [ 111.486558] [<ffffffff82753225>] ?
SYSC_bind+0xe5/0x180 [ 111.487548] [<ffffffff82753140>] ?
__sock_recv_ts_and_drops+0x260/0x260
[ 111.488700] [<ffffffff822c132b>] ? fd_install+0x3b/0x50 [ 111.489596]
[<ffffffff827514b4>] ? sock_map_fd+0x44/0x70 [ 111.490553]
[<ffffffff82753f4c>] ? SyS_socket+0xcc/0x120 [ 111.491437]
[<ffffffff82753e80>] ? move_addr_to_kernel+0x40/0x40 [ 111.492505]
[<ffffffff820022b6>] ? exit_to_usermode_loop+0x86/0x120 [ 111.493557]
[<ffffffff82002017>] ? trace_hardirqs_on_thunk+0x17/0x19 [ 111.494629]
[<ffffffff827544ce>] SyS_sendto+0xe/0x10 [ 111.495588]
[<ffffffff829299ae>] entry_SYSCALL_64_fastpath+0x12/0x6d
[ 111.496697]
==========================================================
========
[ 111.498005] Disabling lock debugging due to kernel taint [ 111.499059]
BUG: unable to handle kernel NULL pointer dereference at
(null)
[ 111.500698] IP: [<ffffffffa0387fd2>] tipc_udp_send_msg+0x102/0x4f0
[tipc] [ 111.502027] PGD 4b01c067 PUD 1f0a5067 PMD 0 [ 111.503053] Oops:
0000 [#1] SMP KASAN [ 111.503980] Modules linked in: ip6table_mangle
ip6_tables iptable_mangle iptable_filter ip_tables x_tables tipc
ip6_udp_tunnel udp_tunnel fuse isofs aesni_intel aes_x86_64 glue_helper
lrw gf128mul ablk_helper cryptd ata_piix i6300esb sch_fq_codel
nf_conntrack_proto_sctp nf_conntrack autofs4
[ 111.509927] CPU: 3 PID: 2643 Comm: tipc Tainted: G B
4.4.97-pc64 #1
[ 111.511249] Hardware name: Fedora Project OpenStack Nova, BIOS
seabios-1.7.5-11.el7.tis.1 04/01/2014
[ 111.512935] task: ffff880047ad5ac0 ti: ffff880026320000 task.ti:
ffff880026320000
[ 111.514283] RIP: 0010:[<ffffffffa0387fd2>] [<ffffffffa0387fd2>]
tipc_udp_send_msg+0x102/0x4f0 [tipc]
[ 111.515960] RSP: 0018:ffff880026327528 EFLAGS: 00010292 [ 111.516832]
RAX: ffff880047ad5ac0 RBX: ffff880065711040 RCX:
0000000000000000
[ 111.517992] RDX: 1ffffffff06b9196 RSI: 0000000000000297 RDI:
0000000000000297
[ 111.519117] RBP: ffff8800263276f0 R08: 0000000000000000 R09:
fffffbfff069f014
[ 111.520228] R10: dffffc0000000001 R11: ffff88006bc02a00 R12:
1ffff10004c64eb1
[ 111.521361] R13: ffff88005ad07750 R14: 0000000000000000 R15:
ffff88005154d9e0
[ 111.522538] FS: 00007f467f3ac700(0000) GS:ffff88006c380000(0000)
knlGS:0000000000000000
[ 111.523960] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [
111.524947] CR2: 0000000000000000 CR3: 000000001f076000 CR4:
00000000001406e0
[ 111.526084] Stack:
[ 111.526551] 1ffff1000a2c11d8 ffff880026327550 ffffffff8227eb1e
ffff880051608cc0
[ 111.528272] ffff88006bc02a00 ffff880026327560 ffffffff8227edfd
ffff8800263275b0
[ 111.529886] ffffffff8227e04c ffff880026327590 ffffffff8227eb1e
ffffffff832dfec0
[ 111.531535] Call Trace:
[ 111.532106] [<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [
111.533075] [<ffffffff8227edfd>] ? kasan_slab_alloc+0xd/0x10 [ 111.534041]
[<ffffffff8227e04c>] ?
__kmalloc_node_track_caller+0xac/0x230
[ 111.535102] [<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [
111.536047] [<ffffffffa0387ed0>] ? tipc_udp_disable+0xe0/0xe0 [tipc] [
111.537186] [<ffffffff8227eb1e>] ? kasan_kmalloc+0x5e/0x70 [ 111.538139]
[<ffffffff8227ef52>] ? kasan_krealloc+0x62/0x80 [ 111.539106]
[<ffffffff8227ebf8>] ? memset+0x28/0x30 [ 111.539946] [<ffffffff8275fab3>]
? __alloc_skb+0x2b3/0x310 [ 111.540876] [<ffffffff8275f800>] ?
skb_complete_tx_timestamp+0x110/0x110
[ 111.541954] [<ffffffff82147a16>] ? __module_text_address+0x16/0xa0 [
111.542978] [<ffffffff8275e3fb>] ? skb_put+0x8b/0xd0 [ 111.543914]
[<ffffffff8227ec76>] ? memcpy+0x36/0x40 [ 111.544817]
[<ffffffffa03665e8>] ?
tipc_link_build_proto_msg+0x398/0x4c0 [tipc] [ 111.546199]
[<ffffffffa0364920>] tipc_bearer_xmit_skb+0xa0/0xb0 [tipc] [ 111.547355]
[<ffffffffa036a60b>] tipc_link_proto_xmit+0x11b/0x160 [tipc] [ 111.548482]
[<ffffffffa036a4f0>] ?
tipc_link_build_reset_msg+0x50/0x50 [tipc] [ 111.549763]
[<ffffffffa036c5be>] tipc_nl_link_set+0x1ee/0x3b0 [tipc] [ 111.550822]
[<ffffffffa036c3d0>] ? tipc_nl_parse_link_prop+0xd0/0xd0 [tipc] [
111.551921] [<ffffffff82190a29>] ? is_ftrace_trampoline+0x59/0x90 [
111.552961] [<ffffffff820b15a5>] ? __kernel_text_address+0x65/0x80 [
111.554010] [<ffffffff824ba386>] ? nla_parse+0xb6/0x140 [ 111.554906]
[<ffffffff827d20ee>] genl_family_rcv_msg+0x37e/0x5e0 [ 111.555954]
[<ffffffffa0380005>] ? set_orig_addr.isra.53+0xe5/0x120 [tipc] [ 111.557104]
[<ffffffff827d1d70>] ? genl_rcv+0x40/0x40 [ 111.557949]
[<ffffffff82278864>] ? alloc_debug_processing+0x154/0x180
[ 111.559030] [<ffffffff8227a39d>] ? ___slab_alloc+0x43d/0x460 [
111.559983] [<ffffffff82278864>] ? alloc_debug_processing+0x154/0x180
[ 111.561058] [<ffffffff827cde5c>] ? netlink_lookup+0x19c/0x220 [
111.562038] [<ffffffff827d2428>] genl_rcv_msg+0xd8/0x110 [ 111.562966]
[<ffffffff827d143b>] netlink_rcv_skb+0x14b/0x180 [ 111.563930]
[<ffffffff827d2350>] ? genl_family_rcv_msg+0x5e0/0x5e0 [ 111.564949]
[<ffffffff827d1d58>] genl_rcv+0x28/0x40 [ 111.565818] [<ffffffff827d0a27>]
netlink_unicast+0x2e7/0x3a0 [ 111.566759] [<ffffffff827d0740>] ?
netlink_attachskb+0x330/0x330 [ 111.567765] [<ffffffff8249b731>] ?
copy_from_iter+0xf1/0x3b0 [ 111.568707] [<ffffffff827d0f8d>]
netlink_sendmsg+0x4ad/0x620 [ 111.569706] [<ffffffff827d0ae0>] ?
netlink_unicast+0x3a0/0x3a0 [ 111.570658] [<ffffffff822c0683>] ?
__fdget+0x13/0x20 [ 111.571548] [<ffffffff82751575>] ?
sockfd_lookup_light+0x95/0xb0 [ 111.572541] [<ffffffff827538fc>]
SYSC_sendto+0x1bc/0x290 [ 111.573459] [<ffffffff82753740>] ?
sock_write_iter+0x200/0x200 [ 111.574435] [<ffffffff822c0683>] ?
__fdget+0x13/0x20 [ 111.575330] [<ffffffff82751575>] ?
sockfd_lookup_light+0x95/0xb0 [ 111.576354] [<ffffffff827caf71>] ?
netlink_getname+0xb1/0x110 [ 111.577371] [<ffffffff82750b0c>] ?
move_addr_to_user+0x5c/0x70 [ 111.578385] [<ffffffff82751706>] ?
SYSC_getsockname+0x176/0x190 [ 111.579407] [<ffffffff82751590>] ?
sockfd_lookup_light+0xb0/0xb0 [ 111.580431] [<ffffffff82753225>] ?
SYSC_bind+0xe5/0x180 [ 111.581369] [<ffffffff82753140>] ?
__sock_recv_ts_and_drops+0x260/0x260
[ 111.582518] [<ffffffff822c132b>] ? fd_install+0x3b/0x50 [ 111.583450]
[<ffffffff827514b4>] ? sock_map_fd+0x44/0x70 [ 111.584417]
[<ffffffff82753f4c>] ? SyS_socket+0xcc/0x120 [ 111.585353]
[<ffffffff82753e80>] ? move_addr_to_kernel+0x40/0x40 [ 111.586405]
[<ffffffff820022b6>] ? exit_to_usermode_loop+0x86/0x120 [ 111.587434]
[<ffffffff82002017>] ? trace_hardirqs_on_thunk+0x17/0x19 [ 111.588511]
[<ffffffff827544ce>] SyS_sendto+0xe/0x10 [ 111.589378]
[<ffffffff829299ae>] entry_SYSCALL_64_fastpath+0x12/0x6d
[ 111.590420] Code: 00 00 e8 e2 64 ef e1 4c 89 ef 80 a3 93 00 00 00 f7
e8 43 65 ef e1 4d 8b 7d 00 4d 85 ff 0f 84 db 03 00 00 4c 89 f7 e8 6e 63 ef e1
<66> 41 83 3e 08 0f 84 80 01 00 00 48 8d bc 24 20 01 00 00 31 c0 [ 111.598579]
RIP [<ffffffffa0387fd2>] tipc_udp_send_msg+0x102/0x4f0 [tipc] [
111.599831] RSP <ffff880026327528> [ 111.600538] CR2: 0000000000000000 [
111.601202] ---[ end trace 827dd66f798de44a ]--- [ 111.602025] Kernel panic -
not syncing: Fatal exception in interrupt [ 111.614704] Kernel Offset: disabled
[ 111.615249] Rebooting in 60 seconds..
[ 31.985039] BUG: Bad page state in process ___ pfn:400c0
[ 31.985680] page:ffffea0001003000 count:0 mapcount:0
mapping:000000000000003c index:0x0
[ 31.986619] flags: 0x10000(mappedtodisk)
[ 31.987081] page dumped because: PAGE_FLAGS_CHECK_AT_PREP flag set
[ 31.987701] bad because of flags: 0x10000(mappedtodisk)
[ 31.988268] Modules linked in: iptable_filter ip_tables x_tables tipc ...
[ 31.991804] CPU: 3 PID: 2716 Not tainted 4.9.52 #1
[ 31.993608] ffffc90003023b28 ffffffff822ee180 ffffea0001003000
ffffffff827b76c8
[ 31.994792] ffffc90003023b50 ffffffff8215c5e4 0000000000010000
ffffea0001000000
[ 31.995986] 0000000000000009 ffffc90003023b60 ffffffff8215c71f
ffffc90003023c28
[ 31.997380] Call Trace:
[ 31.997780] [<ffffffff822ee180>] dump_stack+0x86/0xc6
[ 31.998460] [<ffffffff8215c5e4>] bad_page+0xc4/0x130
[ 31.999160] [<ffffffff8215c71f>] check_new_page_bad+0x5f/0x70
[ 31.999968] [<ffffffff8215fe5a>] get_page_from_freelist+0x7ca/0xb20
[ 32.000782] [<ffffffff8216123c>] __alloc_pages_nodemask+0xdc/0x220
[ 32.001621] [<ffffffff821a4c18>] alloc_fresh_huge_page+0x68/0xc0
[ 32.002407] [<ffffffff821a590f>] set_max_huge_pages+0x4df/0x530
[ 32.003176] [<ffffffff8230698c>] ? _kstrtoull+0x2c/0x70
[ 32.003841] [<ffffffff821a59ec>] nr_hugepages_store_common+0x8c/0xf0
[ 32.004619] [<ffffffff821c2626>] ?
mem_cgroup_commit_charge+0x66/0x430
[ 32.005384] [<ffffffff821a5a83>] nr_hugepages_store+0x13/0x20
[ 32.006176] [<ffffffff822f02bf>] kobj_attr_store+0xf/0x20
[ 32.006838] [<ffffffff82237877>] sysfs_kf_write+0x37/0x40
[ 32.007534] [<ffffffff82236bcc>] kernfs_fop_write+0x11c/0x1b0
[ 32.008258] [<ffffffff821c7068>] __vfs_write+0x28/0x120
[ 32.008931] [<ffffffff820fb23d>] ? __audit_syscall_entry+0xad/0xf0
[ 32.009681] [<ffffffff821c7735>] vfs_write+0xb5/0x1a0
[ 32.010308] [<ffffffff821c8a96>] SyS_write+0x46/0xa0
[ 32.010917] [<ffffffff8204b8fa>] ? trace_do_page_fault+0x5a/0x140
[ 32.011741] [<ffffffff82002bfe>] do_syscall_64+0x7e/0x1a0
[ 32.012476] [<ffffffff825dc0c4>] entry_SYSCALL64_slow_path+0x25/0x25
[ 32.013244] Disabling lock debugging due to kernel taint
[ 34.055994] ip6_tables: (C) 2000-2006 Netfilter Core Team
-Tommi