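Firefly-RK3288 abnormal system hang
Last edited by lh2008xp on 2018-8-2 14:44
The system is Ubuntu 16.04 LTS and the code runs on ROS Kinetic. About 5 minutes after the project starts, the board hangs. I access the RK3288 remotely over ssh; all ssh sessions drop and cannot reconnect.
With a monitor connected over HDMI, the desktop freezes: the clock on the desktop stops updating.
1. Serial console output from the RK3288 at the time of the hang: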
firefly login: INFO: task sd-resolve:597 blocked for more than 120 seconds.
Not tainted 4.4.55 #2
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Kernel panic - not syncing: hung_task: blocked tasks
CPU: 3 PID: 29 Comm: khungtaskd Not tainted 4.4.55 #2
Hardware name: Rockchip (Device Tree)
[<c01103a8>] (unwind_backtrace) from [<c010be58>] (show_stack+0x20/0x24)
[<c010be58>] (show_stack) from [<c03ff91c>] (dump_stack+0x88/0xa8)
[<c03ff91c>] (dump_stack) from [<c01e36cc>] (panic+0x98/0x204)
[<c01e36cc>] (panic) from [<c01aa8fc>] (watchdog+0x32c/0x398)
[<c01aa8fc>] (watchdog) from [<c013a3bc>] (kthread+0xf8/0x10c)
[<c013a3bc>] (kthread) from [<c01074f8>] (ret_from_fork+0x14/0x3c)
CPU1: stopping
CPU: 1 PID: 2137 Comm: robot_pose_ekf Not tainted 4.4.55 #2
Hardware name: Rockchip (Device Tree)
[<c01103a8>] (unwind_backtrace) from [<c010be58>] (show_stack+0x20/0x24)
[<c010be58>] (show_stack) from [<c03ff91c>] (dump_stack+0x88/0xa8)
[<c03ff91c>] (dump_stack) from [<c010e900>] (handle_IPI+0x168/0x2c4)
[<c010e900>] (handle_IPI) from [<c010147c>] (gic_handle_irq+0x88/0xa4)
[<c010147c>] (gic_handle_irq) from [<c010c994>] (__irq_svc+0x54/0x70)
Exception stack(0xee193d60 to 0xee193da8)
3d60: ed2e13f0 3beb3bea 00000001 dde42580 ed2e1380 00000140 ed2e13f0 00000000
3d80: ed2e17b8 00000000 00000144 ee193dbc ee193db0 ee193db0 c0b69500 c0b69500
3da0: 600f0013 ffffffff
[<c010c994>] (__irq_svc) from [<c0b69500>] (_raw_spin_lock_bh+0x30/0x34)
[<c0b69500>] (_raw_spin_lock_bh) from [<c090d160>] (release_sock+0x24/0x14c)
[<c090d160>] (release_sock) from [<c0993db4>] (tcp_recvmsg+0x754/0xa28)
[<c0993db4>] (tcp_recvmsg) from [<c09bd370>] (inet_recvmsg+0x9c/0xb8)
[<c09bd370>] (inet_recvmsg) from [<c09094e4>] (sock_recvmsg+0x4c/0x50)
[<c09094e4>] (sock_recvmsg) from [<c090a734>] (SyS_recvfrom+0xac/0xfc)
[<c090a734>] (SyS_recvfrom) from [<c090a7ac>] (SyS_recv+0x28/0x30)
[<c090a7ac>] (SyS_recv) from [<c0107440>] (ret_fast_syscall+0x0/0x3c)
CPU2: stopping
CPU: 2 PID: 0 Comm: swapper/2 Not tainted 4.4.55 #2
Hardware name: Rockchip (Device Tree)
[<c01103a8>] (unwind_backtrace) from [<c010be58>] (show_stack+0x20/0x24)
[<c010be58>] (show_stack) from [<c03ff91c>] (dump_stack+0x88/0xa8)
[<c03ff91c>] (dump_stack) from [<c010e900>] (handle_IPI+0x168/0x2c4)
[<c010e900>] (handle_IPI) from [<c010147c>] (gic_handle_irq+0x88/0xa4)
[<c010147c>] (gic_handle_irq) from [<c010c994>] (__irq_svc+0x54/0x70)
Exception stack(0xee971f50 to 0xee971f98)
1f40: 00000001 00000000 ee971fb0 c0118fa0
1f60: ee970000 00000000 00000000 ee971fc0 c10b03ac c10b4858 c0c01a44 ee971fac
1f80: ee971fb0 ee971fa0 c010805c c0108060 60010013 ffffffff
[<c010c994>] (__irq_svc) from [<c0108060>] (arch_cpu_idle+0x3c/0x48)
[<c0108060>] (arch_cpu_idle) from [<c0160fe4>] (default_idle_call+0x38/0x3c)
[<c0160fe4>] (default_idle_call) from [<c0161240>] (cpu_startup_entry+0x1fc/0x334)
[<c0161240>] (cpu_startup_entry) from [<c010e524>] (secondary_start_kernel+0x12c/0x148)
[<c010e524>] (secondary_start_kernel) from [<0010152c>] (0x10152c)
CPU0: stopping
CPU: 0 PID: 10 Comm: watchdog/0 Not tainted 4.4.55 #2
Hardware name: Rockchip (Device Tree)
[<c01103a8>] (unwind_backtrace) from [<c010be58>] (show_stack+0x20/0x24)
[<c010be58>] (show_stack) from [<c03ff91c>] (dump_stack+0x88/0xa8)
[<c03ff91c>] (dump_stack) from [<c010e900>] (handle_IPI+0x168/0x2c4)
[<c010e900>] (handle_IPI) from [<c010147c>] (gic_handle_irq+0x88/0xa4)
[<c010147c>] (gic_handle_irq) from [<c010c994>] (__irq_svc+0x54/0x70)
Exception stack(0xee96dd00 to 0xee96dd48)
dd00: ee39d940 89188917 00000000 00000000 ee39d93c ee39d940 ee39d950 ee96dd60
dd20: 00000000 c11d59c0 ee96ddb8 ee96dd5c ee96dd50 ee96dd50 c0b6955c c0b69560
dd40: 60010113 ffffffff
[<c010c994>] (__irq_svc) from [<c0b69560>] (_raw_spin_unlock_irq+0x1c/0x20)
[<c0b69560>] (_raw_spin_unlock_irq) from [<c07515b0>] (usb_giveback_urb_bh+0xc0/0xc8)
[<c07515b0>] (usb_giveback_urb_bh) from [<c0122b48>] (tasklet_action+0x9c/0xfc)
[<c0122b48>] (tasklet_action) from [<c0122058>] (__do_softirq+0x138/0x31c)
[<c0122058>] (__do_softirq) from [<c01224f8>] (irq_exit+0x98/0x108)
[<c01224f8>] (irq_exit) from [<c016ed98>] (__handle_domain_irq+0x9c/0xc0)
[<c016ed98>] (__handle_domain_irq) from [<c0101450>] (gic_handle_irq+0x5c/0xa4)
[<c0101450>] (gic_handle_irq) from [<c010c994>] (__irq_svc+0x54/0x70)
Exception stack(0xee96de70 to 0xee96deb8)
de60: eef923c0 00000000 00000002 ed660e28
de80: ed660e00 00000000 eef923c0 ee943840 ee940c80 c0b66040 c0c02218 ee96df04
dea0: ee96de70 ee96dec0 c0b6955c c0b65cd4 80010013 ffffffff
[<c010c994>] (__irq_svc) from [<c0b65cd4>] (__schedule+0x470/0x730)
[<c0b65cd4>] (__schedule) from [<c0b66040>] (schedule+0xac/0xbc)
[<c0b66040>] (schedule) from [<c013e378>] (smpboot_thread_fn+0x16c/0x19c)
[<c013e378>] (smpboot_thread_fn) from [<c013a3bc>] (kthread+0xf8/0x10c)
[<c013a3bc>] (kthread) from [<c01074f8>] (ret_from_fork+0x14/0x3c)
rockchip-thermal ff280000.tsadc: channal 0: temperature(43 C)
THERMAL REGS:
00000000: 00000200 00030061 00000602 00000000 00000000 00000000 00000000 00000000
00000020: 00000000 00000e2d 00000e37 00000000 00000000 00000deb 00000000 00000000
00000040: 00000000 00000db6 00000db6 00000000 00000000 00000000 00000000 00000000
00000060: 00000004 00000004 000000fa 00000032 00000000 00000000 00000000 00000000
00000080: 00000000 00000000
---[ end Kernel panic - not syncing: hung_task: blocked tasks
------------[ cut here ]------------
WARNING: CPU: 3 PID: 29 at kernel/watchdog.c:352 watchdog_timer_fn+0xf8/0x344()
Watchdog detected hard LOCKUP on cpu 0
Modules linked in: ch341 cp210x
CPU: 3 PID: 29 Comm: khungtaskd Not tainted 4.4.55 #2
Hardware name: Rockchip (Device Tree)
[<c01103a8>] (unwind_backtrace) from [<c010be58>] (show_stack+0x20/0x24)
[<c010be58>] (show_stack) from [<c03ff91c>] (dump_stack+0x88/0xa8)
[<c03ff91c>] (dump_stack) from [<c011de5c>] (warn_slowpath_common+0x98/0xc4)
[<c011de5c>] (warn_slowpath_common) from [<c011dec8>] (warn_slowpath_fmt+0x40/0x48)
[<c011dec8>] (warn_slowpath_fmt) from [<c01ab098>] (watchdog_timer_fn+0xf8/0x344)
[<c01ab098>] (watchdog_timer_fn) from [<c01823fc>] (__hrtimer_run_queues+0x1b0/0x324)
[<c01823fc>] (__hrtimer_run_queues) from [<c0182ccc>] (hrtimer_interrupt+0xac/0x1f8)
[<c0182ccc>] (hrtimer_interrupt) from [<c086f978>] (arch_timer_handler_phys+0x38/0x40)
[<c086f978>] (arch_timer_handler_phys) from [<c01730ac>] (handle_percpu_devid_irq+0xd8/0x1ac)
[<c01730ac>] (handle_percpu_devid_irq) from [<c016ea54>] (generic_handle_irq+0x28/0x38)
[<c016ea54>] (generic_handle_irq) from [<c016ed94>] (__handle_domain_irq+0x98/0xc0)
[<c016ed94>] (__handle_domain_irq) from [<c0101450>] (gic_handle_irq+0x5c/0xa4)
[<c0101450>] (gic_handle_irq) from [<c010c994>] (__irq_svc+0x54/0x70)
Exception stack(0xeeb61e08 to 0xeeb61e50)
1e00: eeb61e94 00000005 fd640800 c086f8d0 eeb61e94 00005dbf
1e20: 00008a48 c11d5568 0000004f c1154604 00000399 eeb61e64 eeb61e68 eeb61e58
1e40: c010f730 c086f8e0 a00e0013 ffffffff
[<c010c994>] (__irq_svc) from [<c086f8e0>] (arch_counter_get_cntpct+0x10/0x18)
[<c086f8e0>] (arch_counter_get_cntpct) from [<c010f730>] (arch_timer_read_counter_long+0x20/0x28)
[<c010f730>] (arch_timer_read_counter_long) from [<c03fd7d0>] (read_current_timer+0x30/0x48)
[<c03fd7d0>] (read_current_timer) from [<c03fd820>] (__timer_delay+0x38/0x5c)
[<c03fd820>] (__timer_delay) from [<c03fd870>] (__timer_const_udelay+0x2c/0x34)
[<c03fd870>] (__timer_const_udelay) from [<c01e37fc>] (panic+0x1c8/0x204)
[<c01e37fc>] (panic) from [<c01aa8fc>] (watchdog+0x32c/0x398)
[<c01aa8fc>] (watchdog) from [<c013a3bc>] (kthread+0xf8/0x10c)
[<c013a3bc>] (kthread) from [<c01074f8>] (ret_from_fork+0x14/0x3c)
---[ end trace 7bec9f8eaf3d9ec7 ]---
The log above contains the message "Watchdog detected hard LOCKUP on cpu 0". Is the hang caused by a lockup?
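To make a log like the one above easier to compare across several crashes, it can be condensed to a few fields: the task reported as hung, the CPU named in the hard-lockup warning, and for each dumped CPU the function that was interrupted when the backtrace was taken. The following is only a sketch. The regular expressions are assumptions based on the format of this particular 4.4.55 log, and the names `summarize_panic` and `SAMPLE` are made up for illustration. It does not identify a root cause.

```python
import re

# Patterns matched against the line formats seen in the log above (assumed, not general).
HUNG_RE = re.compile(r"INFO: task (\S+):(\d+) blocked for more than (\d+) seconds")
CPU_RE = re.compile(r"CPU: (\d+) PID: (\d+) Comm: (\S+)")
LOCKUP_RE = re.compile(r"Watchdog detected hard LOCKUP on cpu (\d+)")
IRQ_RE = re.compile(r"\(__irq_svc\) from \[<[0-9a-f]+>\] \((\w+)\+")

# A few lines copied from the serial log, used only as demo input.
SAMPLE = """\
firefly login: INFO: task sd-resolve:597 blocked for more than 120 seconds.
Kernel panic - not syncing: hung_task: blocked tasks
CPU: 3 PID: 29 Comm: khungtaskd Not tainted 4.4.55 #2
CPU1: stopping
CPU: 1 PID: 2137 Comm: robot_pose_ekf Not tainted 4.4.55 #2
[<c010c994>] (__irq_svc) from [<c0b69500>] (_raw_spin_lock_bh+0x30/0x34)
CPU0: stopping
CPU: 0 PID: 10 Comm: watchdog/0 Not tainted 4.4.55 #2
[<c010c994>] (__irq_svc) from [<c0b69560>] (_raw_spin_unlock_irq+0x1c/0x20)
[<c0b69560>] (_raw_spin_unlock_irq) from [<c07515b0>] (usb_giveback_urb_bh+0xc0/0xc8)
Watchdog detected hard LOCKUP on cpu 0
"""


def summarize_panic(log_text):
    """Collect the hung task, the hard-lockup CPU and one entry per CPU dump header."""
    summary = {"hung_task": None, "lockup_cpu": None, "cpus": []}
    current = None  # entry for the most recent "CPU: N PID: P Comm: X" header
    for raw in log_text.splitlines():
        line = raw.strip()
        m = HUNG_RE.search(line)
        if m:
            summary["hung_task"] = {
                "comm": m.group(1),
                "pid": int(m.group(2)),
                "seconds": int(m.group(3)),
            }
            continue
        m = LOCKUP_RE.search(line)
        if m:
            summary["lockup_cpu"] = int(m.group(1))
            continue
        m = CPU_RE.match(line)  # match() skips "WARNING: CPU: ..." lines
        if m:
            current = {
                "cpu": int(m.group(1)),
                "pid": int(m.group(2)),
                "comm": m.group(3),
                "interrupted_in": None,
            }
            summary["cpus"].append(current)
            continue
        m = IRQ_RE.search(line)
        # Keep only the first (innermost) interrupted function of each dump.
        if m and current is not None and current["interrupted_in"] is None:
            current["interrupted_in"] = m.group(1)
    return summary


if __name__ == "__main__":
    for entry in summarize_panic(SAMPLE)["cpus"]:
        print(entry)
```

Run against the full serial capture instead of `SAMPLE`, this gives one line per CPU dump, which makes it easier to see whether the same task and the same interrupted function show up in every crash.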
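Some additional information for reference:
2. RK3288 temperature at the time of the hang: the chip temperature measured after the hang was 73.9°C. Is that too high?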
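The panic log itself reports 43 C from the `rockchip-thermal` tsadc channel 0, while the external measurement was 73.9°C, so it may help to record what the kernel sees while the project is running. A minimal sketch, assuming the kernel exposes the standard Linux thermal sysfs interface under `/sys/class/thermal` and reports temperatures in millidegrees Celsius. The function name `read_zone_temps` is hypothetical, and whether this 4.4.55 build exposes the zones this way has not been checked.

```python
import glob
import os


def read_zone_temps(base="/sys/class/thermal"):
    """Return {zone_name: degrees_celsius} for each readable thermal zone under base."""
    temps = {}
    for zone in sorted(glob.glob(os.path.join(base, "thermal_zone*"))):
        try:
            with open(os.path.join(zone, "temp")) as f:
                raw = f.read().strip()
            # Assumption: the value is in millidegrees Celsius.
            temps[os.path.basename(zone)] = int(raw) / 1000.0
        except (OSError, ValueError):
            # Skip zones that cannot be read or do not contain a number.
            continue
    return temps


if __name__ == "__main__":
    for name, celsius in read_zone_temps().items():
        print(name, celsius)
```

Logging these values once per second until the hang would show whether the kernel-side reading climbs toward the externally measured value before the crash.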
3. CPU usage before the hang: on CPU0, software interrupts (Software IRQ) took 100% of the CPU time. Is that abnormal?
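The softirq percentage can also be computed directly from two snapshots of `/proc/stat`, which allows it to be logged over time instead of read off a screenshot. This is a sketch under the assumption that the per-CPU lines use the usual field order (user, nice, system, idle, iowait, irq, softirq, steal, guest, guest_nice). The function names are hypothetical.

```python
def parse_proc_stat(text):
    """Map 'cpuN' to its list of jiffy counters, skipping the aggregate 'cpu' line."""
    counters = {}
    for line in text.splitlines():
        fields = line.split()
        if fields and fields[0].startswith("cpu") and fields[0] != "cpu":
            counters[fields[0]] = [int(v) for v in fields[1:]]
    return counters


def softirq_share(before, after):
    """Percent of elapsed jiffies spent in softirq per CPU between two snapshots."""
    first, second = parse_proc_stat(before), parse_proc_stat(after)
    shares = {}
    for cpu, old in first.items():
        if cpu not in second:
            continue
        delta = [new - prev for prev, new in zip(old, second[cpu])]
        # Sum user..steal only; guest time is already counted inside user/nice.
        total = sum(delta[:8])
        if total > 0:
            shares[cpu] = 100.0 * delta[6] / total  # index 6 is the softirq field
    return shares
```

To use it, read `/proc/stat` twice about a second apart and pass both strings to `softirq_share`. If one CPU stays near 100%, `/proc/softirqs` breaks the counts down by type (for example NET_RX or TASKLET), which may help narrow down which driver is generating the load.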
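The serial console message "Kernel panic - not syncing: hung_task: blocked tasks" shows that a kernel error occurred. How should I handle this and track it down? I am only doing application development on an existing RK3288 board.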
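One thing that can be checked from user space is how the hung-task detector is configured. The log mentions `/proc/sys/kernel/hung_task_timeout_secs`, and the panic text "hung_task: blocked tasks" is what the kernel prints when the `hung_task_panic` sysctl is non-zero and a task stays blocked longer than that timeout. The sketch below only reads the two values. The function name `read_hung_task_settings` is hypothetical, and it does not explain why `sd-resolve` was blocked in the first place.

```python
import os


def read_hung_task_settings(base="/proc/sys/kernel"):
    """Return the hung-task sysctls as integers, or None where a value is unavailable."""
    settings = {}
    for name in ("hung_task_timeout_secs", "hung_task_panic"):
        try:
            with open(os.path.join(base, name)) as f:
                settings[name] = int(f.read().strip())
        except (OSError, ValueError):
            # The file is missing (detector not built in) or not readable.
            settings[name] = None
    return settings


if __name__ == "__main__":
    print(read_hung_task_settings())
```

With `hung_task_panic` at 0 the kernel would only log the blocked task instead of panicking, which may keep the board reachable long enough to inspect it, but the underlying stall would still be there.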