Kernel panic on A20 5.8.18

Started by pac, January 28, 2021, 10:14:26 AM

Previous topic - Next topic

pac

Hi All,
I got kernel panic issues on A20 LIME with the following messages:

[14403.828216] Kernel panic - not syncing: stack-protector: Kernel stack is corrupted in: wait_consider_task+0x9bb/0x9cc
[14403.838832] CPU: 1 PID: 11320 Comm: (lightdm) Not tainted 5.8.18-olimex #140443
[14403.846129] Hardware name: Allwinner sun7i (A20) Family
[14403.851370] [<c010cc71>] (unwind_backtrace) from [<c0109239>] (show_stack+0x11/0x14)
[14403.859112] [<c0109239>] (show_stack) from [<c060be2b>] (dump_stack+0x77/0x84)
[14403.866334] [<c060be2b>] (dump_stack) from [<c011ad49>] (panic+0xcd/0x24c)
[14403.873212] [<c011ad49>] (panic) from [<c0a00489>] (printk_nmi_enter+0x1/0x20)
[14404.919318] SMP: failed to stop secondary CPUs
[14404.923767] Rebooting in 10 seconds..
[14415.966723] SMP: failed to stop secondary CPUs

the issue happens once a day.
Could anybody help me ?
Thanks in advance
Paolo

LubOlimex

What image exactly are you using?

Did you try the latest ones from here:

http://images.olimex.com/release/a20/
Technical support and documentation manager at Olimex

Strela17

I can confirm that this can happen if you've downloaded some incorrect/corrupt/outdated files, if you get the latest images, it'll solve your problem.

pac

#3
The problem happens on image A20-OLinuXino-bionic-base-20201105-143953.img

I try to use the latest A20-OLinuXino-focal-base-20210127-100834.img, but that image does not start completely on A20-LIME boards: the last kernel message I see is :

[ OK ] Started ifup for eth0.

the ETHERNER cable is connected and both ETHERNET  leds blinks

The same also using the image A20-OLinuXino-buster-base-20210127-100834.img

Best Regards,
Paolo

LubOlimex

Can you tell me what exactly A20-OLinuXino-LIME board do you have - what is the variant and hardware revision?

I will get the same board and test here to confirm if there is some problem specifically with that board in the board definitions.

If you have trouble identifying the board, please upload two pictures of top and bottom and upload them to a free image/file hosting web-site and post the links.
Technical support and documentation manager at Olimex

pac


LubOlimex

When I ran the A20-OLinuXino-focal-base-20210127-100834.img the first time I had the same hang around the same spot:

[  OK  ] Started ifup for eth0.
[  OK  ] Finished Olinuxino Expand Root Partition.
[  OK  ] Reached target Basic System.
         Starting Accounts Service...

After that hang I removed the power supply and re-applied it after 10 seconds. The second time the board booted fine.

Can you try again A20-OLinuXino-focal-base-20210127-100834.img and when it hangs, power down and power up the board again? Does it boot the second time?

Meanwhile I will test the buster too.



Technical support and documentation manager at Olimex

LubOlimex

I can confirm that A20-OLinuXino-LIME does not boot with A20-OLinuXino-buster-base-20210127-100834.img.7z

The first boot is stops at:

[  OK  ] Reached target Timers.
[  OK  ] Found device /sys/subsystem/net/devices/eth0.
[  OK  ] Started ifup for eth0.
[  OK  ] Started Olinuxino Expand Root Partition.

the second boot it stops at:


[  OK  ] Reached target Sockets.
[  OK  ] Started Daily man-db regeneration.
[  OK  ] Started Daily apt download activities.
[  OK  ] Started Daily apt upgrade and clean activities.
[  OK  ] Reached target Timers.
[   15.911498] lcd_olinuxino 2-0050: error reading from device at 00
[  OK  ] Found device /dev/ttyS0.
[  OK  ] Started Raise network interfaces.
[  OK  ] Created slice system-systemd\x2dbacklight.slice.
         Starting Load/Save Screen ...ness of backlight:backlight...
[  OK  ] Started ifup for eth0.
[  OK  ] Started Load/Save Screen B...htness of backlight:backlight.
Technical support and documentation manager at Olimex

pac

I try again with A20-OLinuXino-focal-base-20210127-100834.img but the last message I can see is always

[  OK  ] Started ifup for eth0.

after that message I wait some minutes but I don't see anything more.
Then I powerdown, wait 10 sec. and powerup, but the board still non terminate the boot, and the last message I see is :

[  OK  ] Started Modem Manager.

Then I try again to powerdown, wait 20 sec, then powerup and the boot stops with the following messages:

[  OK  ] Started LSB: Load kernel m...ded to enable cpufreq scaling
         Starting LSB: set CPUfreq kernel parameters...
[   19.639209] lcd_olinuxino 2-0050: error reading from device at 00



But I have not LCD connected.




LubOlimex

It doesn't matter if you have LCD attached or not, I tested with LCD attached, still the Debian didn't work.

I've forwarded the info to the development team, they are looking into it.
Technical support and documentation manager at Olimex

pac


LubOlimex

For the moment one workaround is to use the previous image and upgrade it via Internet with:

# sudo apt dist-upgrade

You can find previous releases here:

http://images.olimex.com/release-old/a20/
Technical support and documentation manager at Olimex

LubOlimex

There are new releases today, the A20-OLinuXino-LIME should now work fine:

http://images.olimex.com/release/a20/
Technical support and documentation manager at Olimex

pac


pac

I try the new image, but I get an error on powerdown.
This is the stack trace: Hope this helps.


[  OK  ] Reached target Power-Off.
[  217.068652] reboot: Power down
[  217.071799] ------------[ cut here ]------------
[  217.076451] WARNING: CPU: 0 PID: 1 at drivers/i2c/i2c-core.h:41 i2c_transfer+0x108/0x114
[  217.084535] No atomic I2C transfer handler for 'i2c-0'
[  217.089669] Modules linked in: fuse rfkill evdev axp20x_adc industrialio at24 lima gpu_sched sun4i_ss l
ibdes sun4i_drm_hdmi sunxi_cedrus(C) v4l2_mem2mem videobuf2_dma_contig videobuf2_memops videobuf2_v4l2 vid
eobuf2_common videodev mc uio_pdrv_genirq uio cpufreq_dt sch_fq_codel sun4i_ts hwmon usb_f_acm u_serial g_
serial libcomposite ip_tables x_tables
[  217.120920] CPU: 0 PID: 1 Comm: systemd-shutdow Tainted: G         C        5.10.12-olimex #211118
[  217.129869] Hardware name: Allwinner sun7i (A20) Family
[  217.135094] Backtrace:
[  217.137566] [<c0a82514>] (dump_backtrace) from [<c0a828dc>] (show_stack+0x20/0x24)
[  217.145137]  r7:00000029 r6:600f0093 r5:00000000 r4:c0facb3c
[  217.150805] [<c0a828bc>] (show_stack) from [<c0a85de8>] (dump_stack+0xa4/0xb8)
[  217.158034] [<c0a85d44>] (dump_stack) from [<c0123eb0>] (__warn+0xfc/0x114)
[  217.164996]  r7:00000029 r6:00000009 r5:c07e23cc r4:c0cd4564
[  217.170661] [<c0123db4>] (__warn) from [<c0a82fd8>] (warn_slowpath_fmt+0xa4/0xc0)
[  217.178142]  r7:00000029 r6:c0cd4564 r5:c0f04fc8 r4:c0cd4604
[  217.183807] [<c0a82f38>] (warn_slowpath_fmt) from [<c07e23cc>] (i2c_transfer+0x108/0x114)
[  217.191985]  r9:00000032 r8:00000001 r7:00000000 r6:00000001 r5:c14f7d30 r4:c21314a8
[  217.199728] [<c07e22c4>] (i2c_transfer) from [<c07e2434>] (i2c_transfer_buffer_flags+0x5c/0x80)
[  217.208421]  r6:c2104741 r5:00000002 r4:c0f04fc8
[  217.213053] [<c07e23d8>] (i2c_transfer_buffer_flags) from [<c0692528>] (regmap_i2c_write+0x24/0x40)
[  217.222092]  r5:00000001 r4:00000002
[  217.225679] [<c0692504>] (regmap_i2c_write) from [<c068d438>] (_regmap_raw_write_impl+0x6e4/0x82c)
[  217.234632]  r5:00000001 r4:c2107e00
[  217.238218] [<c068cd54>] (_regmap_raw_write_impl) from [<c068d608>] (_regmap_bus_raw_write+0x88/0xb0)
[  217.247436]  r10:00000058 r9:c0f0ed28 r8:6637e700 r7:00000080 r6:00000032 r5:c0688b84
[  217.255260]  r4:c2107e00
[  217.257805] [<c068d580>] (_regmap_bus_raw_write) from [<c068ca84>] (_regmap_write+0x6c/0x124)
[  217.266327]  r7:c2107e00 r6:00000080 r5:00000032 r4:c2107e00
[  217.271993] [<c068ca18>] (_regmap_write) from [<c068e218>] (regmap_write+0x4c/0x6c)
[  217.279651]  r9:c0f0ed28 r8:6637e700 r7:fee1dead r6:00000080 r5:00000032 r4:c2107e00
[  217.287399] [<c068e1cc>] (regmap_write) from [<c069cf60>] (axp20x_power_off+0x3c/0x48)
[  217.295314]  r7:fee1dead r6:00000000 r5:4321fedc r4:c0f04fc8
[  217.300981] [<c069cf24>] (axp20x_power_off) from [<c010a9dc>] (machine_power_off+0x34/0x38)
[  217.309335] [<c010a9a8>] (machine_power_off) from [<c014ce38>] (kernel_power_off+0x7c/0x80)
[  217.317685] [<c014cdbc>] (kernel_power_off) from [<c014cf90>] (__do_sys_reboot+0x154/0x1e8)
[  217.326036] [<c014ce3c>] (__do_sys_reboot) from [<c014d094>] (sys_reboot+0x18/0x1c)
[  217.333695]  r9:c14f6000 r8:c0100264 r7:00000058 r6:beb2bb44 r5:beb2bb48 r4:4321fedc
[  217.341440] [<c014d07c>] (sys_reboot) from [<c0100060>] (ret_fast_syscall+0x0/0x54)
[  217.349091] Exception stack(0xc14f7fa8 to 0xc14f7ff0)
[  217.354149] 7fa0:                   4321fedc beb2bb48 fee1dead 28121969 4321fedc 6637e700
[  217.362327] 7fc0: 4321fedc beb2bb48 beb2bb44 00000058 beb2bb48 beb2bb44 fffff000 beb2bb4c
[  217.370500] 7fe0: 00000058 beb2babc b0a5ad65 b09d9be6
[  217.375552] ---[ end trace ade2dcec75cf139e ]---