Adds core dump support via CrashCatcher #52

salkinium · 2018-08-02T22:07:39Z

This replaces the xpcc UART debug functionality on HardFault with something way more useful: post-mortem debugging via CrashDebug.

This relatively simple application just uses CrashCatcher's HexDump behavior to dump all RAM via UART, which can take a long time, since the memories are quite large and UART is quite slow at 115200 baud.
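To put a number on "a lot of time", here is a back-of-the-envelope sketch. It assumes 8N1 framing (10 bits on the wire per character) and two hex characters per dumped byte; the 192 kiB RAM size is a hypothetical example, and addresses, separators and line endings of the actual dump format are ignored, so a real dump would be somewhat slower.

```cpp
#include <cstdint>

// Rough estimate of the transfer time of a hex dump over UART.
// Assumptions (not taken from the PR): 10 bits per character on the wire
// and 2 hex characters per byte of memory, no other formatting overhead.
constexpr double estimateDumpSeconds(uint32_t ramBytes, uint32_t baudrate)
{
	constexpr uint32_t bitsPerChar = 10;  // start bit + 8 data bits + stop bit
	constexpr uint32_t charsPerByte = 2;  // e.g. "A5" encodes one byte
	return static_cast<double>(ramBytes) * charsPerByte * bitsPerChar / baudrate;
}

// Hypothetical device with 192 kiB of RAM, dumped at 115200 baud.
constexpr double seconds = estimateDumpSeconds(192 * 1024, 115200);
static_assert(seconds > 34.0 && seconds < 35.0, "roughly half a minute");
```

Under these assumptions the raw payload alone takes about half a minute, which is long enough that the device should be put into a safe state first.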

Since the dump can take a long time, I would implement a custom dump process similar to HexDump, but with an optional user callback to set the device into a safe configuration.

Custom dump process with an optional callback on entry to make the device safe.
Dump peripherals too, if possible.
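The TODO items above could look roughly like the following sketch. All names here (`SafeStateCallback`, `MemoryRegion`, `dumpRegions`, `ByteSink`) are hypothetical and are not part of CrashCatcher or modm; the point is only that the optional callback runs once on entry, before the slow dump starts.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical types for illustration only.
using SafeStateCallback = void (*)();
using ByteSink = void (*)(uint8_t byte);

struct MemoryRegion
{
	const uint8_t *start;
	size_t length;
};

// Optional hook; stays nullptr if the application has nothing to shut down.
static SafeStateCallback safeStateCallback = nullptr;

void dumpRegions(const MemoryRegion *regions, size_t count, ByteSink sink)
{
	// Make the device safe first (e.g. disable motor drivers), because
	// the dump itself may take a long time.
	if (safeStateCallback != nullptr)
		safeStateCallback();

	// Then emit every byte of every region through the sink.
	for (size_t ii = 0; ii < count; ii++)
		for (size_t jj = 0; jj < regions[ii].length; jj++)
			sink(regions[ii].start[jj]);
}
```

Peripheral regions could be added to the region list as well, but they may need wider accesses than the byte reads used in this sketch.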

cc @rleh @strongly-typed

rleh · 2018-08-03T01:06:02Z

src/modm/platform/fault/minimal/module.lb

@@ -15,7 +15,7 @@ import os

 def init(module):
    module.parent = "platform"
-    module.name = "fault.cortex"
+    module.name = "fault.minimal"


Is there a reason for renaming this from cortex to minimal?
This module is completely ARM Cortex-M specific.

So is fault.coredump. I'd prefer not to encode the device id into the module name, since the modules themselves decide which devices to support. Maybe on a different platform the fault.minimal module could be implemented differently (Cortex-A, RISC-V, Xtensa) but with the same functionality.

Btw, this is how all of the modm:platform:** modules work: the modm:platform:gpio module for AVRs implements the same functionality as the one for STM32, but they don't conflict, since only one is enabled per platform.

rleh · 2018-08-03T01:31:55Z

Fault handler use cases:

Eurobot robot main control unit/...
- CrashDumper dumps microcontroller state via UART into logging/debug computer. Blinking LED notifies user.
Eurobot robot slave motor controller/...
- Blinking LED to notify, but maybe not visible from outside the robot -> Immediate reset, except if debugger is connected: trigger breakpoint.
- Better:
  - Write crash information to flash memory, then reset. Software can detect previous fault during startup and do something (e.g. CAN communication).
  - Fault handler sends special CAN message, then reset.
  - Some devices may not have an accessible UART. (e.g. micro-motor)
Freestanding device using modm/...
- Use cases above, or maybe just wait and toggle a failure LED?
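The "write crash information, reset, detect it during startup" idea from the list above can be sketched as follows. The `CrashRecord` layout, the magic value and both function names are hypothetical. On a real device the record would have to live in storage that survives a reset (for example a reserved flash page, which needs proper erase/program routines), whereas here it is a plain struct so the logic can be tried on a host.

```cpp
#include <cstdint>

// Hypothetical crash record for illustration only.
struct CrashRecord
{
	uint32_t magic;
	uint32_t programCounter;
	uint32_t linkRegister;
};

constexpr uint32_t CrashMagic = 0xDEADC0DE;

// Would be called from the fault handler right before resetting.
void storeCrash(CrashRecord &record, uint32_t pc, uint32_t lr)
{
	record.programCounter = pc;
	record.linkRegister = lr;
	record.magic = CrashMagic;  // marks the record as valid
}

// Would be called during startup: reports a stored crash exactly once.
bool consumeCrash(CrashRecord &record, uint32_t &pc, uint32_t &lr)
{
	if (record.magic != CrashMagic)
		return false;
	pc = record.programCounter;
	lr = record.linkRegister;
	record.magic = 0;  // invalidate, so the next boot starts clean
	return true;
}
```

After a successful `consumeCrash()` the application can react however it likes, for example by announcing the previous fault over CAN.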

Always useful: optional user callback to set the device into a safe configuration.

I cannot see a use case for the "wait" mode right now.

chris-durand · 2018-08-03T02:42:21Z

src/modm/platform/fault/crashcatcher/crashcatcher_handler.cpp.in

+			const uint16_t *pMemory = (const uint16_t *) pvMemory;
+			for (size_t ii = 1; ii <= elementCount; ii++)
+			{
+				uint16_t val = *pMemory++;


To be pedantic, dereferencing pMemory here is undefined behaviour. According to the language rules commonly referred to as "strict aliasing rules", pMemory has to point to an actual (u)int16_t object to be safe to dereference (https://en.cppreference.com/w/cpp/language/reinterpret_cast#Type_aliasing). Although every sensible compiler will probably generate the expected code, this pitfall is easy to avoid here. There is an exception to these rules for (unsigned) char and std::byte, which may alias any type. If I am not overlooking something, it is possible to define pMemory as uint8_t* and to pass it to dumpMemory directly.
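A minimal sketch of the byte-wise variant suggested here, assuming a hypothetical `ByteSink` output function in place of the real dump routine. It reads through `unsigned char`, which is one of the types the aliasing exception names, and it performs byte-wide accesses only.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for the routine that emits one dumped byte.
using ByteSink = void (*)(uint8_t byte);

void dumpBytes(const void *pvMemory, size_t byteCount, ByteSink sink)
{
	// unsigned char may alias an object of any type, so reading through
	// this pointer is well-defined whatever pvMemory really points to.
	const unsigned char *pMemory = static_cast<const unsigned char *>(pvMemory);
	for (size_t ii = 0; ii < byteCount; ii++)
		sink(pMemory[ii]);
}
```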

> If I am not overlooking something, it is possible to define pMemory as uint8_t* and to pass it to dumpMemory directly.

Some memories can only be read with 16- or 32-bit accesses (mostly the peripherals), hence this annoyingly duplicated code. I can't change the const void *pvMemory, since that's the callback API from CrashCatcher, but how would you do this correctly?

> how would you do this correctly?

To me it does not make sense at all to apply these rules to memory-mapped IO. The rules deal with accessing C++ objects in memory through a pointer of a different type. Applying the concept of a C++ object to some piece of hardware is totally nonsensical. There is no existing C++ object in memory that the compiler knows of. Hence, the only practical solution is to leave the code as it is.

However, I don't think there is any guarantee that the intended 16-bit access will actually be performed as one. I haven't looked much into the assembly output of ARM compilers, but on modern x86 platforms simple byte-copying operations in loops often turn into fancy AVX instructions.
Using a pointer to volatile for the read would clearly state to the compiler: I want a 16-bit read and I want it now. I suppose adding volatile would not impede wanted optimizations. As the pointer itself is not volatile but the data it points to, operations on the pointer will still be optimized. I compiled a few toy examples for cortex-m4 with -Os and the resulting assembly looked almost identical regardless of whether volatile is added or not.
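As a sketch of what the volatile variant could look like: the loop below mirrors the reviewed code, but reads through a pointer to `const volatile uint16_t`. `HalfWordSink` is a hypothetical stand-in for the output routine. Because every volatile access counts as an observable side effect, the compiler may not drop or merge the reads; that each one becomes a single half-word load is what compilers do in practice for aligned addresses rather than something the language spells out.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for the routine that emits one dumped half-word.
using HalfWordSink = void (*)(uint16_t value);

void dumpHalfWords(const void *pvMemory, size_t elementCount, HalfWordSink sink)
{
	// The pointee is volatile, the pointer itself is not, so the
	// pointer arithmetic can still be optimized freely.
	const volatile uint16_t *pMemory =
			static_cast<const volatile uint16_t *>(pvMemory);
	for (size_t ii = 0; ii < elementCount; ii++)
		sink(*pMemory++);
}
```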

Good observation!
Another way to get around UB could be to add a read16 function coded in assembly that includes the correct load instruction.
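One possible shape for such a helper, as a sketch only: `read16` is a hypothetical name, the inline assembly uses GCC/Clang extended asm syntax, and the `#else` branch is merely a host fallback so the snippet compiles off-target. On 32-bit ARM the load is spelled out as `ldrh`, so the access width no longer depends on what the optimizer chooses.

```cpp
#include <cstdint>

// Hypothetical helper, not part of CrashCatcher or modm.
inline uint16_t read16(const void *address)
{
#if defined(__GNUC__) && defined(__arm__)
	uint32_t value;
	// LDRH loads one unsigned half-word and zero-extends it to 32 bits.
	// The "l" constraint selects r0-r7, which also works for Thumb-1 cores.
	asm volatile ("ldrh %0, [%1]" : "=l" (value) : "l" (address) : "memory");
	return static_cast<uint16_t>(value);
#else
	// Host fallback for off-target builds: a plain volatile read.
	return *static_cast<const volatile uint16_t *>(address);
#endif
}
```

A matching `read32` using `ldr` would follow the same pattern.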

Added it to the TODO list in the new PR.

salkinium · 2019-05-10T18:29:28Z

Closed in favor of #210.

salkinium force-pushed the feature/crash_catcher branch 2 times, most recently from 79bbfc8 to 002aec5 Compare August 3, 2018 00:18

rleh reviewed Aug 3, 2018

chris-durand reviewed Aug 3, 2018

salkinium added 10 commits August 5, 2018 01:20

[build] Extract common code and compiler flags

0248757

This allows for easier sharing of common settings between different build script generators.

[build] Move build.{scons, cmake} to submodules

69b7830

This allows specifying common options like project name and build path together and not having to duplicate them for both generators.

[build] Adapt examples, tests for :build:* modules

45bab58

[driver] Fix Bmp085 compilation for gcc 8

2935306

[ci] Beautify job descriptions

f867b7f

[fault] Simplify fault.minimal and adapt examples

4ffb298

Adds these behaviors on hard fault:
- reset device.
- force breakpoint.
- loop forever.
- flash LED every 1s.

Removes switching to process stack, so on stack overflow, only the first three behaviors will work.

[ext] Add adamgreen/CrashCatcher submodule

abf0fd5

[fault] Add coredump module using CrashCatcher

1ba98eb

[example] Add :platform:fault.coredump example

6a5770a

[test/all] Manually select platform modules

3770838

Selecting all modules through :platform:** is not possible anymore with the introduction of conflicting :platform:fault.* modules.

salkinium force-pushed the feature/crash_catcher branch from 002aec5 to 3770838 Compare August 5, 2018 01:12

salkinium force-pushed the develop branch from 738dfa4 to 1b36326 Compare January 23, 2019 19:35

salkinium force-pushed the develop branch 4 times, most recently from 3da698d to 01092cf Compare March 8, 2019 23:04

salkinium added stale ♾ advanced 🤯 feature 🚧 labels Apr 19, 2019

salkinium closed this May 10, 2019

salkinium deleted the feature/crash_catcher branch May 10, 2019 18:46

salkinium mentioned this pull request May 30, 2019

Add GNU Build ID to identify firmware #219

Merged

salkinium removed the stale ♾ label Aug 1, 2020
