Skip to content

Commit

Permalink
⚠️ [rte] use a single, global trap handler table (#1150)
Browse files Browse the repository at this point in the history
  • Loading branch information
stnolting authored Jan 11, 2025
2 parents 9b68c61 + 547d87e commit f5bdf69
Show file tree
Hide file tree
Showing 10 changed files with 437 additions and 268 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,6 +29,7 @@ mimpid = 0x01040312 -> Version 01.04.03.12 -> v1.4.3.12

| Date | Version | Comment | Ticket |
|:----:|:-------:|:--------|:------:|
| 11.01.2025 | 1.10.9.4 | :warning: RTE: use a single, global trap handler table that applies to _both_ cores | [#1150](https://github.com/stnolting/neorv32/pull/1150) |
| 10.01.2025 | 1.10.9.3 | split functional behavior of `fence` and `fence.i` instructions | [#1149](https://github.com/stnolting/neorv32/pull/1149) |
| 10.01.2025 | 1.10.9.2 | clean-up SMP dual-core configuration (HW and SW optimizations) | [#1146](https://github.com/stnolting/neorv32/pull/1146) |
| 09.01.2025 | 1.10.9.1 | fix side-effects of CSR read instructions | [#1145](https://github.com/stnolting/neorv32/pull/1145) |
Expand Down
27 changes: 18 additions & 9 deletions docs/datasheet/cpu_dual_core.adoc
Original file line number Diff line number Diff line change
Expand Up @@ -26,17 +26,18 @@ The following table summarizes the most important aspects when using the dual-co
|=======================
| **CPU configuration** | Both cores use the same cache, CPU and ISA configuration provided by the according top generics.
| **Debugging** | A special SMP openOCD script (`sw/openocd/openocd_neorv32.dual_core.cfg`) is required to
debug both cores at one. SMP-debugging is fully supported by the RISC-V gdb port.
debug both cores at once. SMP-debugging is fully supported by the RISC-V gdb port.
| **Clock and reset** | Both cores use the same global processor clock and reset. If <<_cpu_clock_gating>>
is enabled, the clock of each core can be individually halted by putting the core into <<_sleep_mode>>.
| **Address space** | Both cores have full access to the same physical <<_address_space>>.
| **Interrupts** | All <<_processor_interrupts>> are routed to both cores. Hence, each core has access to
all <<_neorv32_specific_fast_interrupt_requests>> (FIRQs). Additionally, the RISC-V machine-level _external
interrupt_ (via the top `mext_irq_i` port) is also send to both cores. In contrast, the RISC-V machine level
_software_ and _timer_ interrupts are core-exclusive (provided by the <<_core_local_interruptor_clint>>).
| **RTE** | The <<_neorv32_runtime_environment>> fully supports the dual-core configuration and provides
core-individual trap handler tables. However, the RTE needs to be explicitly initialized on each core
(executing `neorv32_rte_setup()`).
| **RTE** | The <<_neorv32_runtime_environment>> can be used for both cores. However, the RTE needs to be
explicitly initialized on each core (executing `neorv32_rte_setup()`). Note that the installed trap handlers
apply to both cores. The installed user-defined trap handlers can determine the core's ID to perform
core-specific trap handling.
| **Memory** | Each core has its own stack. The top of stack of core 0 is defined by the <<_linker_script>>
while the top of stack of core 1 has to be explicitly defined by core 0 (see <<_dual_core_boot>>). Both
cores share the same heap, `.data` and `.bss` sections. Hence, only core 0 setups the `.data` and `.bss`
Expand All @@ -53,6 +54,19 @@ instructions) or by using <<_atomic_memory_access>>.
|=======================


==== SMP Software Library

An SMP library provides basic functions for launching the secondary core and for performing direct
core-to-core communication:

[cols="<1,<8"]
[grid="none"]
|=======================
| neorv32_smp.c | link:https://stnolting.github.io/neorv32/sw/neorv32__smp_8c.html[Online software reference (Doxygen)]
| neorv32_smp.h | link:https://stnolting.github.io/neorv32/sw/neorv32__smp_8h.html[Online software reference (Doxygen)]
|=======================


==== Inter-Core Communication (ICC)

Both cores can communicate with each other via a direct point-to-point connection based on FIFO-like message
Expand All @@ -78,11 +92,6 @@ The ICC FIFOs do not provide any interrupt capabilities. Software is expected to
interrupt of the receiving core (provided by the <<_core_local_interruptor_clint>>) to inform it about
available messages.

.ICC Software API
[TIP]
The NEORV32 software framework provides API wrappers to abstract inter-core communication:
`sw/lib/include/noevr32_smp.h`


==== Dual-Core Boot

Expand Down
269 changes: 87 additions & 182 deletions docs/datasheet/software_rte.adoc

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion rtl/core/neorv32_package.vhd
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@ package neorv32_package is

-- Architecture Constants -----------------------------------------------------------------
-- -------------------------------------------------------------------------------------------
constant hw_version_c : std_ulogic_vector(31 downto 0) := x"01100903"; -- hardware version
constant hw_version_c : std_ulogic_vector(31 downto 0) := x"01100904"; -- hardware version
constant archid_c : natural := 19; -- official RISC-V architecture ID
constant XLEN : natural := 32; -- native data path width

Expand Down
33 changes: 33 additions & 0 deletions sw/example/demo_dual_core_rte/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
# Application makefile.
# Use this makefile to configure all relevant CPU / compiler options.

# Override the default CPU ISA
MARCH = rv32ia_zicsr_zifencei

# Override the default RISC-V GCC prefix
#RISCV_PREFIX ?= riscv-none-elf-

# Override default optimization goal
EFFORT = -Os

# Add extended debug symbols
USER_FLAGS += -ggdb -gdwarf-3

# Adjust processor IMEM size
USER_FLAGS += -Wl,--defsym,__neorv32_rom_size=16k

# Adjust processor DMEM size
USER_FLAGS += -Wl,--defsym,__neorv32_ram_size=8k

# Adjust maximum heap size
#USER_FLAGS += -Wl,--defsym,__neorv32_heap_size=3k

# Additional sources
#APP_SRC += $(wildcard ./*.c)
#APP_INC += -I .

# Set path to NEORV32 root directory
NEORV32_HOME ?= ../../..

# Include the main NEORV32 makefile
include $(NEORV32_HOME)/sw/common/common.mk
203 changes: 203 additions & 0 deletions sw/example/demo_dual_core_rte/main.c
Original file line number Diff line number Diff line change
@@ -0,0 +1,203 @@
// ================================================================================ //
// The NEORV32 RISC-V Processor - https://github.com/stnolting/neorv32 //
// Copyright (c) NEORV32 contributors. //
// Copyright (c) 2020 - 2025 Stephan Nolting. All rights reserved. //
// Licensed under the BSD-3-Clause license, see LICENSE for details. //
// SPDX-License-Identifier: BSD-3-Clause //
// ================================================================================ //

/**********************************************************************//**
* @file demo_dual_core_rte/main.c
* @brief SMP dual-core program to show how to use the RTE on two cores.
* This example runs the same code on both cores and triggers the timer
* and software interrupts to showcase dual-core trap handling using the
* NEORV32 runtime environment (RTE).
**************************************************************************/
#include <neorv32.h>
#include "spinlock.h"

/** User configuration */
#define BAUD_RATE 19200

/** Global variables */
volatile uint8_t __attribute__ ((aligned (16))) core1_stack[2048]; // stack memory for core1


/**********************************************************************//**
* Machine timer (CLINT) interrupt handler for BOTH cores.
**************************************************************************/
void trap_handler_mtmi(void) {

// find out which core is currently executing this
uint32_t core_id = neorv32_smp_whoami();

spin_lock();
neorv32_uart0_printf("[core %u] MTIMER interrupt.\n", core_id);
spin_unlock();

// compute next interrupt time
uint64_t next_irq_time = neorv32_clint_time_get(); // current system time from CLINT.MTIME
if (core_id == 0) {
next_irq_time += 1 * neorv32_sysinfo_get_clk(); // 1 second for core 0
}
else {
next_irq_time += 2 * neorv32_sysinfo_get_clk(); // 2 seconds for core 0
}

// this is automatically mapped to the current core's MTIMECMP register
neorv32_clint_mtimecmp_set(next_irq_time);

// trigger software interrupt of the other core
if (core_id == 0) {
neorv32_clint_msi_set(1); // trigger core 1
}
else {
neorv32_clint_msi_set(0); // trigger core 0
}
}


/**********************************************************************//**
* Machine software (CLINT) interrupt handler for BOTH cores.
**************************************************************************/
void trap_handler_mswi(void) {

// find out which core is currently executing this
uint32_t core_id = neorv32_smp_whoami();

spin_lock();
neorv32_uart0_printf("[core %u] Software interrupt.\n", core_id);
spin_unlock();

// clear software interrupt of current core
neorv32_clint_msi_clr(core_id);
}


/**********************************************************************//**
* Machine environment call trap handler for BOTH cores.
**************************************************************************/
void trap_handler_ecall(void) {

// find out which core is currently executing this
uint32_t core_id = neorv32_smp_whoami();

spin_lock();
neorv32_uart0_printf("[core %u] Environment call.\n", core_id);
spin_unlock();
}


/**********************************************************************//**
* "Application code" executed by BOTH cores.
*
* @return Irrelevant (but can be inspected by the debugger).
**************************************************************************/
int app_main(void) {

// (re-)setup NEORV32 runtime-environment (RTE) for the core that is executing this code
neorv32_rte_setup();


// print message; use spinlock to have exclusive access to UART0
uint32_t core_id = neorv32_smp_whoami(); // find out which core is currently executing this
spin_lock();
neorv32_uart0_printf("[core %u] Hello world! This is core %u starting 'app_main()'.\n", core_id, core_id);
spin_unlock();


// The NEORV32 Runtime Environment (RTE) provides an internal trap vector table. Each entry
// corresponds to a specific trap (exception or interrupt). Application software can install
// specific trap handler function to take care of each type of trap.

// However, there is only a single trap vector table. Hence, both cores will execute the SAME
// handler function if they encounter the same trap.

// setup machine timer interrupt for ALL cores
neorv32_clint_mtimecmp_set(0); // initialize core-specific MTIMECMP
neorv32_rte_handler_install(RTE_TRAP_MTI, trap_handler_mtmi); // install trap handler
neorv32_cpu_csr_set(CSR_MIE, 1 << CSR_MIE_MTIE); // enable interrupt source

// setup machine software interrupt for ALL cores
neorv32_rte_handler_install(RTE_TRAP_MSI, trap_handler_mswi); // install trap handler
neorv32_cpu_csr_set(CSR_MIE, 1 << CSR_MIE_MSIE); // enable interrupt source

// setup machine environment call trap for ALL cores
neorv32_rte_handler_install(RTE_TRAP_MENV_CALL, trap_handler_ecall); // install trap handler


// trigger environment call exception (just to test the according handler)
asm volatile ("ecall");

// enable machine-level interrupts and wait in sleep mode
neorv32_cpu_csr_set(CSR_MSTATUS, 1 << CSR_MSTATUS_MIE);
while (1) {
neorv32_cpu_sleep();
}

return 0;
}


/**********************************************************************//**
* Main function for core 0 (primary core).
*
* @warning This program requires the dual-core configuration, the CLINT, UART0
* and the A/Zaamo ISA extension.
*
* @return Irrelevant (but can be inspected by the debugger).
**************************************************************************/
int main(void) {

// setup NEORV32 runtime-environment (RTE) for _this_ core (core 0)
// this is not required but keeps us safe
neorv32_rte_setup();


// setup UART0 at default baud rate, no interrupts
if (neorv32_uart0_available() == 0) { // UART0 available?
return -1;
}
neorv32_uart0_setup(BAUD_RATE, 0);
neorv32_uart0_printf("\n<< NEORV32 SMP Dual-Core RTE Demo >>\n\n");


// check hardware/software configuration
if (neorv32_sysinfo_get_numcores() < 2) { // two cores available?
neorv32_uart0_printf("[ERROR] dual-core option not enabled!\n");
return -1;
}
if (neorv32_clint_available() == 0) { // CLINT available?
neorv32_uart0_printf("[ERROR] CLINT module not available!\n");
return -1;
}
if ((neorv32_cpu_csr_read(CSR_MXISA) & (1<<CSR_MXISA_ZAAMO)) == 0) { // atomic memory operations available?
neorv32_uart0_printf("[ERROR] 'A'/'Zaamo' ISA extension not available!\n");
return -1;
}
#ifndef __riscv_atomic
#warning "Application has to be compiled with RISC-V 'A' ISA extension!"
neorv32_uart0_printf("[ERROR] Application has to be compiled with 'A' ISA extension!\n");
return -1;
#endif


// initialize _global_ system timer (CLINT's machine timer)
neorv32_clint_time_set(0);


// start core 1
neorv32_uart0_printf("Launching core 1...\n");
int smp_launch_rc = neorv32_smp_launch(app_main, (uint8_t*)core1_stack, sizeof(core1_stack));
if (smp_launch_rc) { // check if launching was successful
neorv32_uart0_printf("[ERROR] Launching core 1 failed (%d)!\n", smp_launch_rc);
return -1;
}


// start the "application code" that is executed by both cores
app_main();


return 0;
}
31 changes: 31 additions & 0 deletions sw/example/demo_dual_core_rte/spinlock.c
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
/**
* @file spinlock.c
* @brief Single simple spinlock based on atomic memory operations.
*/
#include <neorv32.h>

/**********************************************************************//**
* Private spinlock locked variable.
**************************************************************************/
static volatile uint32_t __spin_locked = 0;


/**********************************************************************//**
* Spinlock: set lock.
*
* @warning This function is blocking until the lock is acquired and set.
**************************************************************************/
void spin_lock(void) {

while(__sync_lock_test_and_set(&__spin_locked, -1)); // -> amoswap.w
}


/**********************************************************************//**
* Spinlock: remove lock.
**************************************************************************/
void spin_unlock(void) {

//__sync_lock_release(&__spin_locked); // uses fence that is not required here
__sync_lock_test_and_set(&__spin_locked, 0); // -> amoswap.w
}
12 changes: 12 additions & 0 deletions sw/example/demo_dual_core_rte/spinlock.h
Original file line number Diff line number Diff line change
@@ -0,0 +1,12 @@
/**
* @file spinlock.h
* @brief Single simple spin-lock based on atomic memory operations.
*/

#ifndef spinlock_h
#define spinlock_h

void spin_lock(void);
void spin_unlock(void);

#endif // spinlock_h
1 change: 0 additions & 1 deletion sw/lib/include/neorv32_rte.h
Original file line number Diff line number Diff line change
Expand Up @@ -65,7 +65,6 @@ enum NEORV32_RTE_TRAP_enum {
void neorv32_rte_setup(void);
void neorv32_rte_core(void);
int neorv32_rte_handler_install(int id, void (*handler)(void));
int neorv32_rte_handler_uninstall(int id);
void neorv32_rte_debug_handler(void);
uint32_t neorv32_rte_context_get(int x);
void neorv32_rte_context_put(int x, uint32_t data);
Expand Down
Loading

0 comments on commit f5bdf69

Please sign in to comment.