Interrupts on RISC-V #847

thejpster · 2024-09-01T15:51:43Z

) Adds an Xh3irq controller driver
) Adds a default MachineExternal handler which uses the Xh3irq to run whichever interrupts are pending
) Adds a bunch of interrupt control stuff to hal::arch
) Fixes the powman_test example to use the new interrupt APIs - it now works on RISC-V and Arm.

Tested powman_test on a Pico 2 in both riscv32imac-unknown-none-elf and thumbv8m.main-none-eabihf.

rp235x-hal-examples/src/bin/i2c_async.rs

rp235x-hal/src/xh3irq.rs

thejpster · 2024-09-02T16:46:57Z

We might want to block this until we've written a new interrupt macro that works like the cortex-m-rt macro and does the static mut transform and type-checks the function signature.

rp235x-hal-examples/Cargo.toml

jannic · 2024-09-13T19:27:33Z

rp235x-hal/src/arch.rs

+    #[no_mangle]
+    #[allow(non_snake_case)]
+    unsafe fn DefaultIrqHandler() {
+        panic!();


cortex_m does not panic but loop {} in the default handler.
It did panic for a short while in the past, but that was reverted here: https://github.com/rust-embedded/cortex-m-rt/pull/289/files#diff-b1a35a68f14e696205874893c07fd24fdb88882b47c23cc0e0c80a30c7d53759L993

I'm not sure if the reasons for that decision also apply for RISC-V, but at least for the rp2040/rp235x case it would be nicer if both implementations did the same.

I'll fix this

Changed it to spin

jannic · 2024-09-13T20:09:20Z

We might want to block this until we've written a new interrupt macro that works like the cortex-m-rt macro and does the static mut transform and type-checks the function signature.

As the rp235x is a multi-core processor, and given that rust-embedded/cortex-m#411 indicates that the transformation made by cortex-m-rt is unsound on multi-core systems, I don't think we should just copy it. Is there a good way to make the transform safe on rp235x? Taking a full critical section on each interrupt entry is probably too expensive?

rp235x-hal-examples/src/bin/gpio_irq_example.rs

thejpster · 2024-09-13T21:30:35Z

So I guess the risk is that both CPUs enter the same interrupt handler at the same time. If one peripheral interrupt signal is unmasked on both CPUs, that could happen. So perhaps the fix is to design the interrupt API to make that impossible (like a bitmask in a static atomic or the watchdog scratch that records which cpu has which interrupt enabled). Or we could ensure that if you enter an interrupt on Core A, it is masked for the duration on Core B - which is nicer than turning all interrupts off or holding a spin lock.

But given 2040 has the same problem the question is whether we want to fix it here or make a plan and fix it later.

9names · 2024-09-14T00:47:00Z

The user already unmasks the interrupt, and that operation is unsafe. It is almost certainly a bug if they do so on both cores, and we have had no reports of people doing this accidentally.

Until someone comes has a use-case where having both cores handle an interrupt is beneficial, I think trying to solve this is not a good use of our time.

My opinion is we should document "don't do this" and move on with our lives

jannic · 2024-09-14T07:35:46Z

But given 2040 has the same problem the question is whether we want to fix it here or make a plan and fix it later.

That's an option, I just wouldn't want to spend time on porting the macro only to remove it again later.

Until someone comes has a use-case where having both cores handle an interrupt is beneficial,

Are there situations where it's not avoidable? Any non maskable interrupts or exceptions?

jannic · 2024-09-14T08:56:34Z

But to be honest, there's a second thing I dislike about the interrupt macro, so I'm biased.

For me, it's totally non-obvious that the macro annotation changes the code inside the function.
So if I see this:

#[interrupt]
fn SOME_INTERRUPT() {
  static mut SOMETHING: Option<Whatever> = None;
  [...]
  *SOMETHING = foo(); 
}

I always feel like the usage of the variable doesn't match the declaration. This was very confusing when I was new to embedded rust, and it still sometimes surprises me.

Therefore I'd prefer something more obvious. Perhaps a macro directly on the static mut?

#[interrupt]
fn SOME_INTERRUPT() {
  let something: &mut Option<Whatever> = interrupt_singleton!( didn't think about what comes here yet :Option<Whatever> );
  [...]
  *something = foo(); 
}

Probably needs an unsafe declaration as well, as using this macro outside an interrupt (or in a re-entrant interrupt) would be unsound. So it's far from a finished proposal, of course. This is only meant as an explanation why I don't like the current interrupt macro.

Back to topic: Personally I'd merge the PR as it is, and not wait for an #[interrupt] macro implementation for RISC-V. The current situation is not perfect, but it's good enough, and it's not clear yet how an improvement would look like.

(That said, if someone ported #[interrupt] I wouldn't oppose that either. While I still don't like it, I do value consistency.)

jannic · 2024-09-14T09:52:52Z

So I guess the risk is that both CPUs enter the same interrupt handler at the same time. If one peripheral interrupt signal is unmasked on both CPUs, that could happen.

BTW, do I understand xh3irq correctly that it already solves this issue? xh3irq::get_next_interrupt() should only return a given interrupt to a single CPU core, even if the interrupt is enabled on both, right?

jannic · 2024-09-14T16:02:57Z

For the rp2040, we could use separate vector tables for core0 and core1, and let the macro create two distinct functions with their own statics, so even if an interrupt was actually running on both cores at the same time, it would not access the same data.
While this would be sound, I don't know if it would be useful: At least the common pattern of initializing the static on the first interrupt would break:

  #[interrupt]
  fn IO_IRQ_BANK0() {
      // The `#[interrupt]` attribute covertly converts this to `&'static mut Option<LedAndButton>`
      static mut LED_AND_BUTTON: Option<LedAndButton> = None;

      // This is one-time lazy initialisation. We steal the variables given to us
      // via `GLOBAL_PINS`.
      if LED_AND_BUTTON.is_none() {
          critical_section::with(|cs| {
              *LED_AND_BUTTON = GLOBAL_PINS.borrow(cs).take();
          });
      }
      [...]

The second core to run this interrupt would always get a None value, and therefore wouldn't be able to do any useful work.

thejpster · 2024-09-14T19:01:42Z

So I guess the risk is that both CPUs enter the same interrupt handler at the same time. If one peripheral interrupt signal is unmasked on both CPUs, that could happen.

BTW, do I understand xh3irq correctly that it already solves this issue? xh3irq::get_next_interrupt() should only return a given interrupt to a single CPU core, even if the interrupt is enabled on both, right?

I think the UART interrupt is cleared by reading the FIFO which can happen before the ISR ends. So it could re-fire before the ISR ends. Not sure either Interrupt Controller can avoid that.

thejpster · 2024-11-02T13:45:47Z

What do we want to do with this? There's been some changes over in riscv-rt since.

winksaville · 2024-11-04T21:01:57Z

What do we want to do with this? There's been some changes over in riscv-rt since.

I'd like to see interrupts working on risc-v and willing to work on this if you'd like. That
said, I'd need some initial guidance as I'm a noob so it will definitely take me some time.

1) Use the Machine Timer, not the cycle counter 2) Use the hal::arch module to control interrupts 3) Write a MachineExternal interrupt handler which asks Xh3irq which interrupt to run next, and then runs it. Note: we lose the macros that do that static-mut hack for us. Maybe we can add that back in later.

We don't mention NVIC in the examples, because in RISC-V mode the interrupt controller is the Xh3irq. Unless its the vector-table example, which only works on Arm currently.

I switched to the ADC FIFO IRQ so that the index and the bit number would be different.

jonathanpallant · 2024-11-13T16:44:30Z

I rebased and took out any use of the unsound static-mut conversion. Now the interrupts just run in a critical section. Not yet tested on hardware.

Also it doesn't need to be unsafe.

thejpster force-pushed the add-xh3irq-driver branch from 4c95771 to 03e56ec Compare September 1, 2024 16:28

thejpster mentioned this pull request Sep 2, 2024

Support RP2350 romancardenas/riscv-slic#9

Open

9names reviewed Sep 2, 2024

View reviewed changes

rp235x-hal-examples/src/bin/i2c_async.rs Outdated Show resolved Hide resolved

rp235x-hal/src/xh3irq.rs Outdated Show resolved Hide resolved

jannic reviewed Sep 9, 2024

View reviewed changes

rp235x-hal-examples/Cargo.toml Outdated Show resolved Hide resolved

jannic reviewed Sep 13, 2024

View reviewed changes

rp235x-hal-examples/src/bin/gpio_irq_example.rs Outdated Show resolved Hide resolved

thejpster added 5 commits November 13, 2024 16:11

Adds a driver for the Xh3irq interrupt controller

521417a

Add a MachineTimer driver.

7101a62

Clean up the examples.

a5c43d1

We don't mention NVIC in the examples, because in RISC-V mode the interrupt controller is the Xh3irq. Unless its the vector-table example, which only works on Arm currently.

Clarify Xh3irq testing

e2b3113

I switched to the ADC FIFO IRQ so that the index and the bit number would be different.

jonathanpallant force-pushed the add-xh3irq-driver branch from 4fd6e7d to e2b3113 Compare November 13, 2024 16:42

Change DefaultIrqHandler to spin, not panic.

0b81e79

Also it doesn't need to be unsafe.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interrupts on RISC-V #847

Interrupts on RISC-V #847

thejpster commented Sep 1, 2024

thejpster commented Sep 2, 2024

jannic Sep 13, 2024

thejpster Nov 13, 2024

jonathanpallant Nov 13, 2024

jannic commented Sep 13, 2024 •

edited

Loading

thejpster commented Sep 13, 2024

9names commented Sep 14, 2024 •

edited

Loading

jannic commented Sep 14, 2024

jannic commented Sep 14, 2024 •

edited

Loading

jannic commented Sep 14, 2024

jannic commented Sep 14, 2024

thejpster commented Sep 14, 2024

thejpster commented Nov 2, 2024

winksaville commented Nov 4, 2024

jonathanpallant commented Nov 13, 2024

Interrupts on RISC-V #847

Are you sure you want to change the base?

Interrupts on RISC-V #847

Conversation

thejpster commented Sep 1, 2024

thejpster commented Sep 2, 2024

jannic Sep 13, 2024

Choose a reason for hiding this comment

thejpster Nov 13, 2024

Choose a reason for hiding this comment

jonathanpallant Nov 13, 2024

Choose a reason for hiding this comment

jannic commented Sep 13, 2024 • edited Loading

thejpster commented Sep 13, 2024

9names commented Sep 14, 2024 • edited Loading

jannic commented Sep 14, 2024

jannic commented Sep 14, 2024 • edited Loading

jannic commented Sep 14, 2024

jannic commented Sep 14, 2024

thejpster commented Sep 14, 2024

thejpster commented Nov 2, 2024

winksaville commented Nov 4, 2024

jonathanpallant commented Nov 13, 2024

jannic commented Sep 13, 2024 •

edited

Loading

9names commented Sep 14, 2024 •

edited

Loading

jannic commented Sep 14, 2024 •

edited

Loading