[instr logging] Log the faulting instruction before fault side-effects #204

arichardson · 2022-08-11T14:57:22Z

Previously, we would not log any exception output if we get stuck in an
infinite loop due to an ifetch fault after an exception because the
log_instr_commit only happens when we execute an instruction.

To work around this, we can call qemu_log_instr_commit() before handling
the exception/interrupt and inject a fake instruction to the log buffer
(with size==0 and address -1). A nice side-effect of this change is that
the faulting exception is now decoded according to the mode that the CPU
was in before installing the trap vector. Previously a trap from capmode
to a non-capmode handler would disassemble a capability-relative load
as the integer one since the exception handler PCC was already installed
when the logging performs the disassembly.

Instead of no output, we now print an infinite stream of the following
if we get a trap while trying to jump to the exception handler:

[0:0] 0x0000000080000078:  0002c023          csc             cnull,0(ct0)
[0:0] logging exception side effects
-> Switch to Machine mode
-> Exception https://github.com/CTSRD-CHERI/qemu/issues/6 vector 0x0000000000000000 fault-addr 0x0000000080065aa8
    Write mstatus = 0000000a00001800
    Write mcause = 0000000000000006
    Write MEPCC|v:1 s:0 p:00078fff f:1 b:0000000000000000 l:ffffffffffffffff
             |o:0000000080000078 t:000000000003ffff
    Write mbadaddr = 0000000080065aa8
    Write mtval2 = 0000000000000000
    Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff
             |o:0000000000000000 t:000000000003ffff
[0:0] logging exception side effects
-> Switch to Machine mode
-> Exception https://github.com/CTSRD-CHERI/qemu/pull/1 vector 0x0000000000000000 fault-addr 0x0000000000000000
    Write mstatus = 0000000a00001800
    Write mcause = 0000000000000001
    Write MEPCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff
             |o:0000000000000000 t:000000000003ffff
    Write mbadaddr = 0000000000000000
    Write mtval2 = 0000000000000000
    Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff
             |o:0000000000000000 t:000000000003ffff
[0:0] logging exception side effects
-> Switch to Machine mode
-> Exception https://github.com/CTSRD-CHERI/qemu/pull/1 vector 0x0000000000000000 fault-addr 0x0000000000000000
    Write mstatus = 0000000a00001800
    Write mcause = 0000000000000001
    Write MEPCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff
             |o:0000000000000000 t:000000000003ffff
    Write mbadaddr = 0000000000000000
    Write mtval2 = 0000000000000000
    Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff
             |o:0000000000000000 t:000000000003ffff

When no MMU is used and the guest code attempts to fetch an instruction from an invalid memory location, the exception index defaults to a data load access fault, rather an instruction access fault. Signed-off-by: Emmanuel Blot <[email protected]> Reviewed-by: Alistair Francis <[email protected]> Message-id: [email protected] Signed-off-by: Alistair Francis <[email protected]> (cherry picked from commit f9e580c)

Previously, we would not log any exception output if we get stuck in an infinite loop due to an ifetch fault after an exception because the log_instr_commit only happens when we execute an instruction. To work around this, we can call qemu_log_instr_commit() before handling the exception/interrupt and inject a fake instruction to the log buffer (with size==0 and address -1). A nice side-effect of this change is that the faulting exception is now decoded according to the mode that the CPU was in before installing the trap vector. Previously a trap from capmode to a non-capmode handler would disassemble a capability-relative load as the integer one since the exception handler PCC was already installed when the logging performs the disassembly. Instead of no output, we now print an infinite stream of the following if we get a trap while trying to jump to the exception handler: ``` [0:0] 0x0000000080000078: 0002c023 csc cnull,0(ct0) [0:0] logging exception side effects -> Switch to Machine mode -> Exception CTSRD-CHERI#6 vector 0x0000000000000000 fault-addr 0x0000000080065aa8 Write mstatus = 0000000a00001800 Write mcause = 0000000000000006 Write MEPCC|v:1 s:0 p:00078fff f:1 b:0000000000000000 l:ffffffffffffffff |o:0000000080000078 t:000000000003ffff Write mbadaddr = 0000000080065aa8 Write mtval2 = 0000000000000000 Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff |o:0000000000000000 t:000000000003ffff [0:0] logging exception side effects -> Switch to Machine mode -> Exception CTSRD-CHERI#1 vector 0x0000000000000000 fault-addr 0x0000000000000000 Write mstatus = 0000000a00001800 Write mcause = 0000000000000001 Write MEPCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff |o:0000000000000000 t:000000000003ffff Write mbadaddr = 0000000000000000 Write mtval2 = 0000000000000000 Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff |o:0000000000000000 t:000000000003ffff [0:0] logging exception side effects -> Switch to Machine mode -> Exception CTSRD-CHERI#1 vector 0x0000000000000000 fault-addr 0x0000000000000000 Write mstatus = 0000000a00001800 Write mcause = 0000000000000001 Write MEPCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff |o:0000000000000000 t:000000000003ffff Write mbadaddr = 0000000000000000 Write mtval2 = 0000000000000000 Write PCC|v:1 s:0 p:00078fff f:0 b:0000000000000000 l:ffffffffffffffff |o:0000000000000000 t:000000000003ffff ```

Right now we will just print the following useless information: Trap (fault_fetch) was probably caused by: 0x0000000000000000: Address 0x0 is out of bounds.

jrtc27 · 2022-08-11T15:02:17Z

From memory this is all rather hairy and it’s easy to end up with duplicate (or a second garbage) instruction if not careful. How extensively have you tested this in edge cases?

arichardson · 2022-08-11T17:28:12Z

From memory this is all rather hairy and it’s easy to end up with duplicate (or a second garbage) instruction if not careful. How extensively have you tested this in edge cases?

It's been a while (just found those commits in a worktree), but I think it's probably better to have duplicate output in these weird infinite loop corner cases rather than no output at all?

arichardson · 2022-08-11T17:29:54Z

From memory this is all rather hairy and it’s easy to end up with duplicate (or a second garbage) instruction if not careful. How extensively have you tested this in edge cases?

It's been a while (just found those commits in a worktree), but I think it's probably better to have duplicate output in these weird infinite loop corner cases rather than no output at all?

The case I tested was mostly trap where the trap handler cannot be invoked due to an invalid mtcc and the fault on ifetch case.

jrtc27 · 2022-08-11T17:32:23Z

If they're old commits are you sure they're still needed after my recent change? I would expect d6653f7 to handle that correctly...

jrtc27 · 2022-08-11T17:32:39Z

(Note that only dev has that, not qemu-cheri)

arichardson · 2022-08-11T22:30:06Z

Those do indeed look related, but I think it won't fix the ifetch MMU translate fault case?
I'm also pretty sure that adding the fake instruction was rather helpful to avoid duplicates.

Will re-test and see if this is still necessary.

jrtc27 · 2022-08-11T22:31:54Z

The MMU translate comes after the PCC checks, which themselves come after the commit I added (well hoisted)

arichardson · 2022-08-11T23:04:23Z

The MMU translate comes after the PCC checks, which themselves come after the commit I added (well hoisted)

I meant the case where the exception handler target is an invalid address (but within cap bounds), so you get an error from the translator. But this has been more than 6 months so I could be misremembering.

jrtc27 · 2022-08-11T23:06:17Z

The MMU translate comes after the PCC checks, which themselves come after the commit I added (well hoisted)

I meant the case where the exception handler target is an invalid address (but within cap bounds), so you get an error from the translator. But this has been more than 6 months so I could be misremembering.

That should be fine, it still gets a new TB and starts translating, just the insn_start generates code to fault, IIRC

qwattash · 2022-08-17T09:13:30Z

I'm not entirely sure about the splitting the side effects into a separate "fake" instruction. Can't we just run the exception handler and then commit() afterwards? I also have some experimental tracing backends that assume that there are no fake instructions. I can fix them up if it is a strong requirement though, as there are currently no downstream consumers that inspect exceptions anyway. Perhaps another option could be to add the exception side-effects into the event list I have associated with trace entries in the experimental tracing branch.

sifive-eblot and others added 3 commits August 11, 2022 14:54

[RISC-V] Don't attempt to log an instruction on ifetch faults

524ff96

Right now we will just print the following useless information: Trap (fault_fetch) was probably caused by: 0x0000000000000000: Address 0x0 is out of bounds.

arichardson requested review from jrtc27 and qwattash August 11, 2022 14:57

arichardson marked this pull request as draft August 11, 2022 22:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[instr logging] Log the faulting instruction before fault side-effects #204

[instr logging] Log the faulting instruction before fault side-effects #204

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

arichardson commented Aug 11, 2022 •

edited

Loading

jrtc27 commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

qwattash commented Aug 17, 2022

[instr logging] Log the faulting instruction before fault side-effects #204

Are you sure you want to change the base?

[instr logging] Log the faulting instruction before fault side-effects #204

Conversation

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

arichardson commented Aug 11, 2022 • edited Loading

jrtc27 commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

arichardson commented Aug 11, 2022

jrtc27 commented Aug 11, 2022

qwattash commented Aug 17, 2022

arichardson commented Aug 11, 2022 •

edited

Loading