diff --git a/src/cheri-pte-ext.adoc b/src/cheri-pte-ext.adoc index aa430e9e..ca1666b9 100644 --- a/src/cheri-pte-ext.adoc +++ b/src/cheri-pte-ext.adoc @@ -1,25 +1,33 @@ [#section_sv_cheri] [#cheri_pte_ext] -== "{cheri_pte_ext_name}" Extension for CHERI Page-Based Virtual-Memory Systems (RV64 only) +== Extending Page-Based Virtual-Memory Systems for CHERI (RV64 only), including "{cheri_pte_ext_name}" -CHERI is a security mechanism that is generally orthogonal to page-based -virtual-memory management as defined in cite:[riscv-priv-spec]. -However, it is helpful in CHERI harts to extend RISC-V's virtual-memory -management to facilitate capability revocation and control the flow of -capabilities in memory at the page granularity. For this reason, the -{cheri_pte_ext_name} extension adds new bits to RISC-V's Page Table Entry (PTE) -format. +NOTE: _Sv32_ (for RV32) does not have any spare PTE bits, and so no features from this chapter can be implemented. -NOTE: There is no explicit mechanism for enabling or disabling {cheri_pte_ext_name}. A VM-enabled legacy (non-CHERI) OS running in {cheri_int_mode_name} will not load or store capabilities, and so the default state of CW=0 causing loaded capabilities to have their tags cleared, and stored capabilities with their tags set to cause a page fault, won't occur. +In CHERI harts the Page Table Entry (PTE) format is extended to control the flow of capabilities in memory (see <>). +This is achieved by adding the PTE.CW bit described below and is a mandatory feature when any virtual memory translation scheme (_Sv39_, _Sv48_ or _Sv57_) is implemented on an RV64 system. +By default PTE.CW=0 which will prevent legacy OSs from being able to load or store tagged capabilities without software modification. -A CHERI-aware OS running a VM-enabled OS is strongly recommended to support {cheri_pte_ext_name}, and the minimum level of support is to set CW to 1 in all PTEs intended for storing capabilities (i.e. anonymous mappings) and leave <>.CRG and CRG in all PTEs set to 0, which will allow capabilities with their tags set to be loaded and stored successfully. +Additionally the {cheri_pte_ext_name} extension adds the ability to perform capability revocation of user mode pages (see <>) by adding the PTE.CRG bit, and <>.UCRG as described below. -Therefore when implementing any RV64 virtual memory translation scheme (_Sv39_, _Sv48_ or _Sv57_) and {cheri_base_ext_name}, implementing {cheri_pte_ext_name} is strongly recommended. +NOTE: {cheri_pte_ext_name} is strongly recommended but not mandatory as a future version of this specification may specify an improved method. -NOTE: It is possible to detect the presence of {cheri_pte_ext_name} in software, by configuring a page table entry without programming CW and without setting <>.CRG, and testing for an exception on storing a tagged capability. +NOTE: There is no explicit mechanism for enabling or disabling {cheri_pte_ext_name}. -NOTE: _Sv32_ (for RV32) does not have any spare PTE bits, and so this extension cannot be implemented. +NOTE: If software ignores the new PTE bits then there is no change in functionality unless capabilities are accessed. +NOTE: A future version of this specification may include kernel revocation which may require an <>.SCRG bit. + +The remainder of this chapter jointly specifies the behavior of PTE.CW, PTE.CRG and <>.UCRG. + +NOTE: The description below assumes that {cheri_pte_ext_name} has been implemented. + If that is _not_ the case then PTE.CRG and <>.UCRG should be taken as read-only-zero for purpose of the description in the remainder of this chapter only. + PTE.CRG and <>.UCRG remain reserved in this case. + +The minimum level of PTE support is to set CW to 1 in all PTEs intended for storing capabilities (i.e. private anonymous mappings) and leave <>.UCRG and CRG in all PTEs set to 0, which will allow capabilities with their tags set to be loaded and stored successfully. + + +[#limit_cap_prop] === Limiting Capability Propagation Page table enforcement can allow the operating system to limit the flow @@ -46,6 +54,7 @@ a natural solution. ^*^ _allocated using mmap_ +[#cap_revocation] === Capability Revocation Page table enforcement can accelerate concurrent capability revocation @@ -112,7 +121,7 @@ cleared <>. of this state. ** When a capability store or AMO instruction is executed and the tag bit of the capability being written is set, the -implementation sets the CW bit and assigns the CRG bit equal to <>.CRG. +implementation sets the CW bit and assigns the CRG bit equal to <>.UCRG. + The PTE update must be atomic with respect to other accesses to the PTE, and must atomically check @@ -135,39 +144,40 @@ When CW is set, the CRG bit indicates the current generation of the virtual memo regards to the ongoing capability revocation cycle. Two schemes are permitted: * A load page fault exception is raised when a capability load or AMO instruction is executed -with <> granted and the virtual page's CRG bit does not equal <>.CRG. +with <> granted and the virtual page's CRG bit does not equal <>.UCRG in user mode. * A load page fault exception is raised when a capability load or AMO instruction is executed -with <> granted and the virtual page's CRG bit does not equal <>.CRG +with <> granted and the virtual page's CRG bit does not equal <>.UCRG in user mode. and the capability read from memory optionally has its tag set^1^. [[pte_cw_crg_load_summary]] .Summary of Load CW and CRG behavior in the PTEs -[%autowidth,float="center",align="center",cols="<,<,<",options="header"] +[%autowidth,float="center",align="center",cols="<,<,<,<",options="header"] |=== -|PTE.CW |PTE.CRG |Load/AMO -| 0 | X | Clear loaded tag -| 1 |≠ <>.CRG | Page fault, or page fault if tag is set^1^ -| 1 |= <>.CRG | Normal operation +|PTE.CW |PTE.U|PTE.CRG |Load/AMO +| 0 | X | X | Clear loaded tag +| 1 | 1 |≠ <>.UCRG | Page fault, or page fault if tag is set^1^ +| 1 | 1 |= <>.UCRG | Normal operation +| 1 | 0 | X | Normal operation^2^ |=== +^1^ The choice here is whether to take data dependent exceptions on loads or atomic operations. + It is legal for the implementation to fault even if the tag is not set since this behavior is only an optimization for software. + This means it is also legal to only check the tag under certain conditions and conservatively fault otherwise. + Taking a trap when the tag is not set will introduce additional traps during revocation sweeps. + Checking the loaded tag affects the exception priority, see <>. + +^2^ A future version of this specification may check an SCRG bit in <> for kernel revocation. + [[pte_cw_crg_store_summary]] .Summary of Store CW and CRG behavior in the PTEs [%autowidth,float="center",align="center",cols="<,<,<",options="header"] |=== |PTE.CW |PTE.CRG |Store/AMO | 0 | 0 | Page fault if stored tag is set -| 0 | 1 | Page fault if stored tag is set, or hardware CW and CRG update^2^ +| 0 | 1 | Page fault if stored tag is set, or <> | 1 | X | Normal operation |=== -^1^ The choice here is whether to take data dependent exceptions on loads or atomic operations. - It is legal for the implementation to fault even if the tag is not set since this behavior is only an optimization for software. - This means it is also legal to only check the tag under certain conditions and conservatively fault otherwise. - Taking a trap when the tag is not set will introduce additional traps during revocation sweeps. - Checking the loaded tag affects the exception priority, see <>. - -^2^ See <>. - The exceptions added by {cheri_pte_ext_name} reuse the load page fault and store/AMO page fault exception cause values, and so the cause of the exception can be determined by software by checking the value in <>, <> etc. @@ -181,19 +191,16 @@ The decision about whether to take exceptions on capability stores with the tag These cause PTE Accessed and Dirty updates to be done in software, via the exception handler, or by a hardware mechanism respectively. * If only _Svade_ is implemented, or enabled through henvcfg.ADUE or menvcfg.ADUE, then take a page fault. -* If only _Svadu_ is implemented, or enabled through henvcfg.ADUE or menvcfg.ADUE, then do the hardware update of setting PTE.CW=1 and setting PTE.CRG=<>.CRG as described in <>. +* If only _Svadu_ is implemented, or enabled through henvcfg.ADUE or menvcfg.ADUE, then do the hardware update of setting PTE.CW=1 and setting PTE.CRG=<>.UCRG as described in <>. [#xstatus_pte] === Extending the Supervisor (sstatus) and Virtual Supervisor (vsstatus) Status Registers The <> and <> CSRs are extended to include the new Capability Read Generation (CRG) bit as shown. -When V=1 <>.CRG is in effect. - -<>.CRG also exists. Reading or writing it is equivalent to reading or writing <>.CRG. +When V=1 <>.UCRG is in effect. -NOTE: As there is no M-mode translation available in RISC-V, there is no current software use for <>.CRG. -It is _only_ included not to break the rule that <> is required to be a subset of <>. +<>.UCRG also exists. Reading or writing it is equivalent to reading or writing <>.UCRG. [#mstatusreg_pte] @@ -232,8 +239,9 @@ It is _only_ included not to break the rule that <> is r {bits: 1, name: 'WPRI'}, {bits: 1, name: 'MPELP'}, {bits: 1, name: 'MDT'}, - {bits: 19, name: 'WPRI'}, - {bits: 1, name: 'CRG'}, + {bits: 18, name: 'WPRI'}, + {bits: 1, name: 'UCRG'}, + {bits: 1, name: 'WPRI'}, {bits: 1, name: 'SD'}, ], config:{lanes: 4, hspace:1024}} .... @@ -262,8 +270,9 @@ It is _only_ included not to break the rule that <> is r {bits: 1, name: 'SDT'}, {bits: 7, name: 'WPRI'}, {bits: 2, name: 'UXL[1:0]'}, - {bits: 28, name: 'WPRI'}, - {bits: 1, name: 'CRG'}, + {bits: 27, name: 'WPRI'}, + {bits: 1, name: 'UCRG'}, + {bits: 1, name: 'WPRI'}, {bits: 1, name: 'SD'}, ], config:{lanes: 4, hspace:1024}} .... @@ -289,8 +298,9 @@ It is _only_ included not to break the rule that <> is r {bits: 1, name: 'SUM'}, {bits: 12, name: 'WPRI'}, {bits: 2, name: 'UXL[1:0]'}, - {bits: 28, name: 'WPRI'}, - {bits: 1, name: 'CRG'}, + {bits: 27, name: 'WPRI'}, + {bits: 1, name: 'UCRG'}, + {bits: 1, name: 'WPRI'}, {bits: 1, name: 'SD'} ], config:{lanes: 4, hspace:1024}} .... diff --git a/src/insns/load_exceptions.adoc b/src/insns/load_exceptions.adoc index de2b69eb..ab714c9d 100644 --- a/src/insns/load_exceptions.adoc +++ b/src/insns/load_exceptions.adoc @@ -23,10 +23,9 @@ listed below; in this case, _CHERI data fault_ is reported in the <> or |============================================================================== + -If {cheri_pte_ext_name} is implemented, and virtual memory is enabled, then the state of -<>.CW and <>.CRG from the current virtual memory page, -together with <>.CRG may cause a CHERI <> page fault exception -in addition to a normal RISC-V page fault exception. +If virtual memory is enabled, then the state of <>.CW, +and, if {cheri_pte_ext_name} is implemented, <>.CRG, <>.U and <>.UCRG, +may cause a CHERI <> page fault exception in addition to a normal RISC-V page fault exception. See <> for the exception reporting in this case. + :!load_res: diff --git a/src/insns/store_exceptions.adoc b/src/insns/store_exceptions.adoc index d3db318e..5518ffde 100644 --- a/src/insns/store_exceptions.adoc +++ b/src/insns/store_exceptions.adoc @@ -24,7 +24,8 @@ listed below; in this case, _CHERI data fault_ is reported in the <> or + If {cheri_pte_ext_name} is implemented, and virtual memory is enabled, then the state of <>.CW and <>.CRG from the current virtual memory page may -cause a CHERI <> page fault exception in addition to a normal RISC-V page fault. +cause a CHERI <> page fault exception in addition to a normal RISC-V page fault +when operating in user mode. See <> for the exception reporting in this case. + :!store_cond: