Signature-based Integrity

Unofficial Proposal Draft

More details about this document
This version:
https://mikewest.github.io/signature-based-sri/
Feedback:
public-webappsec@w3.org with subject line “[signature-based-sri] … message topic …” (archives)
Issue Tracking:
GitHub
Inline In Spec
Editor:
(Google LLC.)

Abstract

A monkey-patch spec that enhances SRI with signature-based integrity checks. These are conceptually similar to the content-based checks currently defined, but have different properties that seem interesting to explore.

1. Introduction

Subresource Integrity [SRI] defines a mechanism by which developers can ensure that the scripts or stylesheets loaded into their pages' contexts are _exactly_ those scripts or stylesheets the developer expected. By specifying a SHA-256 hash of a resource’s content, developers ensure that any malicious or accidental deviation will be blocked before being executed. This is an excellent defense, but its deployment turns out to be brittle. If the resource living at a specific URL is dynamic, then content-based integrity checks require pages and the resources they depend upon to update in lockstep. This turns out to be ~impossible in practice, which makes SRI less usable than it could be.
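The content-based check described here boils down to hashing the resource body and comparing the result against the integrity attribute. A minimal sketch of how such an expression is produced (Python, for illustration; the `sha256-` prefix and base64 encoding follow [SRI]'s hash-expression grammar):

```python
import base64
import hashlib

def integrity_value(body: bytes) -> str:
    """Produce a "sha256-<base64 digest>" integrity expression for a
    resource body, in the format used by SRI's integrity attribute."""
    digest = hashlib.sha256(body).digest()
    return "sha256-" + base64.b64encode(digest).decode("ascii")

# For example, for the script body used throughout this document:
integrity_value(b'console.log("Hello, world!");')
```

A page would then embed the resulting string in its `integrity` attribute; the user agent recomputes the digest over the delivered bytes and blocks execution on mismatch.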

Particularly as the industry becomes more interested in supply-chain integrity (see Shopify’s [PCIv4-SRI-Gaps], for instance), it seems reasonable to explore alternatives to static hashes that could allow wider deployment of these checks, and therefore better understanding of the application experiences that developers are _actually_ composing.

This document outlines the changes that would be necessary to [Fetch] and [SRI] in order to support the simplest version of a signature-based check:

Pages will embed an Ed25519 public key assertion into integrity attributes:
<script src="https://my.cdn/script.js"
        crossorigin="anonymous"
        integrity="ed25519-[base64-encoded-public-key]"></script>

Servers will deliver a signature using the corresponding private key along with the resource as an HTTP response header:

HTTP/1.1 200 OK
Accept-Ranges: none
Vary: Accept-Encoding
Content-Type: text/javascript; charset=UTF-8
Access-Control-Allow-Origin: *
Integrity: ed25519-[base64-encoded result of Ed25519(`console.log("Hello, world!");`)]

console.log("Hello, world!");

The user agent will validate the signature using the expected public key before executing the response.

That’s it!

The goal here is to flesh out the proposal for discussion, recognizing that it might be too simple to ship. Then again, it might be _just_ simple enough...

1.1. Signatures are not Hashes

Subresource Integrity’s existing hash-based checks ensure that specific, known _content_ executes. It doesn’t care who made the file or from which server it was retrieved: as long as the content matches the expectation, we’re good to go. This gives developers the ability to ensure that a specific set of audited scripts are the only ones that can execute in their pages, providing a strong defense against some kinds of threats.

The signature-based checks described briefly above are different. Rather than validating that a specific script or stylesheet is known-good, they instead act as a proof of _provenance_ which ensures that scripts will only execute if they’re signed with a known private key. Assuming good key-management practices (easy, right?), this gives a guarantee which is different in kind, but similarly removes the necessity to trust intermediaries.

With these properties in mind, signature-based integrity checks aim to protect against attackers who might be able to manipulate the content of resources that a site depends upon, but who cannot gain access to the signing key.

2. Monkey Patches

Extending SRI to support signatures will require changes to three specifications, along with some additional infrastructure.

2.1. Patches to SRI

At a high level, we’ll make the following changes to SRI:

  1. We’ll define the accepted algorithm values. Currently, these are left up to user agents in order to allow for future flexibility: given that the years since SRI’s introduction have left the set of accepted algorithms and their practical ordering unchanged, we should define that explicitly.

  2. With known algorithms, we can adjust the prioritization model to return a set of the strongest content-based and signature-based algorithms specified in a given element. This would enable developers to specify both a hash and signature expectation for a resource, ensuring both that known resources load, _and_ that they’re accepted by a trusted party.

    This might not be necessary. It allows us to explain things like packaging constraints in ways that seem useful, but does introduce some additional complexity in developers' mental model. So, consider it a decision point.

  3. Finally, we’ll adjust the matching algorithm to correctly handle signatures by passing the public key into the comparison operation.

The following sections adjust algorithms accordingly.

2.1.1. Parse metadata.

First, we’ll define valid signature algorithms: a string is a valid SRI signature algorithm token if it is an ASCII case-insensitive match for "ed25519", the only signature algorithm this proposal supports.

Then, we’ll adjust SRI’s Parse metadata. algorithm as follows:

This algorithm accepts a string, and returns a map containing one set of hash expressions whose hash functions are understood by the user agent, and one set of signature expressions which are likewise understood:

  1. Let result be the ordered map «[ "hashes" → « », "signatures" → « » ]».

  2. For each item returned by splitting metadata on spaces:

    1. Let expression-and-options be the result of splitting item on U+003F (?).

    2. Let algorithm-expression be expression-and-options[0].

    3. Let base64-value be the empty string.

    4. Let algorithm-and-value be the result of splitting algorithm-expression on U+002D (-).

    5. Let algorithm be algorithm-and-value[0].

    6. If algorithm-and-value[1] exists, set base64-value to algorithm-and-value[1].

    7. If algorithm is neither a valid SRI hash algorithm token nor a valid SRI signature algorithm token, then continue.

    8. Let data be the ordered map «["alg" → algorithm, "val" → base64-value]».

    9. If algorithm is a valid SRI hash algorithm token, then append data to result["hashes"].

    10. Otherwise, if algorithm is a valid SRI signature algorithm token, then append data to result["signatures"].

  3. Return result.
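The parsing steps above can be sketched as follows. This is an illustrative Python rendering, not normative text; the two token sets are assumptions drawn from this proposal (SRI's existing hash algorithms plus "ed25519" as the only signature algorithm token):

```python
def parse_metadata(metadata: str) -> dict:
    """Sketch of the adjusted "Parse metadata." algorithm: sort each
    recognized expression into a "hashes" or "signatures" bucket."""
    HASH_ALGORITHMS = {"sha256", "sha384", "sha512"}
    SIGNATURE_ALGORITHMS = {"ed25519"}

    result = {"hashes": [], "signatures": []}
    for item in metadata.split(" "):
        # Drop any "?options" suffix, then split "alg-base64value".
        expression, _, _options = item.partition("?")
        algorithm, _, base64_value = expression.partition("-")
        if algorithm not in HASH_ALGORITHMS | SIGNATURE_ALGORITHMS:
            continue  # Unknown tokens are skipped, not treated as errors.
        data = {"alg": algorithm, "val": base64_value}
        if algorithm in HASH_ALGORITHMS:
            result["hashes"].append(data)
        else:
            result["signatures"].append(data)
    return result
```

Note how an attribute mixing both kinds, e.g. `"sha256-abc ed25519-xyz"`, yields one entry in each bucket, which is what enables the dual-requirement matching described below.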

2.1.2. Do bytes and header list match metadataList?

Since we adjusted the result of § 2.1.1 Parse metadata. above, we need to adjust the matching algorithm to match. The core change will be processing both hashing and signature algorithms: if only one kind is present, the story will be similar to today, and multiple strong algorithms can be present, allowing multiple distinct resources to match. If both hashing and signature algorithms are present, both will be required to match. This is conceptually similar to the application of multiple Content Security Policies.

In order to validate signatures, we’ll need to change Fetch to pass in the relevant HTTP response header. For the moment, let’s simply pass in the entire header list:

  1. Let parsedMetadata be the result of executing SRI § 3.3.2 Parse metadata on metadataList.

  2. If both parsedMetadata["hashes"] and parsedMetadata["signatures"] are empty, return true.

  3. Let hash-metadata be the result of executing SRI § 3.3.3 Get the strongest metadata from set on parsedMetadata["hashes"].

  4. Let signature-metadata be the result of executing SRI § 3.3.3 Get the strongest metadata from set on parsedMetadata["signatures"].
  5. Let hash-match be true if hash-metadata is empty, and false otherwise.
  6. Let signature-match be true if signature-metadata is empty, and false otherwise.
  7. For each item in hash-metadata:

    1. Let algorithm be item["alg"].

    2. Let expectedValue be item["val"].

    3. Let actualValue be the result of SRI § 3.3.1 Apply algorithm to bytes on algorithm and bytes.

    4. If actualValue is a case-sensitive match for expectedValue, set hash-match to true and break.

  8. For each item in signature-metadata:
    1. Let algorithm be item["alg"].
    2. Let public key be item["val"].
    3. Let result be the result of validating a signature using algorithm over bytes and header list with public key.
    4. If result is true, set signature-match to true and break.
  9. Return true if both hash-match and signature-match are true. Otherwise, return false.
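The shape of the adjusted matching algorithm can be sketched as below. This is an illustrative simplification: it takes the already-parsed metadata map directly, elides the "get the strongest metadata" filtering step, and uses `apply_algorithm` and `validate_signature` as stand-ins for the operations the spec references (the names are hypothetical, not normative):

```python
def bytes_and_header_list_match(parsed, body, header_list,
                                apply_algorithm, validate_signature):
    """Sketch of the adjusted matching algorithm: hash expressions and
    signature expressions are enforced independently, and both kinds
    must be satisfied when both are present."""
    hashes = parsed["hashes"]
    signatures = parsed["signatures"]
    if not hashes and not signatures:
        return True

    # An absent kind vacuously matches; otherwise at least one
    # expression of that kind must match.
    hash_match = not hashes
    for item in hashes:
        if apply_algorithm(item["alg"], body) == item["val"]:
            hash_match = True
            break

    signature_match = not signatures
    for item in signatures:
        if validate_signature(item["alg"], body, header_list, item["val"]):
            signature_match = True
            break

    return hash_match and signature_match
```

This structure is what yields the CSP-like "both policies must pass" behavior: a resource matching only its hash expectation, or only its signature expectation, fails when the page asserted both.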

2.1.3. Validate a signature using algorithm over bytes and header list with public key

The matching algorithm above calls into a new signature validation function. Let’s write that down. At core, it will execute the Ed25519 validation steps from [RFC8032], using signatures extracted from an Integrity header that’s defined in § 2.1.4 Integrity Header.

To validate a signature using a string algorithm over a byte sequence bytes, a header list header list, and string public key, execute the following steps. They return valid if the signature is valid, or invalid otherwise.
  1. If algorithm is an ASCII case-insensitive match for "ed25519", then:

    1. Let signatures be the result of getting, decoding, and splitting `Integrity` from header list.

    2. If signatures is null, return invalid.

    3. For each signature in signatures:

      1. Execute the "Verify" steps for Ed25519 as defined in Section 5.1.7 of [RFC8032], using bytes as the message M, public key as the public key A, and signature as the signature.

      2. If the signature is valid, return valid.

    4. Return invalid.

  2. Assert: We won’t reach this step, because ed25519 is the only valid signature algorithm token.

  3. Return invalid.
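The validation steps above can be sketched as follows. This is illustrative only: the header list is modeled as a list of (name, value) pairs, the comma-splitting is a simplification of Fetch's full "getting, decoding, and splitting" algorithm (which also handles quoted strings), and `ed25519_verify` is a hypothetical stand-in for RFC 8032's Verify operation:

```python
def get_decode_split(name, header_list):
    """Simplified sketch of Fetch's 'getting, decoding, and splitting':
    combine every value for `name`, then split the result on commas."""
    values = [v for (n, v) in header_list if n.lower() == name.lower()]
    if not values:
        return None
    return [part.strip() for part in ", ".join(values).split(",") if part.strip()]

def validate_signature(algorithm, body, header_list, public_key, ed25519_verify):
    """Sketch of 'validate a signature': try each signature carried in
    the Integrity header until one verifies under the expected key."""
    if algorithm.lower() != "ed25519":
        return False  # ed25519 is the only valid signature algorithm token.
    signatures = get_decode_split("Integrity", header_list)
    if signatures is None:
        return False
    for expression in signatures:
        # Each list member is an "ed25519-<base64 signature>" expression.
        prefix, _, signature = expression.partition("-")
        if prefix.lower() != "ed25519":
            continue
        if ed25519_verify(message=body, public_key=public_key, signature=signature):
            return True
    return False
```

Trying every signature in the header (rather than only the first) is what lets a server sign a resource with several keys at once, which matters for the rotation story in § 3.2.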

2.1.4. Integrity Header

Rather than introducing this header, perhaps we could/should reuse the Identity-Digest proposal [ID.pardue-http-identity-digest], along with the Signature and Signature-Input headers from [RFC9421]. That would avoid reinventing the wheel, and seems pretty reasonable. [Issue #16]

The Integrity HTTP response header specifies integrity metadata for a given response. It is a Structured Header whose value MUST be a list of tokens [RFC9651].

Valid list values match the hash-expression grammar as defined in [SRI].

A resource might be delivered with an integrity header specifying a signature that can be used to validate the resource’s provenance:
HTTP/1.1 200 OK
Accept-Ranges: none
Vary: Accept-Encoding
Content-Type: text/javascript; charset=UTF-8
Access-Control-Allow-Origin: *
Integrity: ed25519-[base64-encoded Ed25519 signature]

Do we need a mechanism (another header?) allowing the server to specify the public key used to sign the resource? That might allow developers to discover keys for resources more easily, and could be used to reject the resource without validation if we can determine a priori that the keys don’t match...

Would it be useful to extend this header’s behavior to include client-side content validation for hash algorithms? I think it’s arguably outside SRI’s threat model, but you could imagine an attacker that could change content but not headers, which would make enforcement of an Integrity header on the client meaningful for a variety of resources (including top-level documents, which would help provide a web-accessible explanation for some packaging behavior).

That is, a resource delivered with:

Integrity: sha256-[base64’d hash goes here]

Could throw a network error in Fetch if the hash didn’t match the delivered content. Likewise, a resource delivered with:

Integrity: ed25519-[base64’d signature goes here];public-key=[base64’d public key goes here]

Could throw a network error if the delivered signature and public key didn’t validate against the resource’s content.

Or, sites could go crazy and deliver a header containing both:

Integrity: sha256-[base64’d hash goes here],
           ed25519-[base64’d signature goes here];public-key=[base64’d public key goes here]

Which would enforce both constraints.

Not sure it’s a priority, but it might be an interesting primitive to extract from this proposal (especially if we end up adding a streaming hash primitive like [RFC7693] as suggested in issue #104, or its successor, suggested at TPAC in 2024 ).

2.2. Patches to Fetch

The only change we need to make to Fetch is to pass additional information into the matching algorithm as redefined above.

Step 22.3.1 of Fetch § 4.1 Main fetch should be updated as follows:

  1. If bytes and response’s header list do not match request’s integrity metadata, then run processBodyError and abort these steps. [SRI]

3. Deployment Considerations

3.1. Key Management

Key management is hard. This proposal doesn’t change that.

It aims instead to be very lightweight. Perhaps it errs in that direction, but the goal is to be the simplest possible mechanism that supports known use-cases.

A different take on this proposal could be arbitrarily complex, replicating aspects of the web PKI to chain trust, allow delegation, etc. That seems like more than we need today, and substantially more work. Perhaps something small is good enough?

3.2. Key Rotation

Since this design relies on websites pinning a specific public key in the integrity attribute, this design does not easily support key rotation. If a signing key is compromised, there is no easy way to rotate the key and ensure that reliant websites check signatures against an updated public key.

For now, we think this is probably enough. If the key is compromised, the security model falls back to the status quo web security model, meaning that the impact of a compromised key is limited. In the future if this does turn out to be a significant issue, we could also explore alternate designs that do support key rotation. One simple proposal could be adding support for the client to signal the requested public key in request headers, allowing different parties to specify different public keys. A more complex proposal could support automated key rotation.

Note: This proposal does support pinning multiple keys for a single resource, so it will be possible to support rotation in a coordinated way without requiring each entity to move in lockstep.
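For instance, a page in the middle of a rotation might list both the outgoing and incoming keys in a single attribute, accepting a resource signed with either (the bracketed keys below are placeholders, following the examples above):

```html
<script src="https://my.cdn/script.js"
        crossorigin="anonymous"
        integrity="ed25519-[base64-encoded old key] ed25519-[base64-encoded new key]"></script>
```

Once every dependent page lists the new key, the server can stop signing with the old one, and pages can drop it at their leisure.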

4. Security Considerations

4.1. Secure Contexts

SRI does not require a secure context, nor does it apply only to resources delivered via encrypted and authenticated channels. That means that it’s entirely possible to believe that SRI offers a level of protection that it simply cannot aspire to. Signatures do not change that calculus.

Thus, it remains recommended that developers rely on integrity metadata only within secure contexts. See also [SECURING-WEB].

4.2. Provenance, not Content

Signatures do not provide any assurance that the content delivered is the content a developer expected. They ensure only that the content was signed by the expected entity. This could allow resources signed by the same entity to be substituted for one another in ways that could violate developer expectations.

In some cases, developers can defend against this confusion by using hashes instead of signatures (or, as discussed above, both hashes and signatures). Servers can likewise defend against this risk by minting fresh keys for each interesting resource. This, of course, creates more key-management problems, but it might be a reasonable tradeoff.

4.3. Rollback Attacks

The simple signature checks described in this document only provide proof of provenance, ensuring that a given resource was at one point signed by someone in possession of the relevant private key. They do not say anything about whether that entity intended to deliver a given resource to you now. In other words, these checks do not prevent rollback/downgrade attacks in which old, known-bad versions of a resource might be delivered, along with their known signatures.

This might not be a problem, depending on developers' use cases. If it becomes a problem, it seems possible to add mitigations in the future. These could take various forms, ranging from enforcing freshness by signing additional timestamps through to sending a random challenge along with requests that would be included in the signature.

We’d want to evaluate the tradeoffs in these approaches (the latter, for example, makes offline signing difficult), and might wish to offer several options.

5. Privacy Considerations

Given that the validation of a response’s signature continues to require the response to opt-into legibility via CORS, this mechanism does not seem to add any new data channels from the server to the client. The choice of private key used to sign the resource is potentially interesting, but doesn’t seem to offer any capability that isn’t possible more directly by altering the resource body or headers.

Conformance

Document conventions

Conformance requirements are expressed with a combination of descriptive assertions and RFC 2119 terminology. The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in the normative parts of this document are to be interpreted as described in RFC 2119. However, for readability, these words do not appear in all uppercase letters in this specification.

All of the text of this specification is normative except sections explicitly marked as non-normative, examples, and notes. [RFC2119]

Examples in this specification are introduced with the words “for example” or are set apart from the normative text with class="example", like this:

This is an example of an informative example.

Informative notes begin with the word “Note” and are set apart from the normative text with class="note", like this:

Note, this is an informative note.

References

Normative References

[Fetch]
Anne van Kesteren. Fetch Standard. Living Standard. URL: https://fetch.spec.whatwg.org/
[HTML]
Anne van Kesteren; et al. HTML Standard. Living Standard. URL: https://html.spec.whatwg.org/multipage/
[INFRA]
Anne van Kesteren; Domenic Denicola. Infra Standard. Living Standard. URL: https://infra.spec.whatwg.org/
[RFC2119]
S. Bradner. Key words for use in RFCs to Indicate Requirement Levels. March 1997. Best Current Practice. URL: https://datatracker.ietf.org/doc/html/rfc2119
[RFC8032]
S. Josefsson; I. Liusvaara. Edwards-Curve Digital Signature Algorithm (EdDSA). January 2017. Informational. URL: https://www.rfc-editor.org/rfc/rfc8032
[RFC9651]
M. Nottingham; P-H. Kamp. Structured Field Values for HTTP. September 2024. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc9651
[SRI]
Devdatta Akhawe; et al. Subresource Integrity. URL: https://w3c.github.io/webappsec-subresource-integrity/

Informative References

[ID.pardue-http-identity-digest]
Lucas Pardue. HTTP Identity Digest. URL: https://www.ietf.org/archive/id/draft-pardue-http-identity-digest-01.html
[PCIv4-SRI-Gaps]
Yoav Weiss; Ilya Grigorik. PCIv4: SRI gaps and opportunities. URL: https://docs.google.com/document/d/1RcUpbpWPxXTyW0Qwczs9GCTLPD3-LcbbhL4ooBUevTM/edit?usp=sharing
[RFC7693]
M-J. Saarinen, Ed.; J-P. Aumasson. The BLAKE2 Cryptographic Hash and Message Authentication Code (MAC). November 2015. Informational. URL: https://www.rfc-editor.org/rfc/rfc7693
[RFC9421]
A. Backman, Ed.; J. Richer, Ed.; M. Sporny. HTTP Message Signatures. February 2024. Proposed Standard. URL: https://www.rfc-editor.org/rfc/rfc9421
[SECURING-WEB]
Mark Nottingham. Securing the Web. TAG Finding. URL: https://www.w3.org/2001/tag/doc/web-https
