JS Self-Profiling API

Introduction

Complex web applications currently have limited visibility into where JS execution time is spent on clients. Without the ability to efficiently collect stack samples, applications are forced to instrument their code with profiling hooks that are imprecise and can significantly slow down execution. By providing an API to manipulate a sampling profiler, applications can gather rich execution data for aggregation and analysis with minimal overhead.

Examples

The following example demonstrates how a user may profile an expensive operation, gathering JS execution samples every 10ms. The trace can be sent to a server for analysis to debug outliers and JS execution characteristics in aggregate.

        const profiler = new Profiler({ sampleInterval: 10, maxBufferSize: 10000 });
        const start = performance.now();
        for (let i = 0; i < 1000000; i++) {
          doWork();
        }
        const duration = performance.now() - start;
        const trace = await profiler.stop();
        const traceJson = JSON.stringify({
          duration,
          trace,
        });
        sendTrace(traceJson);

Another common real-world scenario is profiling JS across a pageload. This example profiles execution from the start of script evaluation until the load event fires, sending performance timing data along with the trace.

        const profiler = new Profiler({ sampleInterval: 10, maxBufferSize: 10000 });

        window.addEventListener('load', async () => {
          const trace = await profiler.stop();
          const traceJson = JSON.stringify({
            timing: performance.timing,
            trace,
          });
          sendTrace(traceJson);
        });

        // Rest of the page's JS initialization logic

Profiling Sessions

A profiling session is an abstract producer of samples. Each session has:

A state, which is one of {started, paused, stopped}.
A sample interval, defined as the periodicity at which the session obtains samples.
The UA is NOT REQUIRED to take samples at this rate. However, it is RECOMMENDED that sampling is prioritized to take samples at this rate to produce higher quality traces.
An agent to profile.
A realm to profile.
A time origin that samples' timestamps are measured relative to.
A sample buffer size limit.
A ProfilerTrace storing captured samples.

Multiple profiling sessions on the same page SHOULD be supported.

States

In the started state, the UA SHOULD make a best effort to capture samples by executing the take a sample algorithm [= in parallel =] each time the sample interval has elapsed. In the paused and stopped states, the UA SHOULD NOT capture samples.

Profiling sessions MUST begin in the started state.

The UA MAY move a session from started to paused, and from paused to started.

The user agent is RECOMMENDED to pause the sampling of a profiling session if the browsing context is not in the foreground.

A stopped session MUST NOT move to the started or paused states.
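
The state rules above can be summarized as a small transition table. The following is a minimal sketch for illustration only, not part of the specification: `ALLOWED_TRANSITIONS` and `SessionStateModel` are hypothetical names, and the sketch assumes that both started and paused sessions may move to stopped (for example, when stop() is called).

```javascript
// Hypothetical model of the profiling session states described above.
const ALLOWED_TRANSITIONS = {
  started: new Set(['paused', 'stopped']),
  paused: new Set(['started', 'stopped']),
  stopped: new Set(), // A stopped session never leaves the stopped state.
};

class SessionStateModel {
  constructor() {
    // Profiling sessions begin in the started state.
    this.state = 'started';
  }

  // Moves to `next`, throwing if the transition is not permitted.
  transition(next) {
    if (!ALLOWED_TRANSITIONS[this.state].has(next)) {
      throw new Error(`Invalid transition: ${this.state} -> ${next}`);
    }
    this.state = next;
  }

  // Samples are only captured while the session is started.
  get shouldSample() {
    return this.state === 'started';
  }
}
```

A real UA would drive these transitions itself (for example, pausing when the browsing context moves to the background); script has no way to pause a session directly.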

Processing Model

To take a sample given a profiling session, perform the following steps:

If the length of ProfilerTrace.samples is greater than or equal to the sample buffer size limit associated with the profiling session, fire an event named samplebufferfull at the associated Profiler, set the profiling session's state to stopped, and return.
Let sample be a new ProfilerSample.
Set the ProfilerSample.timestamp property of sample to the current high resolution time relative to the profiling session's time origin.
Let stack be the execution context stack associated with the profiling session's agent.
Set the ProfilerSample.stackId property of sample to the result of the get a stack ID algorithm on stack.
Add sample to the ProfilerTrace.samples associated with the session's ProfilerTrace.
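
The steps above can be sketched in script form as follows. This is an illustrative model under stated assumptions, not how a UA implements sampling: the `session` object shape, the `now` parameter, and the `getStackId` callback are all hypothetical, and the timestamp is computed as a simple difference on the assumption that `now` and `timeOrigin` share a clock.

```javascript
// Hypothetical sketch of the "take a sample" steps; not a real UA implementation.
// `session` is assumed to look like:
//   { state, timeOrigin, sampleBufferSizeLimit, trace: { samples }, profiler }
function takeSample(session, now, getStackId) {
  const { trace, sampleBufferSizeLimit } = session;

  if (trace.samples.length >= sampleBufferSizeLimit) {
    // Buffer is full: notify the profiler and stop the session.
    session.profiler.dispatchEvent(new Event('samplebufferfull'));
    session.state = 'stopped';
    return;
  }

  // Timestamps are relative to the session's time origin.
  const sample = { timestamp: now - session.timeOrigin };

  // stackId is left unset when no frame on the stack may be exposed.
  const stackId = getStackId(session);
  if (stackId !== undefined) {
    sample.stackId = stackId;
  }

  trace.samples.push(sample);
}
```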

To get a stack ID given an execution context stack bound to stack, perform the following steps:

If stack is empty, return undefined.
Let head be the top element of stack, and tail be the remainder of stack after removing its top element.
Let parentId be the result of calling get a stack ID recursively on tail.
Let frameId be the result of calling get a frame ID on head.
If frameId is undefined, return parentId.
Let profilerStack be a new ProfilerStack with ProfilerStack.frameId equal to frameId, and ProfilerStack.parentId equal to parentId.
Return the result of running get an element ID on profilerStack and ProfilerTrace.stacks.
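
As a rough illustration of the recursion above, the following sketch interns stack entries so that samples sharing a call path share a stack ID. It makes several assumptions that are not in the specification: the execution context stack is modeled as an array whose last element is the top, `getFrameId` is a caller-supplied stand-in for the get a frame ID algorithm, and `getElementId` is a simple linear-scan version of get an element ID.

```javascript
// Hypothetical stand-in for "get an element ID": returns the index of an
// element component-wise equal to `item`, appending `item` if none exists.
function getElementId(list, item) {
  const keys = Object.keys(item);
  const index = list.findIndex((candidate) =>
    Object.keys(candidate).length === keys.length &&
    keys.every((k) =>
      Object.prototype.hasOwnProperty.call(candidate, k) &&
      candidate[k] === item[k]));
  if (index !== -1) return index;
  list.push(item);
  return list.length - 1;
}

// Sketch of "get a stack ID". `stack` is ordered bottom-to-top, so the top
// of the execution context stack is the last array element.
function getStackId(stack, getFrameId, stacks) {
  if (stack.length === 0) return undefined;

  const head = stack[stack.length - 1];
  const tail = stack.slice(0, -1);

  const parentId = getStackId(tail, getFrameId, stacks);
  const frameId = getFrameId(head);

  // Frames that may not be exposed are skipped; the parent chain is kept.
  if (frameId === undefined) return parentId;

  const profilerStack = { frameId };
  if (parentId !== undefined) profilerStack.parentId = parentId;
  return getElementId(stacks, profilerStack);
}
```

Because hidden frames return their parent's ID, a filtered frame in the middle of the stack does not break the chain between the frames around it.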

To get a frame ID given an execution context bound to context, perform the following steps:

If the [= realm =] associated with context does not match the realm associated with the profiling session, return undefined.
Let instance be equal to the function instance associated with context.
Let scriptOrModule be equal to the ScriptOrModule associated with context.
Let |attributedScriptOrModule : ScriptOrModule| be equal to the result of running the following algorithm:
1. If |scriptOrModule| is non-null, return |scriptOrModule|.
2. If |instance| is a built-in function object, return the ScriptOrModule containing the function that invoked |instance|.
  The purpose of the above logic is to ensure that built-in functions invoked by inaccessible scripts are not exposed in traces, by using the ScriptOrModule that invoked them for attribution.
  
  "[...] the ScriptOrModule containing the function that invoked |instance|" should be defined more rigorously. We could leverage the top-most execution context on the stack that defines a ScriptOrModule to provide this, but it's not ideal -- there may (theoretically) be other mechanisms for a builtin to be enqueued on the execution context stack, in which case the attribution would be invalid.
3. Otherwise, return null.
If |attributedScriptOrModule| is null, return undefined.
Let |attributedScript : Script| be the [= script =] obtained from |attributedScriptOrModule|.[[\HostDefined]].
If |attributedScript| is a [= classic script =] and its muted errors boolean is equal to true, return undefined.
This check ensures that we avoid including stack frames from cross-origin scripts served in a CORS-cross-origin response. We may want to consider renaming muted errors to better reflect this use case.
Let frame be a new ProfilerFrame.
Set ProfilerFrame.name of frame to the function instance name associated with |instance|.
If |scriptOrModule| is non-null:
1. Let script be the script obtained from scriptOrModule.[[\HostDefined]].
2. Let resourceString be equal to the base URL of script.
3. Set ProfilerFrame.resourceId to the result of running get an element ID on resourceString and ProfilerTrace.resources.
4. Set ProfilerFrame.line of frame to the 1-based index of the line at which instance is defined in |script|.
5. Set ProfilerFrame.column of frame to the 1-based index of the column at which instance is defined in |script|.
Return the result of running get an element ID on frame and ProfilerTrace.frames.

To get an element ID for an item in a list, run the following steps:

If there exists an element in list component-wise equal to item, return its index.
Otherwise, append item to the end of list and return its index.

The Profiler Interface

      [Exposed=Window]
      interface Profiler : EventTarget {
        readonly attribute DOMHighResTimeStamp sampleInterval;
        readonly attribute boolean stopped;

        constructor(ProfilerInitOptions options);
        Promise<ProfilerTrace> stop();
      };

Each Profiler MUST be associated with exactly one profiling session.

The sampleInterval attribute MUST reflect the sample interval of the associated profiling session expressed as a DOMHighResTimeStamp.

The stopped attribute MUST be true if and only if the profiling session has state stopped.

{{Profiler}} is only exposed on {{Window}} until consensus is reached on [[Permissions-Policy]] and {{Worker}} integration.

new Profiler(options)

new Profiler(options) runs the following steps given an object options of type ProfilerInitOptions:

If options' {{ProfilerInitOptions/sampleInterval}} is less than 0, throw a RangeError.
Get the policy value for "js-profiling" in the Document. If the result is false, throw a "NotAllowedError" DOMException.
Create a new profiling session where:

The associated sample interval is set to ProfilerInitOptions.sampleInterval if the UA supports that interval, or otherwise to the next lowest interval supported by the UA.
The associated time origin is equal to the time origin of the current global object.
The associated sample buffer size limit is set to {{ProfilerInitOptions/maxBufferSize}}.
The associated [= agent =] is set to the surrounding agent.
The associated [= realm =] is set to the current realm record.
The associated ProfilerTrace is set to «[{{ProfilerTrace/resources}} → «», {{ProfilerTrace/frames}} → «», {{ProfilerTrace/stacks}} → «», {{ProfilerTrace/samples}} → «»]».

Return a new Profiler associated with the newly created profiling session.
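
Since the constructor can throw, and the interface may not be exposed at all, callers may want to guard construction. The helper below is a minimal sketch of one way to do so; `tryCreateProfiler` is a hypothetical name, and taking the constructor as a parameter is a choice made here so that the helper can be exercised outside a browser.

```javascript
// Hypothetical helper: attempts to start a profiler, returning null when
// profiling is unavailable or disallowed.
function tryCreateProfiler(ProfilerCtor, options) {
  // The API may not be exposed in this environment.
  if (typeof ProfilerCtor !== 'function') return null;

  try {
    return new ProfilerCtor(options);
  } catch (err) {
    // RangeError: sampleInterval was negative.
    // NotAllowedError: the "js-profiling" policy disallows profiling.
    if (err instanceof RangeError || (err && err.name === 'NotAllowedError')) {
      return null;
    }
    throw err; // Anything else is unexpected; let it propagate.
  }
}
```

In a page, this might be called as `tryCreateProfiler(globalThis.Profiler, { sampleInterval: 10, maxBufferSize: 10000 })`, with the caller skipping trace collection when the result is null.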

stop() method

Stops the profiler and returns a trace. This method MUST run these steps:

If the associated [= profiling session =]'s state is stopped, return [= a promise rejected with =] an "InvalidStateError" DOMException.
Set the [= profiling session =]'s state to stopped.
Let |p:Promise| be [= a new promise =].
Run the following steps [= in parallel =]:
1. Perform any [= implementation-defined =] work to stop the [= profiling session =].
2. Resolve |p| with the {{ProfilerTrace}} associated with the profiler's [= profiling session =].
Return |p|.

Any samples taken after stop() is invoked SHOULD NOT be included in the ProfilerTrace associated with the profiling session.
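
Because stop() rejects once the session is already stopped, code that may reach stop() from more than one path can check the stopped attribute first. The following is a small sketch of that pattern; `stopIfRunning` is a hypothetical helper name, and returning null for an already-stopped profiler is a choice made for this example.

```javascript
// Hypothetical helper: stops the profiler only if its session is still
// running, avoiding the InvalidStateError rejection from a repeated stop().
async function stopIfRunning(profiler) {
  if (profiler.stopped) {
    return null; // Session already stopped; there is nothing new to collect.
  }
  return profiler.stop();
}
```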

The ProfilerTrace Dictionary

      typedef DOMString ProfilerResource;

      dictionary ProfilerTrace {
        required sequence<ProfilerResource> resources;
        required sequence<ProfilerFrame> frames;
        required sequence<ProfilerStack> stacks;
        required sequence<ProfilerSample> samples;
      };

The resources attribute MUST return the ProfilerResource list set by the take a sample algorithm.

The frames attribute MUST return the ProfilerFrame list set by the take a sample algorithm.

The stacks attribute MUST return the ProfilerStack list set by the take a sample algorithm.

The samples attribute MUST return the ProfilerSample list set by the take a sample algorithm.

Inspired by the V8 trace event format and Gecko profile format, this representation is designed to be easily and efficiently serializable.

The ProfilerSample Dictionary

        dictionary ProfilerSample {
          required DOMHighResTimeStamp timestamp;
          unsigned long long stackId;
        };

timestamp MUST return the value it was initialized to.

stackId MUST return the value it was initialized to.

The ProfilerStack Dictionary

        dictionary ProfilerStack {
          unsigned long long parentId;
          required unsigned long long frameId;
        };

parentId MUST return the value it was initialized to.

frameId MUST return the value it was initialized to.

The ProfilerFrame Dictionary

        dictionary ProfilerFrame {
          required DOMString name;
          unsigned long long resourceId;
          unsigned long long line;
          unsigned long long column;
        };

name MUST return the value it was initialized to.

resourceId MUST return the value it was initialized to.

line MUST return the value it was initialized to.

column MUST return the value it was initialized to.
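
To show how the four lists in a ProfilerTrace fit together, the sketch below expands one sample into a readable call stack by following stackId, then parentId, and looking up each frame and its resource. `resolveStack` is a hypothetical helper written for this example; it assumes a trace shaped like the dictionaries above.

```javascript
// Hypothetical helper: resolves a sample's stackId into an array of frames,
// innermost (currently executing) frame first.
function resolveStack(trace, sample) {
  const frames = [];
  let stackId = sample.stackId;

  // Walk the parent chain until a stack entry with no parentId is reached.
  while (stackId !== undefined) {
    const stack = trace.stacks[stackId];
    const frame = trace.frames[stack.frameId];
    frames.push({
      name: frame.name,
      // resourceId, line, and column are optional (e.g. for builtins).
      resource: frame.resourceId === undefined
        ? undefined
        : trace.resources[frame.resourceId],
      line: frame.line,
      column: frame.column,
    });
    stackId = stack.parentId;
  }

  return frames;
}
```

A server-side aggregator could apply this to every entry in `trace.samples` and count how often each function name appears at index 0 to estimate self time.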

HTTP Method	URI Template
POST	`/session/{session id}/forcesample`

Privacy and Security

The following sections detail some of the privacy and security choices of the API, illustrating protection strategies against various types of attacks.

Cross-origin script contents

The API avoids exposing the contents of cross-origin scripts by requiring every function included via the take a sample algorithm to be defined in a script whose response is CORS-same-origin. This is enforced through the muted errors check. Browser builtins (such as performance.now()) must also only be included when invoked from [= CORS-same-origin =] script.

As a result, the API does not expose any new insight into the contents or execution characteristics of cross-origin script, beyond what is already possible through manual instrumentation. UAs are encouraged to verify this holds if they choose to support extremely low sample interval values (e.g. less than one millisecond).

Cross-origin execution

The realm check performed as part of the take a sample algorithm should prevent cross-origin execution contexts from being observable by the API. Cross-origin iframes and other execution contexts that share an agent with a profiler will therefore not have their execution observable through this API.

Timing attacks

Timing attacks remain a concern for any API that could introduce a new source of high-resolution timing information. Timestamps gathered in traces should be obtained from the same source as [[?HR-Time]]'s current high resolution time to avoid exposing a new vector for side-channel attacks.

See [[?HR-Time]]'s discussion on clock resolution.