
Use abs-capture-time header extension instead of RTCP SR for calculating captureTime on remote sources when available #86

Open
murillo128 opened this issue Feb 19, 2024 · 6 comments

Comments

@murillo128

The current approach of using the RTCP SR synchronization timestamp for captureTime has two flaws:

  • It doesn't work on unidirectional streams (it requires implementing RRTR).
  • It does not work in SFU scenarios, as you get the timestamp from the SFU rather than the timestamp of the originating source.

In WebRTC we already have a working solution that covers both cases:

https://w3c.github.io/webrtc-extensions/#dom-rtcrtpcontributingsource-capturetimestamp

Should we specify that, when the abs-capture-time header extension is available, it is used instead of the RTCP SR value?
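
For illustration, here is a rough sketch (not a normative recipe) of how the per-source value can be read today, assuming a browser that implements the webrtc-extensions `captureTimestamp` / `senderCaptureTimeOffset` members and that abs-capture-time has been negotiated; `pc` is an assumed existing RTCPeerConnection:

```ts
// Sketch: read the per-source capture timestamp defined in webrtc-extensions
// (https://w3c.github.io/webrtc-extensions/#dom-rtcrtpcontributingsource-capturetimestamp).
// captureTimestamp/senderCaptureTimeOffset are populated from the abs-capture-time
// header extension when it has been negotiated; otherwise they are absent.
declare const pc: RTCPeerConnection; // assumed existing peer connection

const receiver = pc.getReceivers().find(r => r.track.kind === 'video');

function poll() {
  for (const source of receiver?.getSynchronizationSources() ?? []) {
    // Not yet in lib.dom.d.ts, so read loosely typed.
    const { captureTimestamp, senderCaptureTimeOffset } = source as any;
    if (captureTimestamp !== undefined) {
      console.log({ captureTimestamp, senderCaptureTimeOffset });
    }
  }
  requestAnimationFrame(poll);
}
requestAnimationFrame(poll);
```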

@tguilbert-google
Member

@drkron, any thoughts? I don't have much WebRTC experience.

@murillo128
Author

Pinging the usual suspects: @fippo @henbos @alvestrand @jan-ivar @aboba

@aboba

aboba commented Feb 20, 2024

In practice, captureTime is only available via RVFC for locally captured frames. Are you looking to obtain it on the remote peer as well?

@murillo128
Author

murillo128 commented Feb 20, 2024

captureTime is already supported for remote WebRTC sources:

captureTime, of type DOMHighResTimeStamp
For video frames coming from a local source, this is the time at which the frame was captured by the camera. For video frames coming from remote source, the capture time is based on the RTP timestamp of the frame and estimated using clock synchronization. This is best effort and can use methods like using RTCP SR as specified in RFC 3550 Section 6.4.1, or by other alternative means if use by RTCP SR isn’t feasible.

However, relying on RTCP RR or RRTR doesn't provide useful information in an SFU scenario. Using the abs-capture-time value would be the best option in this case.
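
As a sketch of what relying on the extension would look like from the application side, this assumes the webrtc-extensions header-extension control API is available (`pc` is an assumed existing RTCPeerConnection); if the API or the extension is not exposed, SDP munging would be the fallback and the receiver stays on RTCP-SR-based estimation:

```ts
// Sketch: request negotiation of the abs-capture-time RTP header extension via the
// webrtc-extensions header-extension control API.
declare const pc: RTCPeerConnection; // assumed existing peer connection

const ABS_CAPTURE_TIME =
  'http://www.webrtc.org/experimental/rtp-hdrext/abs-capture-time';

for (const transceiver of pc.getTransceivers() as any[]) {
  // Skip browsers that don't expose the header-extension control API.
  if (typeof transceiver.getHeaderExtensionsToNegotiate !== 'function') continue;
  const extensions = transceiver.getHeaderExtensionsToNegotiate();
  for (const ext of extensions) {
    if (ext.uri === ABS_CAPTURE_TIME) ext.direction = 'sendrecv';
  }
  transceiver.setHeaderExtensionsToNegotiate(extensions);
}
```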

@Arctunix

Arctunix commented Mar 5, 2024

When it comes to the "remote" part of captureTime, the current definition of it is very difficult to utilize in practice:

a) RFC 3550 Section 6.4.1 provides the sender with RTT estimations, but what we need is RTT estimations at the receiver. This means that the receiver must either also send its own RTCP Sender Report, or send an RTCP Extended Report with a Receiver Reference Time Report Block and get a DLRR Report Block back (see RFC 3611).

Note that even if the receiver does send its own SR, it may still not be sufficient. WebRTC is (if I remember correctly) implemented to always put the Delay Since Last SR response into a separate RTCP Receiver Report even if the receiver is sending media. This leads us to the awkward situation where the receiver has to "cheat" and use RTT estimations and NTP timestamps from a completely different set of RTCP reports (i.e. from completely different SSRCs) than the ones involved with each video frame in VideoFrameCallbackMetadata.

b) As @murillo128 mentioned above, RFC 3550 Section 6.4.1 and its derivatives are unable to "look beyond" RTCP-terminating mixers.

I believe that it would be more useful to redefine captureTime so that it's always based on timestamps from the capture system's reference clock rather than having to be re-synced to the "local" system's reference clock. This would leave things as-is for the "local" case while allowing abs-capture-time (and possibly "timestamps baked into video frame headers") to be used for the "remote" case.

For example, changing the text from:

For video frames coming from a local source, this is the time at which the frame was captured by the camera. For video frames coming from remote source, the capture time is based on the RTP timestamp of the frame and estimated using clock synchronization. This is best effort and can use methods like using RTCP SR as specified in RFC 3550 Section 6.4.1, or by other alternative means if use by RTCP SR isn't feasible.

To say something along the lines of:

For video frames coming from a local source, this is the time at which the frame was captured by the camera. For video frames coming from a remote source, this is the timestamp set by the system that originally captured the frame, expressed in the capture system's NTP clock (the same clock used to generate NTP timestamps for RTCP sender reports on that system).

In an ideal world, VideoFrameCallbackMetadata would have a full set of properties for the "remote" case:

  1. Capture timestamp from the original capture system's reference clock. This is what's proposed here.

  2. Estimated clock offset between the original capture system's reference clock and the local system's reference clock. This lets us calculate the one-way delay when combined with (1).

  3. The CSRC or SSRC associated with (1) and (2). Knowing the timestamps but not where they come from is problematic when mixers are involved.

This is basically RTCRtpContributingSource, but on a per-frame basis; a rough sketch of that shape follows.
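
Purely as an illustration of the three items above, a hypothetical TypeScript shape (none of these names are proposed normative names):

```ts
// Hypothetical per-frame metadata for remote sources, mirroring
// RTCRtpContributingSource's captureTimestamp/senderCaptureTimeOffset but attached
// to each video frame. Names are illustrative only.
interface RemoteVideoFrameCaptureMetadata {
  // (1) Capture time in the original capture system's reference clock
  // (the same NTP clock it uses for its RTCP sender reports).
  captureTime: DOMHighResTimeStamp;
  // (2) Estimated offset between the capture system's clock and the local
  // system's clock, letting the receiver map captureTime into its own clock
  // and derive a one-way delay estimate.
  captureClockOffset?: DOMHighResTimeStamp;
  // (3) The SSRC or CSRC the two values above were derived from, so they can
  // be attributed correctly when a mixer forwards multiple sources.
  source: number;
}
```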

@drkron

drkron commented Mar 8, 2024

The neat thing (when it works) with the current definition is that all timestamps use the same reference and can be compared to performance.now(). This makes it very simple to calculate glass-to-glass delay, receive-to-render delay, etc.

I would suggest that absoluteCaptureTime is added next to the capture timestamp. This timestamp would then be the unaltered capture timestamp in the sender's NTP clock.
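
For context, a small sketch of the delay calculations the current same-clock definition enables with requestVideoFrameCallback, next to which an absoluteCaptureTime field would sit (the video element is an assumption; metadata fields are read loosely typed since captureTime/receiveTime are optional):

```ts
// Sketch: delay metrics that today's same-clock captureTime definition makes easy
// to compute from requestVideoFrameCallback metadata. An absoluteCaptureTime field,
// as suggested above, would be reported alongside these in the sender's NTP clock
// and would not be directly comparable to performance.now() without a clock offset.
declare const video: HTMLVideoElement; // assumed element playing the remote track

function onFrame(now: DOMHighResTimeStamp, metadata: any) {
  if (metadata.captureTime !== undefined && metadata.receiveTime !== undefined) {
    const glassToGlass = metadata.presentationTime - metadata.captureTime;
    const receiveToRender = metadata.presentationTime - metadata.receiveTime;
    console.log({ glassToGlass, receiveToRender });
  }
  video.requestVideoFrameCallback(onFrame);
}
video.requestVideoFrameCallback(onFrame);
```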
