
Using ffmpeg filter_complex to split output to two (or more) rtsp streams #1546

Open · DJ-DotSoft opened this issue Jan 12, 2025 · 17 comments
Labels: question (Further information is requested)

DJ-DotSoft commented Jan 12, 2025

I have been using the recommended way of generating sub-streams from a camera - sub-streams have the main stream as input, e.g.:

Camera1:
  - ffmpeg:device?video=/dev/video0&<parameters_omitted>

Camera1_sub1: ffmpeg:http://localhost:1984/api/stream.mp4?src=Camera1#<parameters_omitted>
Camera1_sub2: ffmpeg:http://localhost:1984/api/stream.mp4?src=Camera1#<parameters_omitted>

This works but:

  1. My RPi struggles as I am using three webcams (almost 100% CPU)
  2. Sometimes (perhaps because of the 100% CPU) the main stream stops without generating an error, so there is no output from either the main or the sub-streams (eventually the go2rtc service seems to get restarted, maybe because of all the dead ffmpeg tasks?)
  3. There is a big (>3s) lag between main and sub-streams in Frigate, which means the recorded video never catches any detections (but they are visible on snapshots and when "scrubbing" through the video)

So, I am experimenting with combining all the streams into one "exec" command to see if this solves some of the issues.

The problem I have is that I am not sure how to set the outputs of the ffmpeg command. I read that I can use an empty stream, but I can't find an example, so I tried this:

streams:
  Camera1_sub:
    # empty

  Camera1:
    - exec:ffmpeg -hide_banner -v error -f v4l2 -input_format h264 -video_size 1920x1080 -framerate 30 -i /dev/video0 -filter_complex [0:v]split=2[v1][v2];[v1]format=yuv420p[v1out];[v2]scale=1280x720:flags=bilinear[v2out] -map [v1out] -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency -f rtsp {output} -map [v2out] -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency -f rtsp rtsp://127.0.0.1:8554/Camera1_sub

The important bit is I am using "{output}" for the main stream output, and "rtsp://127.0.0.1:8554/Camera1_sub" for the sub-stream.

The log shows:

[rtsp] new producer stream=Camera1_sub
[exec] [rtsp @ 0x30d2b80] method SETUP failed: 461 Unsupported transport
[rtsp] new consumer stream=Camera1_sub
[rtsp] error=streams: unknown error stream=Camera1_sub

I think I must be close, but some advice or an example would be fantastic.
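
One guess (untested): ffmpeg's RTSP output tries UDP transport by default, and the 461 could mean go2rtc's RTSP server only accepts TCP (interleaved) transport. If so, adding -rtsp_transport tcp before each output might fix it - the same command as above, only with that flag added:

# same command as above, but forcing TCP for both RTSP outputs;
# -rtsp_transport is an option of ffmpeg's RTSP muxer and goes before each output URL
ffmpeg -hide_banner -v error -f v4l2 -input_format h264 -video_size 1920x1080 -framerate 30 -i /dev/video0 \
  -filter_complex "[0:v]split=2[v1][v2];[v1]format=yuv420p[v1out];[v2]scale=1280x720:flags=bilinear[v2out]" \
  -map "[v1out]" -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency \
    -rtsp_transport tcp -f rtsp {output} \
  -map "[v2out]" -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency \
    -rtsp_transport tcp -f rtsp rtsp://127.0.0.1:8554/Camera1_sub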

Obviously, any pointers to what I should be doing instead to solve the issues are also appreciated!

AlexxIT added the question label Jan 12, 2025
AlexxIT (Owner) commented Jan 12, 2025

Please show go2rtc WebUI > Add > FFmpeg devices page. And let me know the models of your cameras.

DJ-DotSoft (Author) commented:

I'm actually developing this on an RPi with one v2 camera:

Planar YUV 4:2:0 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=yuv420p&video_size={32-4056,#video=h264#hardware
YUYV 4:2:2 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=yuyv422&video_size={32-4056,#video=h264#hardware
24-bit RGB 8-8-8 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=rgb24&video_size={32-4056,#video=h264#hardware
JFIF JPEG | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=mjpeg&video_size={32-4056,
H.264 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=h264&video_size={32-4056,
Motion-JPEG | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=mjpeg&video_size={32-4056,
YVYU 4:2:2 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=Unsupported&video_size={32-4056,#video=h264#hardware
VYUY 4:2:2 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=Unsupported&video_size={32-4056,#video=h264#hardware
UYVY 4:2:2 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=uyvy422&video_size={32-4056,#video=h264#hardware
Y/CbCr 4:2:0 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=nv12&video_size={32-4056,#video=h264#hardware
24-bit BGR 8-8-8 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=bgr24&video_size={32-4056,#video=h264#hardware
Planar YVU 4:2:0 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=yuv420p&video_size={32-4056,#video=h264#hardware
Y/CrCb 4:2:0 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=Unsupported&video_size={32-4056,#video=h264#hardware
32-bit XBGR 8-8-8-8 | {32-4056, 2}x{32-3040, 2} | ffmpeg:device?video=/dev/video0&input_format=Unsupported&video_size={32-4056,#video=h264#hardware

The RPi with the three cameras (I have not tried filter_complex on that RPi, as I assumed I had the rtsp output part wrong):

Motion-JPEG	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video0&input_format=mjpeg&video_size=2560x1440
YUYV 4:2:2	320x240 480x272 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video0&input_format=yuyv422&video_size=320x240#video=h264#hardware
Y/UV 4:2:0	320x240 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video0&input_format=nv12&video_size=320x240#video=h264#hardware
H.264	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video0&input_format=h264&video_size=2560x1440
HEVC	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video0&input_format=Unsupported&video_size=2560x1440
Motion-JPEG	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video2&input_format=mjpeg&video_size=2560x1440
YUYV 4:2:2	320x240 480x272 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video2&input_format=yuyv422&video_size=320x240#video=h264#hardware
Y/UV 4:2:0	320x240 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video2&input_format=nv12&video_size=320x240#video=h264#hardware
H.264	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video2&input_format=h264&video_size=2560x1440
HEVC	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video2&input_format=Unsupported&video_size=2560x1440
Motion-JPEG	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video4&input_format=mjpeg&video_size=2560x1440
YUYV 4:2:2	320x240 480x272 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video4&input_format=yuyv422&video_size=320x240#video=h264#hardware
Y/UV 4:2:0	320x240 640x368 640x480 1024x576 1280x720 1920x1080 320x240	ffmpeg:device?video=/dev/video4&input_format=nv12&video_size=320x240#video=h264#hardware
H.264	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video4&input_format=h264&video_size=2560x1440
HEVC	2560x1440 640x360 640x480 1024x576 1280x720 1920x1080 2560x1440	ffmpeg:device?video=/dev/video4&input_format=Unsupported&video_size=2560x1440

DJ-DotSoft (Author) commented:

My ideal solution would allow the RPi to output h264 (for each of the three cameras) with the main stream at 2560x1440 and the sub-stream at 1280x720, both streams with a timestamp. I don't think the RPi is going to be powerful enough for that though, so a compromise of the main stream at 1920x1080 would be OK.
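
For reference, the shape of the ffmpeg command I have in mind is below (an untested sketch; it assumes the ffmpeg build includes the drawtext filter, and it draws the timestamp once before the split so the text is only rendered one time):

# hypothetical two-output command: timestamped 2560x1440 main + 1280x720 sub
ffmpeg -f v4l2 -input_format h264 -video_size 2560x1440 -framerate 30 -i /dev/video0 \
  -filter_complex "[0:v]drawtext=text='%{localtime}':x=10:y=10:fontcolor=white[ts];[ts]split=2[v1][v2];[v2]scale=1280:720[v2out]" \
  -map "[v1]" -c:v h264 -preset:v superfast -tune:v zerolatency -f rtsp {output} \
  -map "[v2out]" -c:v h264 -preset:v superfast -tune:v zerolatency -f rtsp rtsp://127.0.0.1:8554/Camera1_sub

Whether the RPi can manage the two simultaneous encodes this needs is exactly the open question.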

AlexxIT (Owner) commented Jan 13, 2025

You didn't say your camera models. I am looking for test cameras like yours myself. Maybe you can send a link to the shop.

I don't think you should transcode the stream in go2rtc. You should take the H264 codec directly from the camera and not change it.

It's better to resize the picture on input to Frigate. Frigate will have to convert the H264 codec to YUV anyway, and it can resize the picture in the same step.

DJ-DotSoft (Author) commented:

They are USB webcams from AliExpress: https://www.aliexpress.com/item/1005004404049549.html. They have an IR cut filter and can control IR LEDs (from a 12V PSU). Happy to share more details if you want. I did think about using the non-compressed output of the camera, but the max resolution is 1920x1080. It is a possibility though.

I tried using one full-resolution H264 stream from each camera at the very beginning. It works, and the RPi CPU usage is almost nothing (as you'd expect), but my camera system will have 24+ cameras and I would need a very powerful PC for Frigate. So instead, I want to make use of the RPi's processing power and create more of a distributed system.

Also, I made a mistake in the first post - the delay between main and sub-stream is >3 minutes (not seconds). Here is an example from this morning.

This is when scrubbing with the mouse (sub-stream is used in Frigate):

[screenshot]

Here is when watching the video (main stream is used in Frigate):

[screenshot]

The time difference is 2 minutes and 49 seconds, so slightly less than I said. Maybe the lag grows the longer the cameras have been running?

AlexxIT (Owner) commented Jan 13, 2025

Thanks! Looks like there are 3 cameras in the link:

  • 4MP 3.6MM
  • NO LENS
  • 5MP 1.8MM

What's yours?

This seems to be the link for "The RPi with the three cameras", because the link name says "2560*1440 CMOS".

And what about the model of this camera: "I'm actually developing this on an RPi with one v2 camera"?

You can't avoid H264-to-YUV transcoding on input to Frigate; it's going to happen in any case. The exception is if you connect the cameras directly to the Frigate server and take the YUV format from them, ignoring the MJPEG and H264 formats.

DJ-DotSoft (Author) commented:

The difference is in what lens you want. I got mine with no lenses and bought lenses to suit each camera (depending on how wide an angle you need). If you press View More and scroll down on the AliExpress website, there are some examples and a graph showing the different lenses.

Yes, this is for the RPi with three cameras.

The other RPi has the RPi Foundation high quality camera: https://www.raspberrypi.com/products/raspberry-pi-high-quality-camera/ (not v2 as that is something else, my bad)

The pipeline for using non-compressed would be:

YUV from camera -> ffmpeg complex filter {add timestamp text -> h264 compress -> output 1 rtsp, reduce to 1280x720 -> add timestamp text -> h264 compress -> output 2 rtsp}

output 1 is full-resolution h264 for recording in Frigate; output 2 is lower resolution for display and detection in Frigate.

What I am trying now is:

h264 from camera -> ffmpeg complex filter {no change to stream -> output 1 rtsp, decompress -> reduce to 1280x720 -> add timestamp text -> h264 compress -> output 2 rtsp}
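
As a concrete (untested) sketch of that second pipeline, with illustrative output URLs - output 1 can be a plain -c:v copy leg, since anything that goes through a filter has to be re-encoded, while the copy leg passes the camera's H264 through untouched:

# main stream: passthrough copy; sub-stream: decode, scale, timestamp, re-encode
ffmpeg -f v4l2 -input_format h264 -video_size 2560x1440 -framerate 30 -i /dev/video0 \
  -map 0:v -c:v copy -f rtsp {output} \
  -map 0:v -vf "scale=1280:720,drawtext=text='%{localtime}'" \
    -c:v h264 -preset:v superfast -tune:v zerolatency -f rtsp rtsp://127.0.0.1:8554/Camera1_sub

That way ffmpeg only runs the decoder for the second output, so the cost per camera is roughly one decode plus one 720p encode.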

At the moment (with the two streams per camera) Frigate is using about 3-4% CPU per camera, so it is OK for 24+ cameras.

AlexxIT (Owner) commented Jan 13, 2025

> h264 from camera -> ffmpeg complex filter {no change to stream -> output 1 rtsp, decompress -> reduce to 1280x720 -> add timestamp text -> h264 compress -> output 2 rtsp}

How do you plan to use output 1 and output 2?

AlexxIT (Owner) commented Jan 13, 2025

Thank you. I ordered the camera from Ali.
For your information, I recently added native support for v4l2 without ffmpeg. YUV and MJPEG are supported there.
But H264 isn't supported yet because I don't have those cameras.
Without FFmpeg it will consume even fewer resources, but there is no possibility of transcoding.
If you take the YUV format from the camera, you can output it to a remote FFmpeg via the HTTP protocol with almost zero CPU cost. But to get H264 from YUV, you still need transcoding.
https://github.com/AlexxIT/go2rtc/commits/master/

DJ-DotSoft (Author) commented:

Let me know if you want to use the IR cut filter and add IR LEDs, as it took me a while to work out how to wire them.

I will experiment with YUV and the latest master.

Before I do, can you explain how to set up two rtsp outputs (this is the original question I asked)? I found that exec.go takes the md5 of the camera stream's name to build the URL it sends to. Is that what I am missing for the second output? At the moment I am trying:

streams:
  Camera1_medium:  # empty source (as documented in "Source: Webtorrent")
  
  Camera1:
    - exec:ffmpeg -hide_banner -v error -f v4l2 -input_format h264 -video_size 1920x1080 -framerate 30 -i /dev/video0 -filter_complex [0:v]split=2[v1][v2];[v1]format=yuv420p[v1out];[v2]scale=1280x720:flags=bilinear[v2out] -map [v1out] -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency -f rtsp {output} -map [v2out] -c:v h264 -g 50 -profile:v high -level:v 4.1 -preset:v superfast -tune:v zerolatency -f rtsp rtsp://localhost:8554/Camera1_medium

Should "Camera1_medium" be the md5 of this string? i.e. 73e5073b7420fb75abecb3d48525683c

AlexxIT (Owner) commented Jan 13, 2025

Should "Camera1_medium" be the md5 of this string? i.e. 73e5073b7420fb75abecb3d48525683c

No. Your version should work.

DJ-DotSoft (Author) commented:

Ahh. OK, thanks.

I noticed there are some changes since v1.9.7, so I will upgrade and see if that fixes it. Unless you have any other ideas?

AlexxIT (Owner) commented Jan 13, 2025

Splitting a single input into multiple outputs is a complex task; it has not been explored yet.

DJ-DotSoft (Author) commented:

OK, probably best for me to leave multiple outputs for now then.

> How do you plan to use output 1 and output 2?

Output 1 will be used for record and output 2 for display and detection (in Frigate):

cameras:
  Garden:
    enabled: true
    ffmpeg:
      inputs:
        - path: rtsp://127.0.0.1:8554/Garden
          roles:
            - record
        - path: rtsp://127.0.0.1:8554/Garden_medium
          roles:
            - detect          

This means Frigate doesn't have to decode the full resolution for detection and reduces the CPU by quite a lot.
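
In case it is useful, here are the matching detect settings so Frigate analyses the sub-stream at its native size (the values assume the 1280x720 sub-stream above; the fps is just an example):

cameras:
  Garden:
    detect:
      width: 1280
      height: 720
      fps: 5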

AlexxIT (Owner) commented Jan 26, 2025

I got the camera you recommended. V4L2 now supports a bunch of new codecs.

DJ-DotSoft (Author) commented:

Just looked at the latest commit. From what I understand you can now do:

Camera1:
  - v4l2:device?video=/dev/video0&input_format=h264&video_size=2560x1440

...instead of:

Camera1:
  - ffmpeg:device?video=/dev/video0&input_format=h264&video_size=2560x1440

Is this correct?

As you said, the v4l2 source will perform better than the ffmpeg source; however, even with 3 USB cameras, performance was never an issue when no transcoding or timestamping is happening.

I think for my situation I should investigate the named pipe approach (which you have mentioned in other discussions, but I have not seen any examples of). If that doesn't perform well enough, I could write a Python script using OpenCV that outputs to two named pipes (with Python/OpenCV it would be possible to really optimise things - e.g. the timestamp bitmap only needs generating once for all the cameras, and only once per second). If this sounds like the best way to you, do you have any examples using named pipes? Roughly, I am imagining something like the sketch below.
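
A rough idea of the layout (untested; the FIFO paths are made up, and it assumes go2rtc's exec pipe mode can consume a raw H264 bitstream from stdout, which I would need to verify):

# create two FIFOs and feed them from a single ffmpeg:
# main stream copied untouched, sub-stream scaled and re-encoded
mkfifo /tmp/cam1_main /tmp/cam1_sub
ffmpeg -y -f v4l2 -input_format h264 -video_size 2560x1440 -framerate 30 -i /dev/video0 \
  -map 0:v -c:v copy -f h264 /tmp/cam1_main \
  -map 0:v -vf scale=1280:720 -c:v h264 -f h264 /tmp/cam1_sub

...with go2rtc reading each pipe via an exec source:

streams:
  Camera1:
    - exec:cat /tmp/cam1_main
  Camera1_sub:
    - exec:cat /tmp/cam1_sub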

I'm glad your USB camera works! I have had quite a few working for nearly a year, so they are reliable. One thing I have never got working is the audio - if you configure audio, the camera will stop working after some time (minutes to hours). It fails on Windows too (when used as a webcam), so I am pretty sure it is the camera. I have not tried with go2rtc though...

AlexxIT (Owner) commented Jan 28, 2025

This optimisation probably won't help in your case, but for other users it will avoid launching ffmpeg unnecessarily.
