Added a decode+resize benchmark and cuda decoder #378
Conversation
import torchvision  # noqa: F401
from torchvision.transforms import v2 as transforms_v2

self.torchvision = torchvision
I don't think we actually use self.torchvision? We should be able to remove it.
Good point. Done.
@@ -535,6 +595,8 @@ def run_benchmarks(
    results = []
    df_data = []
    verbose = False
    # TODO: change this back before landing.
    min_runtime_seconds = 0.1
I have an unmerged change in PR #362 to make this sort of benchmark testing easier: https://github.com/pytorch/torchcodec/pull/362/files#diff-c378bd5d03e7daa116eaeeeb86921e8134f0feefb0fcdf9f020f872676c5c00dR31-R34
frame = next(reader)
frames.append(frame["data"].permute(1, 2, 0))
frames = [frame.to(device) for frame in frames]
frames = self.transforms_v2.functional.resize(frames, (height, width))
Naive question that applies to all implementations that use the transformation: how do we ensure it's done on the GPU?
Realized after walking away from my laptop that it's controlled by where the data lives, not by some parameter on the transform. So to answer my own question: it happens in the frame.to(device) call.
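The device-placement point can be sketched with plain torch ops. This is a minimal illustration, not the benchmark's actual code: torch.nn.functional.interpolate stands in for torchvision's resize so the snippet stays self-contained, but both execute wherever their input tensor lives.

```python
import torch
import torch.nn.functional as F

# Pick the GPU when available; the identical code runs on CPU otherwise.
device = "cuda" if torch.cuda.is_available() else "cpu"

# A stand-in for one decoded frame: a C x H x W uint8 tensor.
frame = torch.zeros(3, 270, 480, dtype=torch.uint8)

# This .to(device) call is what decides where the resize runs:
# the resize op itself takes no device parameter and simply
# executes on the device its input tensor lives on.
frame = frame.to(device)
resized = F.interpolate(
    frame.unsqueeze(0).float(), size=(135, 240), mode="bilinear"
)
print(resized.shape, resized.device)
```

So in the benchmark, moving the frames with frame.to(device) before calling resize is what makes the transform run on the GPU.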
Correct.
Ideally we would also update the README description, since once this PR is merged, the chart is live. But we can also do that in a follow-up. That is, it's a 22-core Linux system with whatever kind of GPU.
I tweaked the README.md file as well. Later on we can put the machine info in the chart itself so it doesn't have to be kept up-to-date manually.
Benchmark results show the CUDA decoder is faster than the CPU decoder in the dataloader-style benchmark.