API overhaul #26

scottlamb · 2021-06-11T19:06:53Z

for #4

I think this is a lot more straightforward to use. It's also faster. Looking at the benchmarks, throughputs in GiB/s:

                                  |---commit---|
                                  orig api  rbsp

parse_annexb/onepush_null         8.8  8.5  9.2
parse_annexb/chunksize1440_null   8.0  7.8  8.2
parse_annexb/chunksize184_null    6.2  6.0  6.3

parse_annexb/onepush_rbsp         4.6  4.8  8.3
parse_annexb/chunksize1440_rbsp   4.4  4.4  4.2
parse_annexb/chunksize184_rbsp    4.0  4.0  6.6

parse_annexb/onepush_parse        3.6  3.6  6.2
parse_annexb/chunksize1440_parse  5.3  5.3  5.9
parse_annexb/chunksize184_parse   4.1  4.1  4.6

commits:
orig: 744582e
api:  6db563f
rbsp: cdc1da0

BitRead::read_unary1 is faster than count_zeros. Add test for overflow.

* make the bit reader type take a BufRead rather than a slice so we don't have to keep a buffered copy of the RBSP. * reduce "stuttering" by taking the module name out of the struct name. * use a trait so there's less type bounds to deal with in callers. * take a name in all BitReader operations. This will improve error messages and trace logs/println debugging.

My goal is to establish a good baseline for performance impact of upcoming API changes. * include a complete usage of NalSwitch, RbspDecoder, and NAL parsers. The slice header parse will stop decoding RBSP after it's gotten a full slice header. * parse in one push, 184-byte pushes (like MPEG-TS), and 1440-byte pushes (~typical for RTP). RTP doesn't even use Annex B, but it needs RTSP decoding and NAL parsing.

I haven't removed RbspDecoder or adjusted decode_nal yet, and the code is a little ugly as a result. Seems to work though.

It now more closely matches NalAccumulator's interface. Getting ready to plug that in.

No more need to deal with a user context, Box<RefCell<>>, or separate traits for the various handlers. In the simplest case, just a closure will do. This is essentially performance-neutral by itself. It allows RBSP parsing to be lazy though, and after the next commit that will pretty significantly speed up the case where slice NALs are processed in a single push.

* remove RbspDecoder in favor of ByteReader * return ErrorKind::InvalidData on illegal byte sequences. This interface is straightforward now with the std::io interface. I held off until removing the RbspDecoder interface because doing it there was ugly. * re-implement decode_nal on top of ByteReader. behavior change: it now strips the NAL header byte, which I think makes it easier to use. Also replace the unit test with a doctest to better explain what it does. * don't look as far ahead for the next zero byte. This speeds things up in general but particularly the case where a push has a full slice NAL and we're only interested in the header.

* return error rather than panic for unimplemented B frame * set limits for things passed to Vec::with_capacity

scottlamb mentioned this pull request Jun 11, 2021

Create simpler facades for decoding steps #4

Open

scottlamb force-pushed the pr-api branch 6 times, most recently from f04a8c3 to 3b0871f Compare June 12, 2021 01:26

scottlamb added 9 commits June 23, 2021 14:35

speed up read_ue and read_se

ac87074

BitRead::read_unary1 is faster than count_zeros. Add test for overflow.

add rbsp::ByteReader

69cc7a0

I haven't removed RbspDecoder or adjusted decode_nal yet, and the code is a little ugly as a result. Seems to work though.

Nal abstraction, push parser helper

9f7c8a9

simplify NalReader interface

4f0465b

It now more closely matches NalAccumulator's interface. Getting ready to plug that in.

fix problems found in fuzz testing

940fca4

* return error rather than panic for unimplemented B frame * set limits for things passed to Vec::with_capacity

scottlamb force-pushed the pr-api branch from 3b0871f to 940fca4 Compare June 23, 2021 22:36

scottlamb changed the title ~~DRAFT: push parser helper to accumulate NALs~~ API overhaul Jun 23, 2021

scottlamb marked this pull request as ready for review June 23, 2021 22:38

dholroyd merged commit 6778a7f into dholroyd:master Dec 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API overhaul #26

API overhaul #26

scottlamb commented Jun 11, 2021 •

edited

Loading

API overhaul #26

API overhaul #26

Conversation

scottlamb commented Jun 11, 2021 • edited Loading

scottlamb commented Jun 11, 2021 •

edited

Loading