store: track HashPosition for first and last elements #134

dktapps · 2023-11-13T13:38:20Z

this produces a huge performance improvement for queues with large internal tables.

an internal table of large size may appear if the array had lots of elements inserted into it and later deleted. this resulted in major performance losses for the reader of the elements, as zend_hash_internal_pointer_reset_ex() had to scan through many IS_UNDEF offsets to find the actual first element.

there are two ways to attack this problem:

reallocate the internal table as elements are deleted to reduce the internal table size - this proved to be relatively ineffective 2) track the start and end of the hashtable to avoid repeated scans during every shift() call - this is the approach taken in this commit, and provides major performance benefits

the test case written in #42 now runs to completion substantially faster, without any performance degradation.

more tests are needed to ensure that this works fully as intended, but I chose to take the safe route with invalidating vs updating the offsets, so I think it should be good.

this produces a huge performance improvement for queues with large internal tables. an internal table of large size may appear if the array had lots of elements inserted into it and later deleted. this resulted in major performance losses for the reader of the elements, as zend_hash_internal_pointer_reset_ex() had to scan through many IS_UNDEF offsets to find the actual first element. there are two ways to attack this problem: 1) reallocate the internal table as elements are deleted to reduce the internal table size - this proved to be relatively ineffective 2) track the start and end of the hashtable to avoid repeated scans during every shift() call - this is the approach taken in this commit, and provides major performance benefits the test case written in #42 now runs to completion substantially faster, without any performance degradation. more tests are needed to ensure that this works fully as intended, but I chose to take the safe route with invalidating vs updating the offsets, so I think it should be good.

…are no elements ... as well as modifying the HashPosition and potentially borking it

this caused elements to be removed in the wrong order, or not to be removed at all.

dktapps added 6 commits November 13, 2023 13:36

cleanup, fix invalidation on every property write even if appended

1e27571

Apparently move_forward and move_backwards return success when there …

ee44844

…are no elements ... as well as modifying the HashPosition and potentially borking it

Fixed shift, pop and chunk misbehaving after table resize

b16e326

this caused elements to be removed in the wrong order, or not to be removed at all.

change back var names

0a45229

remove more debug changes

a16c9d5

dktapps merged commit b2b6100 into fork Nov 15, 2023
40 checks passed

dktapps deleted the track-first-last branch November 15, 2023 16:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

store: track HashPosition for first and last elements #134

store: track HashPosition for first and last elements #134

dktapps commented Nov 13, 2023

store: track HashPosition for first and last elements #134

store: track HashPosition for first and last elements #134

Conversation

dktapps commented Nov 13, 2023