-
Notifications
You must be signed in to change notification settings - Fork 228
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sliding Sync: Avoid fetching left rooms and add back newly_left
rooms
#17725
Sliding Sync: Avoid fetching left rooms and add back newly_left
rooms
#17725
Conversation
# Expect to see room1 because it is `newly_left` thanks to being state reset out | ||
# of it since the last time we synced. We need to let the client know that | ||
# something happened and that they are no longer in the room. | ||
self.assertIncludes( | ||
set(response_body["rooms"].keys()), {space_room_id}, exact=True | ||
) | ||
# We set `initial=True` to indicate that the client should reset the state they | ||
# have about the room | ||
self.assertEqual(response_body["rooms"][space_room_id]["initial"], True) | ||
# Empty `required_state` to indicate that the client should clear their state | ||
# (as the user is no longer in the room) | ||
self.assertIsNone( | ||
response_body["rooms"][space_room_id].get("required_state"), | ||
response_body["rooms"][space_room_id], | ||
) | ||
# Where the state reset happened | ||
self.assertEqual( | ||
response_body["rooms"][space_room_id]["bump_stamp"], | ||
join_rule_event_pos.stream, | ||
response_body["rooms"][space_room_id], | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How do we actually expect a room that you are state reset out of to look when it comes down Sliding Sync?
This is just based off of my best guess in the comments before. The main thing is initial: True
and required_state
that doesn't include your membership to try to indicate to the client that the new state doesn't include you anymore. But there is an edge case with that if you're not requesting your own membership event, then you wouldn't expect it to be missing.
This is better than before though so we could punt until the MSC figures that out more 🤷
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is the best we can do for now, and we should deal with this question in the MSC. I don't see any problem with compatibility with landing this as it is
We already don't fetch self-leaves and add back `newly_left`
# TODO: It would be nice to avoid these copies | ||
room_membership_for_user_map = dict(room_membership_for_user_map) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it crazy to introduce something like a SoftDeleteChainMap
? Where we can insert sentinel values on map[0]
whenever we want to "delete" something from the map and avoid copying the immutable room_membership_for_user_map
.
The Python docs encourage making your own ChainMap
subclass variations like DeepChainMap
.
Can also just tackle this in a follow-up PR.
SoftDeleteChainMap
first draft
class Sentinel(Enum):
# defining a sentinel in this way allows mypy to correctly handle the
# type of a dictionary lookup and subsequent type narrowing.
UNSET_SENTINEL = object()
class SoftDeleteChainMap(ChainMap):
"""
Variant of ChainMap that allows deleting keys without affecting the underlying maps (maps[1:]).
Only adds sentinel values to the top-most mutable map[0].
aka `DeletableChainMap`, `NullableChainMap`
"""
def __getitem__(self, key):
for mapping in self.maps:
try:
value = mapping[key] # can't use 'key in mapping' with defaultdict
if value is Sentinel.UNSET_SENTINEL:
return self.__missing__(
key
) # support subclasses that define __missing__
return value
except KeyError:
pass
return self.__missing__(key) # support subclasses that define __missing__
def get(self, key, default=None):
if key in self:
value = self[key]
if value is Sentinel.UNSET_SENTINEL:
return default
return value
else:
return default
def __delitem__(self, key):
self.maps[0] = Sentinel.UNSET_SENTINEL
def popitem(self):
raise NotImplementedError("popitem is not supported")
def pop(self, key, *args):
"""
Remove specified key and return the corresponding value.
If the key is not found, return the default if given; otherwise, raise a
KeyError.
"""
if key not in self:
if len(args) == 0:
raise KeyError(key)
# Return the default if given
return args[0]
value = self[key]
if value is Sentinel.UNSET_SENTINEL:
if len(args) == 0:
raise KeyError(key)
# Return the default if given
return args[0]
del self[key]
return value
def __iter__(self):
# TODO: Fix/implement this one
d = {}
for mapping in map(dict.fromkeys, reversed(self.maps)):
d |= mapping # reuses stored hash values if possible
return iter(d)
def __contains__(self, key):
return any(
(key in m and m[key] is not Sentinel.UNSET_SENTINEL) for m in self.maps
)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I think that could make a lot of sense. Annoying that we have to roll our own
response_body, _ = self.do_sync(sync_body, since=from_token, tok=user1_tok) | ||
|
||
if self.use_new_tables: | ||
if server_leaves_room: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This if-tree
is a bit messy but the test setup is so similar I think I'd rather have this than duplicating the test a bunch.
The extra self.use_new_tables
layer will disappear in the future
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Woo, thanks!
…ing-sync-add-back-newly-left
Thanks for the review @erikjohnston 🐅 |
Performance optimization: We can avoid fetching rooms that the user has left themselves (which could be a significant amount), then only add back rooms that the user has
newly_left
(left in the token range of an incremental sync). It's a lot faster to fetch less rooms than fetch them all and throw them away in most cases. Since the user only leaves a room (or is state reset out) once in a blue moon, we can avoid a lot of work.Based on @erikjohnston's branch, develop...erikj/ss_perf
Todo
SoftDeleteChainMap
, Sliding Sync: Avoid fetching left rooms and add backnewly_left
rooms #17725 (comment)Pull Request Checklist
EventStore
toEventWorkerStore
.".code blocks
.(run the linters)