Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transactional MlsProvider #327

Merged
merged 13 commits into from
Nov 15, 2023
Merged

Transactional MlsProvider #327

merged 13 commits into from
Nov 15, 2023

Conversation

insipx
Copy link
Contributor

@insipx insipx commented Nov 10, 2023

There are a few key changes in this PR. There are also some things that definitely could be better, but overall I think it's good. I'm happy to accept feedback if there are things that should be better

  • Most of our calls to EncryptedMessageStore which interact with the database don't need '&self', and carrying this around pollutes our structs/method calls even though the types are not tightly coupled. Removing this cleaned some things up, as well as makes those calls clearer. The stores purpose now is mostly to hand out connections to things that need it
  • SqlKeyStore now carries around a &mut DbConnection. We need this to be mutable b/c of diesels transaction design, which strictly requires a mutable reference. Transactions are started with XmtpOpenMlsProvider, which provides a provider that is able to access the underlying connection with provider.conn().borrow_mut()
  • The underlying connection is wrapped in a RefCell. This is mostly for performance/remain close to the DbConnection !Sync bound, since we're doing mostly single-threaded work. I thought it was OK to make it a RefCell at first anyway, since making it a Mutex isn't so different.
  • The transaction that is started is the default one on the Connection trait. We can control where/how the database locks to some extent, however, through more specific Sqlite Transaction methods: methods

In a follow up PR, it might be good to:

  • Restrict usage of conn() method to only be available within a transaction via a XmtpOpenMlsProvider wrapper type. This makes it much more difficult to run into borrow errors resulting from the underlying RefCell type. And, if using a mutex, run into deadlocks. For now we should be extra careful about borrowing the underlying connection.
  • We could add some level of error handling to our transaction method to clean up matching/continuing on the errors we see where the transaction code would go into today (I put a todo in for this)

@insipx insipx changed the title proto scaffolding for just using a Mutex on SqliteConnection Transactional MlsProvider Nov 10, 2023
@insipx insipx marked this pull request as ready for review November 15, 2023 17:47
@insipx insipx requested a review from a team November 15, 2023 17:47
.into_iter()
.map(|stored_group| MlsGroup::new(self, stored_group.id, stored_group.created_at_ns))
.collect())
Ok(EncryptedMessageStore::find_groups(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm perfectly fine with making all the store methods static, given how few of them ever need to call another store method.

There's probably a case to make that these should just be methods on the models Groups::find instead of EncryptedMessageStore::find_groups. Probably more a matter of taste. Having them all be methods on one big store does limit the number of imports needed, which is simpler.

Good to leave as is setup in this PR if everyone else feels the same way.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't have a problem reverting back if there are strong feelings around it, but it does clean up some of the fields in structs as well

I'm not sure how much more the store will grow, but considering we already have most of the methods within their model files, it shouldn't be difficult to refactor if we decide that pulling in all the methods is too much down the line

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good to leave it for now, and we can re-open this discussion if the EncryptedMessageStore keeps growing out of control and becomes unwieldy.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was actually wondering if these made sense as non-static methods on the connection. Every DB operation requires a connection anyway. But happy with this too

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's what Wire does and the ergonomics actually seem pretty nice to me

Copy link
Contributor Author

@insipx insipx Nov 15, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Essentially we would then just be wrapping diesel in a NewType and abstracting it away which I would generally be in favor of. I wasn't trying to re-do the design too much here, just figured making these methods static makes it easier to think of the store as just something that provides connections and makes db operations, without having to worry about where to access it

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can exp with this tonight

@insipx insipx requested a review from richardhuaaa November 15, 2023 18:04
/// provider.conn().borrow_mut()
/// })
/// ```
pub fn transaction<T, F, E>(connection: &mut DbConnection, fun: F) -> Result<T, E>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I could see this being very useful. Smart addition.

@neekolas
Copy link
Contributor

Restrict usage of conn() method to only be available within a transaction via a XmtpOpenMlsProvider wrapper type. This makes it much more difficult to run into borrow errors resulting from the underlying RefCell type. And, if using a mutex, run into deadlocks. For now we should be extra careful about borrowing the underlying connection.

This sounds fine. My feeling is that we are going to want anything that touches the SqlKeyStore to be transactional anyways, since openmls does so much deleting of key material under the hood. There are a few cases like load_mls_group that are read only, but that feels like the exception and not the rule.

Copy link
Contributor

@richardhuaaa richardhuaaa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! I'm curious about what the tradeoff is of doing it this way with dispensable keystores versus having a long-lived keystore that always uses the same connection, but this works

@richardhuaaa
Copy link
Contributor

richardhuaaa commented Nov 15, 2023

Restrict usage of conn() method to only be available within a transaction via a XmtpOpenMlsProvider wrapper type.

  1. Are we basically trying to enforce there is only ever one DB connection at a time? Or do we envision multiple XmtpOpenMlsProvider existing simultaneously?
  2. Spitballing, one possible way is to move all of the DB methods find_groups, etc into class methods on a struct that holds a connection inside of itself. And only create/return that struct from XmtpOpenMlsProvider

@insipx
Copy link
Contributor Author

insipx commented Nov 15, 2023

I'm curious about what the tradeoff is of doing it this way with dispensable keystores versus having a long-lived keystore that always uses the same connection, but this works

I think either way, we'd have to have some function that starts a transaction with a mutable connection in a key store in order to remain scoped to one conn/provider. Quickly thinking through it, but we could definitely add a layer that hands out transactional key stores, if we decide it would be helpful. If we're dealing with one &mut reference, the lifetimes and borrow errors should be easier to deal with if we don't require things to live for too long. It's still possible to do, the tricky part is just satisfying diesels transaction reqs.

@insipx
Copy link
Contributor Author

insipx commented Nov 15, 2023

Are we basically trying to enforce there is only ever one DB connection at a time? Or do we envision multiple XmtpOpenMlsProvider existing simultaneously?

Not necessarily one db connection, but just one connection in one transaction.

Currently, it's easy to use the connection on provider rather than requesting a new connection from the pool if there's any provider type around. If there is lots of nested code using this conn, then it's easy to run into BorrowMut Errors at runtime b/c of the RefCell. In order to enforce only using the connection from provider when in the transaction, I was testing out adding a new struct, TransactionalMlsProvider that wraps XmtpOpenMlsProvider, but with the conn fn on it that allows accessing the underlying connection. then the transaction callback would pass this wrapper rather than Self. However, this might warrant some redesign in methods we already have that use the provider

I ran into some issues in Identity::new this morning for instance, where the tests only passed when using the same connection as provider. If i tried changing StoredIdentity::from to use a new conn from store, I would get a "this is not a db file" error. Not completely sure where it came from, but might just be an issue with the test database setup

Spitballing, one possible way is to move all of the DB methods find_groups, etc into class methods on a struct that holds a connection inside of itself. And only create/return that struct from XmtpOpenMlsProvider

I do think it would be worth looking into this, it might make things much cleaner and easier to handle w.r.t to the lifetime of the DbConnection. Might have to be careful of pulling too many DbConnections at once. Although i'm not sure what you mean by returning the struct from XmtpOpenMlsProvider. Would this create an interface encapsulating each database item? i.e

provider.groups().find_groups()

Or something else?

@insipx insipx merged commit b0f00f9 into main Nov 15, 2023
5 checks passed
@insipx insipx deleted the transactional-openmlsprovider branch November 15, 2023 21:00
@richardhuaaa
Copy link
Contributor

the tests only passed when using the same connection as provider

Do you know if the tests were using an ephemeral or persistent DB store? For ephemeral store, there is a pool size of 1, because any additional connections seem to make a new in-memory store rather than writing to the same one.

Would this create an interface encapsulating each database item?

Something like this - it might be a struct called something like ValidatedConnection, it has one field which is the connection. It has multiple methods like find_groups, get_last_synced_topic_timestamp, etc - no connection param needed on these methods. ValidatedConnection is provided to the callback used in the transaction() method in XmtpOpenMlsProvider and is not available anywhere else

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants