Discussion on Database plans/ORM #10

adenner · 2022-11-20T04:25:24Z

For ease of development, any objections to using Entity Framework and SQLite? Using EF would make it far easier to switch to a "real" database like MySQL, MSSQL, et al. later.

benrick · 2022-11-21T00:04:33Z

No objections at all. In fact, I was going to recommend we start with SQLite during development regardless of how we store data long-term. In-memory and SQLite have basically no up-front cost or setup, which is perfect for us, since we don't have dev containers, etc. configured yet.

kirkbrauer · 2022-11-22T01:39:10Z

I think long-term we should probably look into PostgreSQL and Npgsql + EF Core. I'm currently using this in production, and it's a super powerful solution for most general use cases. However, SQLite is a great place to start development, single file DB for the win!

edmistond · 2022-11-22T02:07:17Z

I'll reiterate my somewhat more ephemeral Mastodon comments here - I think SQLite's great for getting some code down to start, but I'm not sure it would work well if we need to implement background workers outside the main process... which, just based on what I know so far about ActivityPub and Mastodon generally, I am assuming we will.

I agree that Postgres is a great choice for long-term, though. I wouldn't want to tie this to Azure (or AWS) since I suspect that would be a disincentive for a lot of folks, which kind of argues against something like CosmosDB.

benrick · 2022-11-22T04:42:08Z

Yes, I think SQLite is a good starting point, but the background workers will require some kind of storage that isn't as tied to the file system.

And I agree that we don't want to add anything that requires any specific provider (Azure, AWS, etc.). Someone should be able to host this on a PC in their house if they want to (no idea why they would).

adenner · 2022-11-22T13:09:54Z

Both Azure's Cosmos DB and AWS's Aurora have compatibility and support for Postgres in one way or another. For the homelab situation there is always k8s or Docker Compose.

kirkbrauer · 2022-11-22T14:46:52Z

@adenner there are also other options available, like Azure Database for PostgreSQL and Amazon RDS for PostgreSQL. Looking at the docs for Mastodon, it appears that they use Postgres and Redis for their own data store and cache.

Perhaps it would be a good idea to make this project schema-compatible with any original Mastodon Postgres databases?

Ruby on Rails has a similar ORM to EF Core, so it wouldn't be too difficult to replicate their migrations.

benrick · 2022-11-22T15:02:29Z

Yeah, I think there's a good chance Postgres will be our answer. Obviously, it's also what Mastodon uses.

benrick · 2022-11-22T15:03:52Z

Yeah, I was figuring that we'd either need to support a Mastodon database directly, or have some kind of conversion to/from it in order for us to be compatible.

benrick · 2022-11-22T16:12:18Z

Have a look at #18 @adenner and @kirkbrauer . I'm happy to have us start with SQLite or Postgres. Whichever someone wants to put in first, we can use.

kirkbrauer · 2022-11-22T16:14:34Z

@benrick you know what, the more I think about it, I think we should just start with Postgres. That would make our lives a lot easier and make it much easier to run regression tests comparing it to the original DB when ready.

edmistond · 2022-11-22T16:18:49Z

I'm fine with either. Postgres runs great in a Docker container for dev purposes so it shouldn't be a heavy lift to set it up.

benrick · 2022-11-22T16:19:00Z

Yeah @kirkbrauer , I was leaning the same way. I had Postgres installed in PR 18, but removed it before sending the PR. 😆

benrick · 2022-11-22T16:20:10Z

The other aspect of this is ORM discussion. I do like Entity Framework, but I'm thinking about how quick a lighter solution can be. Any of you work with Dapper?

edmistond · 2022-11-22T16:26:43Z

@benrick I've used Dapper on and off since it was a single file you could just pull into your project... some might say I've been doing this stuff for too long now. 😂 Big fan of it though, it made my life a lot easier.

I had kind of the same thought about performance; the one option I could see is taking a more CQRS-y approach where you have separate read/write EF contexts and all the reads go through AsNoTracking()... but that gets to be a lot to set up and maintain, and if I'm honest I feel like taking a CQRS approach also brings in MediatR and then you end up going down a huge rabbit hole of indirection. Nice library, just feels like overkill sometimes. :)
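For anyone following along, here is a minimal sketch of what the separate read/write contexts could look like, assuming EF Core is referenced. The `Status` entity, the two context names, and `StatusQueries` are hypothetical and only there for illustration; nothing here reflects the project's actual models.

```csharp
using System.Collections.Generic;
using System.Linq;
using System.Threading.Tasks;
using Microsoft.EntityFrameworkCore;

// Hypothetical entity, used only for illustration.
public class Status
{
    public long Id { get; set; }
    public string Text { get; set; } = "";
}

// Write side: default change tracking, so SaveChanges can detect edits.
public class WriteDbContext : DbContext
{
    public WriteDbContext(DbContextOptions<WriteDbContext> options) : base(options) { }

    public DbSet<Status> Statuses => Set<Status>();
}

// Read side: change tracking is switched off for every query on this context.
public class ReadDbContext : DbContext
{
    public ReadDbContext(DbContextOptions<ReadDbContext> options) : base(options) { }

    public DbSet<Status> Statuses => Set<Status>();

    protected override void OnConfiguring(DbContextOptionsBuilder optionsBuilder) =>
        optionsBuilder.UseQueryTrackingBehavior(QueryTrackingBehavior.NoTracking);
}

public static class StatusQueries
{
    // AsNoTracking() is redundant on ReadDbContext, but it keeps the intent
    // visible if the same query is ever pointed at a tracking context.
    public static Task<List<Status>> LatestAsync(ReadDbContext db, int count) =>
        db.Statuses
            .AsNoTracking()
            .OrderByDescending(s => s.Id)
            .Take(count)
            .ToListAsync();
}
```

Both contexts map the same table, so the split is purely about tracking behavior. It does mean two registrations and two sets of options to keep in sync, which is the maintenance cost mentioned above.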

benrick · 2022-11-22T16:31:41Z

Agreed. I've gone down that rabbit hole before. EF is good, but even with no tracking, it'll get you on performance versus a lighter approach.


kirkbrauer · 2022-11-22T16:33:36Z

@edmistond yeah, I totally feel you. My current CQRS implementation is nice, but we have a whole abstraction library on top of MediatR, which is another thing to maintain. The developer experience is amazing, but you need to make sure that the abstractions you build work for you and aren't too fine-grained.

@benrick EF Core these days has pretty good performance, and libraries like Ardalis.Specification are a really nice way to make your queries testable and reusable. I think the main reason to go with EF Core is that it will draw in the larger .NET community, because it's basically the de facto standard for most projects, even if it isn't the most performant.

adenner · 2022-11-22T17:38:35Z

There is something to be said for not prematurely optimizing. The other argument for EF Core is that its long-term support and viability are almost guaranteed.

devployment · 2022-11-27T18:35:34Z

I would love to see SQLite as a first-class citizen. It's always a hassle to be dependent on a certain DB engine for running such things, either on my home lab or on a really small VPS. Usually I don't want to deal with the things that come with a "proper" database.

If SQLite were a first-class citizen, I could imagine running Smilodon as my own or a small community instance, and using https://litestream.io/ for peace of mind to back up my database. No servers, no worries about how to back up my database server and whatnot.

I've never used Litestream, but I stumbled upon it while searching for alternatives to the traditional database solutions for a side project I'm currently researching.

kirkbrauer · 2022-11-27T18:37:52Z

@devployment yeah, SQLite is pretty nice. Since we're using EF Core, potentially we could shift to SQLite in the future. For the reference implementation, I went ahead and did it in PostgreSQL since it's easy to compare to the original Mastodon schema and has great support.

benrick · 2022-11-27T20:00:51Z

Yeah, we decided on Postgres to support existing Mastodon databases. In theory, we could support either/both.

benrick · 2022-11-27T20:20:11Z

@devployment here's an article that shows how you can set up both Postgres and SQLite:

https://blog.jetbrains.com/dotnet/2022/08/24/entity-framework-core-and-multiple-database-providers/
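As a rough sketch of the idea (not taken from the article), the provider can be picked from a configuration value at startup. This assumes the Microsoft.EntityFrameworkCore.Sqlite and Npgsql.EntityFrameworkCore.PostgreSQL packages are referenced; `DbProvider`, `ProviderConfig`, and the accepted setting values are hypothetical names.

```csharp
using System;
using Microsoft.EntityFrameworkCore;

public enum DbProvider { Sqlite, Postgres }

public static class ProviderConfig
{
    // Maps a configuration string to a provider. The accepted values are
    // hypothetical; anything unrecognized (including null) is rejected.
    public static DbProvider Parse(string? value) =>
        value?.Trim().ToLowerInvariant() switch
        {
            "sqlite" => DbProvider.Sqlite,
            "postgres" or "postgresql" or "npgsql" => DbProvider.Postgres,
            _ => throw new ArgumentException(
                $"Unknown database provider '{value}'.", nameof(value)),
        };

    // Applies the matching EF Core provider to the options builder.
    public static DbContextOptionsBuilder Apply(
        DbContextOptionsBuilder builder, DbProvider provider, string connectionString) =>
        provider switch
        {
            DbProvider.Sqlite => builder.UseSqlite(connectionString),
            DbProvider.Postgres => builder.UseNpgsql(connectionString),
            _ => throw new ArgumentOutOfRangeException(nameof(provider)),
        };
}
```

Switching the provider is the easy part. EF Core migrations are generated per provider, so supporting both would likely mean keeping a separate migration set for each.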

edmistond · 2022-11-28T04:04:53Z

I've been following Litestream with fascination and have wanted to find a good excuse to use it in a project for a long time. 😄

While I think the reference implementation with Postgres is the first priority, I'm hopeful that with the bulk of the database being tables and only having a small number of views/materialized views, porting shouldn't be overly difficult.

My bigger concern would be handling background jobs and the associated job queue. I believe we can embed background workers into the main web API application (I've never tried implementing this personally) but trying to handle an external queue like Redis/Rabbit for larger installs and some kind of internal one for single-file deployments might be a challenge.
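To make the internal-queue option a bit more concrete, here is a minimal sketch of an in-process job queue built on System.Threading.Channels. `IJobQueue` and `InProcessJobQueue` are hypothetical names, and this is only one way it could be done, not a proposal for the actual design.

```csharp
using System;
using System.Threading;
using System.Threading.Channels;
using System.Threading.Tasks;

// Hypothetical abstraction: callers only see EnqueueAsync, so the in-process
// implementation could later be swapped for a Redis- or RabbitMQ-backed one.
public interface IJobQueue
{
    ValueTask EnqueueAsync(Func<CancellationToken, Task> job, CancellationToken ct = default);
}

public sealed class InProcessJobQueue : IJobQueue
{
    private readonly Channel<Func<CancellationToken, Task>> _channel =
        Channel.CreateUnbounded<Func<CancellationToken, Task>>();

    public ValueTask EnqueueAsync(Func<CancellationToken, Task> job, CancellationToken ct = default) =>
        _channel.Writer.WriteAsync(job, ct);

    // Signals that no more jobs will be added, which lets RunAsync finish.
    public void Complete() => _channel.Writer.Complete();

    // Worker loop: runs jobs one at a time until the queue is completed
    // or the token is cancelled. A failing job is logged and skipped.
    public async Task RunAsync(CancellationToken ct = default)
    {
        await foreach (var job in _channel.Reader.ReadAllAsync(ct))
        {
            try
            {
                await job(ct);
            }
            catch (Exception ex)
            {
                Console.Error.WriteLine($"Job failed: {ex.Message}");
            }
        }
    }
}
```

In an ASP.NET Core app, `RunAsync` could be called from the `ExecuteAsync` method of a `BackgroundService`, passing the stopping token. Jobs held this way live only in memory and are lost on restart, so a durable store would still be needed for anything that must survive a crash.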

benrick added the question Further information is requested label Nov 21, 2022

kirkbrauer mentioned this issue Nov 25, 2022

Add models and initial migration for database #24

Merged

11 tasks
