
pageserver: support 1 million relations #9516

Open
13 tasks
jcsp opened this issue Oct 25, 2024 · 0 comments

jcsp commented Oct 25, 2024

We do not currently define a maximum number of relations that we support, but it is known that things get dicey beyond about 10k relations. The exact set of issues is unknown, but the primary architectural issue is that we store RelDirectory as a monolithic blob that gets rewritten whenever we add or remove a relation.
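To see why the monolithic blob is the bottleneck, here is a small illustrative sketch (plain Python, not Neon code; the per-entry size is an assumption): rewriting the whole serialized directory on every relation create makes cumulative bytes written grow as O(N^2), whereas writing one key per relation stays O(N).

```python
# Illustrative sketch, not Neon code. ENTRY_BYTES is an assumed
# serialized size per relation entry.
ENTRY_BYTES = 16

def monolithic_bytes_written(n_relations: int) -> int:
    # Each create rewrites the whole blob: 1 entry, then 2, ..., then n.
    return sum(i * ENTRY_BYTES for i in range(1, n_relations + 1))

def per_entry_bytes_written(n_relations: int) -> int:
    # A per-relation key writes exactly one entry per create.
    return n_relations * ENTRY_BYTES

print(monolithic_bytes_written(10_000))  # 800_080_000 (~800 MB rewritten)
print(per_entry_bytes_written(10_000))   # 160_000 (160 KB)
```

Even at only 10k relations the monolithic scheme rewrites hundreds of megabytes cumulatively, which is consistent with things getting "dicey" well before 1 million.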

Postgres itself does not define a practical limit on relations per database: the hard limit is approximately one billion, but the practical limit is well known to be much lower, and depends on hardware and configuration.

To pick an arbitrary but realistic goal, let's support+test 1 million tables. This is realistic because:

  • Something like an array of relation sizes is only single-digit megabytes with a million tables (whereas with a billion tables, such structures would likely need to be disk-based rather than simple in-memory structures)
  • If we can create a few thousand tables per second, then a test that creates a million tables can run in minutes, not hours (i.e. within the envelope of what our CI supports)

A tiny initial step in this direction is #9507, which adds a test that creates 8000 tables (not very many!) to reproduce a specific scaling bug in transaction aborts. That test currently has a relatively long runtime (tens of seconds) because our code for tracking timeline metadata is still very inefficient.

The goal is to make it work "fast enough", in the sense that a database is usable and things don't time out, but not necessarily to implement every possible optimisation. For example, logical size calculations will be expensive with 1 million relations (requiring many megabytes of reads from storage), and that is okay as long as the expense does not cause the system to fail from the user's point of view.

  • Replace RelDir with a more scalable alternative, such as using a sparse keyspace a la aux files
  • Instead of interleaving relation sizes with data, store them in their own region of the keyspace (or even combine them into the new storage for RelDir). Eliminate imitate_logical_size, and update eviction logic to intentionally retain recent layers rather than retaining them accidentally as a side effect of logical size calculation
  • Test creating and dropping 1 million relations with this more scalable store. We should see that runtime is reasonable (minutes), and memory+disk space used is O(N) with table count (not O(N^2))
  • Implement code for rewriting metadata to the new format on startup. This should run very early during startup so that no other parts of the code need to understand the old format: we can then maintain this migration for a long time, rather than carrying two read paths everywhere else.
  • Automated tests that have a snapshot of an old-style tenant and use it to continuously check that our latest code can still load such a tenant (extend test_historic_storage_formats)
  • Asymptotic scaling: measure scaling of relation creation/drop & compute startup with relation count, and ensure it is at most O(N)
  • A much broader test for high relation counts: ensure that compaction runtimes are as expected, and that logical replication works as expected. This will need to be written in collaboration with the compute team to cover the appropriate gamut of Postgres features. If it encounters unforeseen issues, spin those off into other GitHub issues: this Epic is for the storage support for high table counts.
  • pageserver: slow get_rel_exists() during WAL ingestion with many relations #9855
  • Benchmark creating many tables and databases #9986
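The first checklist item, replacing the RelDir blob with per-relation keys in a sparse keyspace, can be sketched as follows. This is a hypothetical model in plain Python, not the pageserver's actual key encoding: the point is that create, drop, and existence checks each touch O(1) keys instead of deserializing and rewriting an O(N) blob.

```python
# Hypothetical sketch of a sparse per-relation keyspace (not Neon's
# real encoding). Each relation gets its own key; a real store would
# back this with layered storage, with drops becoming tombstones.
from typing import Dict, Tuple

RelTag = Tuple[int, int, int, int]  # (spcnode, dbnode, relnode, forknum)

class SparseRelStore:
    def __init__(self) -> None:
        self._kv: Dict[RelTag, int] = {}  # rel tag -> size in blocks

    def create(self, rel: RelTag) -> None:
        self._kv[rel] = 0              # writes one key

    def drop(self, rel: RelTag) -> None:
        self._kv.pop(rel, None)        # deletes one key

    def exists(self, rel: RelTag) -> bool:
        return rel in self._kv         # point lookup, no blob to parse

    def set_size(self, rel: RelTag, blocks: int) -> None:
        self._kv[rel] = blocks

store = SparseRelStore()
rel = (1663, 5, 16384, 0)
store.create(rel)
store.set_size(rel, 128)
print(store.exists(rel))  # True
```

Under this shape, creating N relations performs N single-key writes, so both runtime and space stay O(N) with table count, matching the scaling target in the checklist.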

Out of scope:

  • High database counts (Neon cloud already limits databases per project to 500 by default)
  • Revising pg_stat (Persist pg_stat information in pageserver #6560 ) code to handle large relation counts (current code skips writing pg_stat if the snapshot exceeds a size threshold)
  • Any postgres CLI/tooling issues around high relation counts
@jcsp jcsp added c/storage/pageserver Component: storage: pageserver t/feature Issue type: feature, for new features or requests labels Oct 25, 2024
@skyzh skyzh self-assigned this Jan 6, 2025
github-merge-queue bot pushed a commit that referenced this issue Jan 13, 2025
## Problem

In preparation for #9516. We need to store rel size and directory data
in the sparse keyspace, but it does not support inheritance yet.

## Summary of changes

Add a new type of keyspace "sparse but inherited" into the system.

On the read path: we don't remove the key range when we descend into the
ancestor. The search stops when (1) the full key range is covered by
image layers (already implemented), or (2) we reach the end of the
ancestor chain.

---------

Signed-off-by: Alex Chi Z <[email protected]>
github-merge-queue bot pushed a commit that referenced this issue Jan 20, 2025
## Problem

Part of #9516 per RFC at #10412

## Summary of changes

Adding the necessary config items and index_part items for the large
relation count work.

---------

Signed-off-by: Alex Chi Z <[email protected]>
awarus pushed a commit that referenced this issue Jan 24, 2025