Add ElastiCache L2 RFC #464

dbartholomae · 2022-11-13T18:21:35Z

This is a request for comments about an RFC to add L2 constructs to ElastiCache. See #456 for
additional details.

APIs will be signed off by @corymhall.

By submitting this pull request, I confirm that my contribution is made under
the terms of the Apache-2.0 license.

corymhall

This looks really great! I have a couple of questions just to clarify some
things, but I think the implementation looks pretty solid.

Can you also add a more advanced example for both types of replication groups?
Seeing how all the properties play together would help.

corymhall · 2022-11-16T17:03:21Z

text/0456-elasticache-l2.md

+
+## Constructs
+
+### Defining a RedisReplicationGroup


There are a couple of things that I think we need to explore:

- `UseOnlineResharding` seems to have a lot to take into consideration.
  Should this be enabled by default? If so, how should we handle some of these?
- `NumNodeGroups` & `NodeGroupConfiguration`: the docs state that "As a best
  practice, when you create a replication group in a stack template, include
  an ID for each node group you specify". It seems like we should require the
  ID if they specify a configuration (or maybe we can generate one).
- `NodeGroupConfiguration.Slots`: "When you use an UseOnlineResharding update
  policy to update the number of node groups without interruption, ElastiCache
  evenly distributes the keyspaces between the specified number of slots. This
  cannot be updated later. Therefore, after updating the number of node groups
  in this way, you should remove the value specified for the Slots property of
  each NodeGroupConfiguration from the stack template, as it no longer reflects
  the actual values in each node group." (docs)

I lack a deep enough understanding of ElastiCache to make the best trade-offs here. E.g. for UseOnlineResharding, there is a warning that you should update NumNodeGroups and NodeGroupConfiguration only in isolation, which might not be obvious when using AWS CDK.

I agree on making IDs required and will add it to the RFC. Seems to be a no-brainer.

The last point also speaks against setting UseOnlineResharding by default.

After looking into this, I would actually suggest that we make useOnlineResharding a required property for now. This way, introducing a default value in a later PR is a non-breaking change that can be done with close support from someone with deep ElastiCache understanding.
If you have a deep enough understanding of ElastiCache (or can bring in an expert who does), I'm also open to just implementing what you discussed. Though I would prefer not to delay the implementation too much because of this.

Just saw that NodeGroupConfiguration is currently marked as a prop that is not part of the first implementation anyway, so I would only add the note on making the ID required if we actually go deeper into this part. Otherwise, I would leave this for a future RFC or implementation discussion.
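For the "maybe we can generate one" option raised above, ID generation could be quite small. The following is only a sketch: the helper name `generateNodeGroupIds` is hypothetical, and the zero-padded four-digit format is an assumption of this example, not something the RFC specifies.

```typescript
// Hypothetical helper: derive one ID per node group from its position.
// The zero-padded 4-digit format is an assumption made for this sketch.
function generateNodeGroupIds(numNodeGroups: number): string[] {
  if (Math.floor(numNodeGroups) !== numNodeGroups || numNodeGroups < 1) {
    throw new Error("numNodeGroups must be a positive integer, got " + numNodeGroups);
  }
  const ids: string[] = [];
  for (let index = 1; index <= numNodeGroups; index++) {
    // Left-pad with zeros so that IDs are stable across deployments.
    ids.push(("000" + index).slice(-4));
  }
  return ids;
}

console.log(generateNodeGroupIds(3).join(", ")); // prints 0001, 0002, 0003
```

Deriving IDs from the position keeps them stable between deployments as long as node groups are only appended or removed from the end.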

The default behavior should be to use online resharding. It increases availability and allows you to easily add and remove shards from your cluster. Offline resharding is necessary when you want to do more than just add or remove shards.

@dorser if this is enabled by default and someone who doesn't know about the details updates NumNodeGroups and other properties at the same time, what would happen? The docs caution against this, and I would aim for an L2 construct to be usable without deep understanding of the service details.
Similarly, would we need to handle NodeGroupConfiguration.Slots differently in that case to align with the recommended approach of deleting it from the template after initial deployment?

I'm not sure how to proceed on this discussion. @dorser @corymhall what do you think?

I would suggest that by default you omit the node group configuration slots and you always set the UseOnlineResharding. This is a bit of legacy about our service, but we initially launched only with OfflineResharding. Customers cared way more about online resharding than they did about configuring slots, so when we added online resharding we didn't allow custom slot configurations. If someone REALLY wants to set slots, they can use the L1 constructs, but my guess is that is a very small number of users.

@dbartholomae based on @madolson's comment, let's have UseOnlineResharding enabled by default.
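To make the outcome of this thread concrete, here is a minimal sketch of the defaulting behavior, assuming a hypothetical `renderReplicationGroup` function and props interface that are not part of the RFC. It only models the CloudFormation output: the `AWS::ElastiCache::ReplicationGroup` resource gets an `UpdatePolicy` with `UseOnlineResharding: true` unless the user opts out, and `NodeGroupConfiguration` (including `Slots`) is never emitted.

```typescript
// Hypothetical props for illustration only; not the RFC's final API.
interface ReplicationGroupSketchProps {
  description: string;
  numNodeGroups?: number; // number of shards; CloudFormation defaults this to 1
  useOnlineResharding?: boolean; // assumed default: true, per the discussion above
}

// Minimal shape of the CloudFormation resource that this sketch renders.
interface RenderedResource {
  Type: string;
  Properties: Record<string, unknown>;
  UpdatePolicy?: { UseOnlineResharding: boolean };
}

function renderReplicationGroup(props: ReplicationGroupSketchProps): RenderedResource {
  const resource: RenderedResource = {
    Type: "AWS::ElastiCache::ReplicationGroup",
    Properties: {
      ReplicationGroupDescription: props.description,
      NumNodeGroups: props.numNodeGroups ?? 1,
      // NodeGroupConfiguration (and its Slots) is deliberately left out.
    },
  };
  // Online resharding is on unless the caller explicitly disables it.
  if (props.useOnlineResharding ?? true) {
    resource.UpdatePolicy = { UseOnlineResharding: true };
  }
  return resource;
}

const rendered = renderReplicationGroup({ description: "example cache", numNodeGroups: 3 });
console.log(JSON.stringify(rendered, null, 2));
```

Users who really need custom slots would, as suggested above, drop down to the L1 construct instead of this code path.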

Pull request has been modified.

dbartholomae · 2022-12-07T20:31:57Z

@corymhall quick bump :)

corymhall

@dbartholomae sorry for the delay! Everything looks really good and I'm ready to approve. I'll send an email to the team notifying them that this has entered final comments period. If no blocking issues are raised I'll merge this next week.

dbartholomae · 2022-12-09T18:43:39Z

Thanks! I've fixed the markdown linting and reached out for community feedback both in the Slack and in the related issues.

dorser · 2022-12-15T18:09:29Z

text/0456-elasticache-l2.md

+A `RedisClusterReplicationGroup` is similar to a `RedisReplicationGroup`, but
+with multiple shards. The documentation calls them "Redis (cluster mode enabled)".
+The main difference is that a `RedisClusterReplicationGroup` requires a
+`numNodeGroups` to be set to a value of 2 or higher. Only `RedisClusterReplicationGroup`s


numNodeGroups can be set to 1 or higher. In CloudFormation it defaults to 1 and I think we should keep it that way.

But at 1, it is a RedisReplicationGroup, while at 2 or higher, it is a RedisClusterReplicationGroup, which accepts different props. It doesn't make sense to allow 1 for a RedisClusterReplicationGroup.

Don't get me wrong, you most likely know way more about ElastiCache than me, I'm just trying to wrap my head around this :)

@dorser @corymhall how do we proceed here? I currently see no need for change, but I might be missing something.

A RedisClusterReplicationGroup could have a single shard; the API allows it. What differs is the API of the data plane protocol.

@madolson are you agreeing with @dbartholomae then?

@corymhall Yes
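The two positions in this thread differ only in the lower bound enforced for `numNodeGroups`. A hypothetical validation helper (the name `validateNumNodeGroups` and its signature are assumptions of this sketch, not the RFC's API) can take the bound as a parameter, so either choice is a one-line change: 1 as allowed by the API, or 2 as in the RFC draft quoted above.

```typescript
// Hypothetical validation helper. `minNodeGroups` is a parameter because the
// thread discusses both 1 (allowed by the API) and 2 (the RFC draft's wording).
function validateNumNodeGroups(numNodeGroups: number, minNodeGroups: number): void {
  if (Math.floor(numNodeGroups) !== numNodeGroups) {
    throw new Error("numNodeGroups must be an integer, got " + numNodeGroups);
  }
  if (numNodeGroups < minNodeGroups) {
    throw new Error(
      "numNodeGroups must be at least " + minNodeGroups + ", got " + numNodeGroups,
    );
  }
}

// A single shard passes with a lower bound of 1 ...
validateNumNodeGroups(1, 1);

// ... and is rejected with a lower bound of 2.
try {
  validateNumNodeGroups(1, 2);
} catch (error) {
  console.log((error as Error).message);
}
```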

dorser · 2022-12-15T19:44:13Z

text/0456-elasticache-l2.md

+
+## Constructs
+
+### Defining a RedisReplicationGroup


The default behavior should be to use online resharding. It increases availability and allows you to easily add and remove shards from your cluster. Offline resharding is necessary when you want to do more than just add or remove shards.

edisongustavo

Will there be support for memcached too?

dbartholomae · 2023-01-10T16:38:19Z

> Will there be support for memcached too?

This RFC does not cover memcached, but it should leave a path open to add memcached later on.

dbartholomae · 2023-01-28T23:03:45Z

I've updated the RFC based on the comments. Since it has been open for comments for a while now, I would like to bring the RFC to a close. There are three open conversations left.

corymhall · 2023-02-27T13:05:27Z

@dbartholomae great job on this! Sorry it took so long!!

dbartholomae · 2023-03-05T08:16:53Z

Thanks! I'll start working on an implementation, possibly as early as this week :)

evgenyka · 2023-05-15T17:28:15Z

@dbartholomae how's it going? Have you been able to make any progress on this one?

dbartholomae · 2023-05-16T17:22:42Z

@evgenyka Unfortunately I didn't get to start with this yet, as it isn't highest priority for me right now.

corymhall previously requested changes Nov 16, 2022

corymhall reviewed Dec 8, 2022

dbartholomae changed the title ~~Add first RFC draft~~ Add ElastiCache L2 RFC Dec 9, 2022

dbartholomae mentioned this pull request Dec 9, 2022

📊Tracking: Amazon ElastiCache aws/aws-cdk#6908

Open

dorser reviewed Dec 15, 2022

fullsailor reviewed Dec 22, 2022

edisongustavo reviewed Jan 10, 2023

dbartholomae added 6 commits January 29, 2023 00:05

- `15e24a5` Add first RFC draft
- `b8b6e11` Update RFC based on first discussions
- `4059066` Fix linting
- `cefba2d` Remove incorrect line
- `1b1be4c` Clear up that ParameterGroup will not be exposed as L2 construct
- `5efd10b` Generate CacheNodeType from InstanceType

dbartholomae force-pushed the patch-1 branch from fcff296 to 5efd10b Compare January 28, 2023 23:05

corymhall approved these changes Feb 27, 2023

corymhall merged commit ad94145 into aws:master Feb 27, 2023

dbartholomae deleted the patch-1 branch March 5, 2023 08:16

evgenyka added the l2-request label (request for new L2 construct) Oct 11, 2023
