Add Design for Allowing Instance-Level Resource Status Restore
Signed-off-by: Shubham Pampattiwar <[email protected]>

add changelog file

Signed-off-by: Shubham Pampattiwar <[email protected]>

typo fix

Signed-off-by: Shubham Pampattiwar <[email protected]>
shubham-pampattiwar committed Nov 13, 2024
1 parent 1fbd22f commit d0621c7
Showing 2 changed files with 111 additions and 0 deletions.
1 change: 1 addition & 0 deletions changelogs/unreleased/8403-shubham-pampattiwar
@@ -0,0 +1 @@
Add Design for Allowing Instance-Level Resource Status Restore
110 changes: 110 additions & 0 deletions design/resource-status-restore.md
@@ -0,0 +1,110 @@
# Allow Instance-Level Resource Status Restore in Velero

## Abstract
This design proposes a way to enhance Velero’s restore functionality by enabling instance-level resource status restoration through annotations.
Currently, Velero allows restoring resource statuses only at the resource-type level, which lacks the granularity to restore the status of specific resource instances.
By introducing an annotation that controllers can set on individual resource instances, this design aims to improve flexibility and autonomy for users and resource controllers, providing a more granular way to enable resource status restore.


## Background
Velero provides the `restoreStatus` field in the Restore custom resource to specify resource types for status restoration. However, this feature is limited to resource types as a whole, lacking the granularity needed to restore specific instances of a resource type. Resource controllers, especially those managing custom resources with external dependencies, may need to restore status on a per-instance basis based on internal logic and dependencies.

This design adds an annotation-based approach to allow controllers to specify status restoration at the instance level, enabling Velero to handle status restores more flexibly.

## Goals
- Provide a mechanism to specify the restoration of a resource’s status at an instance level.
- Maintain backwards compatibility with existing functionality, allowing gradual adoption of this feature.
- Integrate the new annotation-based instance-level status restore with Velero’s existing resource-type-level `restoreStatus` configuration.

## Non-Goals
- Alter Velero’s existing resource type-level status restoration mechanism.

## Use-Cases/Scenarios

1. Controller managing specific Resources
   - A resource controller identifies that a specific instance of a resource should have its status restored due to particular dependencies.
   - The controller automatically sets the `velero.io/restore-status: true` annotation on the resource.
   - During restore, Velero restores the status of this instance, while leaving other resources unaffected.

2. Resource-Type level Restore
   - A user specifies a resource type (e.g., `workflows`) in the `restoreStatus.includedResources` field within the Restore custom resource.
   - Velero restores the status for all instances of the specified resource type, regardless of whether they have the `velero.io/restore-status` annotation.

3. Default Behavior for Instances Without the Annotation
   - Instances without the `velero.io/restore-status` annotation behave as they currently do: Velero skips their status restoration unless the resource type is specified in the `restoreStatus.includedResources` field.
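
To make the first scenario concrete, here is a minimal sketch of how a controller might mark an instance before a backup is taken. The helper `markForStatusRestore` is hypothetical and works on a plain annotation map for illustration; a real controller would apply the result through its Kubernetes client, which is not shown here.

```go
package main

import "fmt"

// restoreStatusAnnotation is the annotation key proposed by this design.
const restoreStatusAnnotation = "velero.io/restore-status"

// markForStatusRestore is a hypothetical helper (not part of Velero).
// It returns a copy of the given annotations with the restore-status
// annotation set to "true", leaving the input map untouched.
func markForStatusRestore(annotations map[string]string) map[string]string {
	out := make(map[string]string, len(annotations)+1)
	for k, v := range annotations {
		out[k] = v
	}
	out[restoreStatusAnnotation] = "true"
	return out
}

func main() {
	existing := map[string]string{"example.com/owner": "team-a"}
	updated := markForStatusRestore(existing)
	fmt.Println(updated)
}
```

Copying the map rather than mutating it is only a choice made for this sketch, so that the caller decides when the change is written back to the object.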

## High-Level Design

- Instance-Level Status Restore Annotation: We are introducing the `velero.io/restore-status` annotation at the resource instance level to mark specific instances for status restoration.
  - `true`: Indicates that the status should be restored for this instance, even if the resource type is not listed in `restoreStatus.includedResources`.
  - `false` (or absence): Indicates no special treatment, and the instance will only restore status if its resource type is in `restoreStatus.includedResources`.

- Velero Restore Logic Update: During a restore operation, Velero will:
  - Check the `restoreStatus.includedResources` field for resource types that should have status restored.
  - For these types, restore status for all instances, regardless of the annotation.
  - For other resource types, check each instance for the `velero.io/restore-status: true` annotation, restoring status only for those marked instances.
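
The precedence described above can be sketched as a small decision function. This is an illustrative sketch, not Velero code: `shouldRestoreStatus` is a hypothetical name, and the `typeIncluded` parameter stands in for the result of evaluating `restoreStatus.includedResources` for the instance's resource type.

```go
package main

import "fmt"

// restoreStatusAnnotation is the annotation key proposed by this design.
const restoreStatusAnnotation = "velero.io/restore-status"

// shouldRestoreStatus is a hypothetical helper (not part of Velero).
// typeIncluded is assumed to be true when the instance's resource type
// matches restoreStatus.includedResources on the Restore object.
func shouldRestoreStatus(typeIncluded bool, annotations map[string]string) bool {
	// Type-level configuration wins: every instance of an included type
	// has its status restored, whatever the annotation says.
	if typeIncluded {
		return true
	}
	// Otherwise only instances explicitly marked "true" are restored.
	// Reading from a nil map is safe in Go and yields the empty string.
	return annotations[restoreStatusAnnotation] == "true"
}

func main() {
	marked := map[string]string{restoreStatusAnnotation: "true"}
	optedOut := map[string]string{restoreStatusAnnotation: "false"}

	fmt.Println("included type, opted-out instance:", shouldRestoreStatus(true, optedOut))
	fmt.Println("other type, marked instance:", shouldRestoreStatus(false, marked))
	fmt.Println("other type, no annotation:", shouldRestoreStatus(false, nil))
}
```

Note that under this reading a `false` annotation does not opt an instance out when its type is included at the type level, which matches the "regardless of the annotation" wording above.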


## Detailed Design

1. Annotation for Instance-Level Status Restore: The `velero.io/restore-status` annotation will be set on individual resource instances by users/controllers as needed:
```yaml
metadata:
  annotations:
    velero.io/restore-status: "true"
```
2. Restore Logic Modifications: During the restore operation, the restore controller will follow these steps:
   - Check `restoreStatus.includedResources` to get a list of resource types that should have their statuses restored.
   - For each instance in the specified types, restore the status regardless of any annotation.
   - For other resource types (not in `restoreStatus.includedResources`), check each instance’s annotations.
     - If an instance has `velero.io/restore-status: true`, restore the status for that instance.

3. Error Handling: If an invalid annotation value or format is encountered, Velero logs a warning and continues without restoring the status for that instance.
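
The error-handling rule could be sketched as follows. This is only a sketch under stated assumptions: the helper `parseRestoreStatusAnnotation` is hypothetical, and it assumes that the only valid values are the exact strings `true` and `false`, since the design does not enumerate which values count as invalid. The returned warning string stands in for whatever Velero would write to its restore log.

```go
package main

import "fmt"

// restoreStatusAnnotation is the annotation key proposed by this design.
const restoreStatusAnnotation = "velero.io/restore-status"

// parseRestoreStatusAnnotation is a hypothetical helper (not part of Velero).
// It reports whether the instance-level annotation requests a status restore,
// and returns a non-empty warning when the value is not recognized.
// Assumption: only the exact strings "true" and "false" are valid.
func parseRestoreStatusAnnotation(annotations map[string]string) (restore bool, warning string) {
	value, ok := annotations[restoreStatusAnnotation]
	if !ok {
		// Absent annotation: no instance-level request and nothing to warn about.
		return false, ""
	}
	switch value {
	case "true":
		return true, ""
	case "false":
		return false, ""
	default:
		// Invalid value: do not restore status for this instance, but
		// surface a warning so the restore can continue.
		return false, fmt.Sprintf("invalid value %q for annotation %s; skipping instance-level status restore",
			value, restoreStatusAnnotation)
	}
}

func main() {
	for _, v := range []string{"true", "false", "yes"} {
		restore, warning := parseRestoreStatusAnnotation(map[string]string{restoreStatusAnnotation: v})
		fmt.Printf("value=%q restore=%v warning=%q\n", v, restore, warning)
	}
}
```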


## Implementation

We are targeting the Velero 1.16 release for the implementation of this design.

Current restoreStatus logic resides here: https://github.com/vmware-tanzu/velero/blob/32a8c62920ad96c70f1465252c0197b83d5fa6b6/pkg/restore/restore.go#L1652

The modified logic would look somewhat like:

```go
// Determine whether to restore status based on resource type configuration and instance-level annotation
shouldRestoreStatus := ctx.resourceStatusIncludesExcludes != nil && ctx.resourceStatusIncludesExcludes.ShouldInclude(groupResource.String())

// Check for the instance-level annotation on the resource object
instanceAnnotation := obj.GetAnnotations()["velero.io/restore-status"]
shouldRestoreStatusInstance := instanceAnnotation == "true"

// If either the resource type is included or the instance-level annotation indicates restoration
if (shouldRestoreStatus || shouldRestoreStatusInstance) && statusFieldErr != nil {
	err := fmt.Errorf("could not get status to be restored %s: %v", kube.NamespaceAndName(obj), statusFieldErr)
	ctx.log.Errorf(err.Error())
	errs.Add(namespace, err)
	return warnings, errs, itemExists
}

ctx.log.Debugf("status field for %s: exists: %v, should restore: %v, should restore by annotation: %v", newGR, statusFieldExists, shouldRestoreStatus, shouldRestoreStatusInstance)

// Proceed with status restoration if required by either resource type or annotation
if statusFieldExists && (shouldRestoreStatus || shouldRestoreStatusInstance) {
	if err := unstructured.SetNestedField(obj.Object, objStatus, "status"); err != nil {
		ctx.log.Errorf("could not set status field %s: %v", kube.NamespaceAndName(obj), err)
		errs.Add(namespace, err)
		return warnings, errs, itemExists
	}

	obj.SetResourceVersion(createdObj.GetResourceVersion())
	updated, err := resourceClient.UpdateStatus(obj, metav1.UpdateOptions{})
	if err != nil {
		ctx.log.Infof("status field update failed %s: %v", kube.NamespaceAndName(obj), err)
		warnings.Add(namespace, err)
	} else {
		createdObj = updated
	}
}
```
