Typed catchers #2814

the10thWiz · 2024-06-26T02:01:19Z

Addresses #749, adds support for typed catchers via the transient crate.

When a FromRequest, FromParam, FromQuery, or FromData implementation returns an Error or Forward, if the returned error implements Any<Co<'r>>, it is boxed and travels with the Outcome.

A catcher can have a third parameter, now (&E, Status, &Request), where E is some type that implements Any<Co<'r>>. If the error contained in the Outcome is of the same type, it is downcast to
the specific type, and passed to the catcher.

A working example can be seen in the error-handling example - simply visit /hello/name/a to see a ParseIntError returned by the FromParam of i8.

TODO

SergioBenitez

Thank you so, so much for this. As you know, this is a feature that's been
requested for quite some time now, so it's exciting to see some real progress
here.

While reviewing this PR, I found myself consistently asking:

Does this overcome the fundamental technical hurdles that have plagued this
getting this feature into Rocket?
Does this meet the usability requirements we expect?
Is this feature-complete enough to avoid missing expectation due to missing
features?

I believe the answer to the first question is yes, which is truly exciting!
I temper this statement only because I also believe we're missing some key
usability-impacting features, and there's a concern that the approach won't
extend to these features, but at a first glance it seems probable.

Feature-Completeness

This PR current allows errors from input guards (FromParam, FromSegments,
FromForm, and FromData) to be type-erased and later downcast into a concrete
type in an error catcher. The critical missing piece is allowing the same for
responders. In particular, we want the following to compile and work as one
might expect:

#[derive(Debug, Transient /* Error */)]
struct MyType<'r>(&'r str);

#[get("/<foo>")]
fn hello(foo: &str) -> Result<&'static str, MyType<'_>> {
    Err(MyType(foo))
}

#[catch(default)]
fn error(foo: &MyType) -> &str {
    foo.0
}

This means that Responder needs to change to accommodate an error type.

type Outcome<'o> = Outcome<Response<'o>, E, Status>;

pub trait Responder<'r, 'o: 'r> {
    type Error: Debug + 'o;

    fn respond_to(self, request: &'r Request<'_>) -> Outcome<'o, Self::Error>;
}

Where:

Outcome::Success(Response) is akin to today's Ok(Response)
Outcome::Forward(Status) is akin to today's Err(Status) and can be read
as "forwarding to the catcher for Status"
Outcome::Error(Self::Error) is new. For many types, Self:Error will be
set to Infallible, implying that the responder only succeeds or forwards.

One thing to consider is that we probably want to associate a status with the
error, and we also want to allow the error to respond directly if nothing
catches it.

Here we have two obvious approaches:

Require Responder::Error: Responder, apply the static dispatch
specialization trick to check for T: Transient as this PR does. Check that
we don't run into an infinite error-loop (i.e, a responder that returns
itself as an error which then errors with itself, and so on).

Create a new trait, Error, and apply the static dispatch specialization
trick to check for T: Error. The trait might look something like:

trait Error<'r>: Transient + .. {
    fn respond_to(&self, request: &'r Request<'_>) -> Result<Response<'r>, Status> {
        Err(self.status())
    }

    /// Maybe? Ignore for now.
    fn status(&self) -> Status { Status::InternalServerError }

    /// Log on catcher failure: expected type Foo, have error Bar.
    fn type_name(&self) -> &'static str { .. }

    fn upcast(self) -> Box<dyn Transient<'r>> { .. }

    fn source(&self) -> Option<Box<dyn Error<'r>>> { .. }
}

It's not obvious to me at this point which approach might be better, or if
there's another approach that's missing here. The main difference is that the
former requires the error type to implement Responder, which may not be what
we want, and furthermore that the Responder implementation not vary between
being used as a regular response and as an error. As such, I lean towards the
latter approach.

Usability

My main concerns on the usability front are two-fold:

the extensions to the #[catch] macro. In particular, I'd like to find a
way to remove the need for additional attribute arguments of status and
perhaps even error.
the difficulty in implementing custom downcastable errors

`#[catch]`

In other attributes, like #[route], we need attribute parameters because
either they provide information about what to parse (path, segments, and query
params) or they operate in a fundamentally different manner at the type-system
level: (data and request). There's also a security and category-theoretical
argument to be made about creating different class types for parseable
parameters.

But with status and error, none of these argument need necessarily apply:
all of the information derives from references or copy-able types without and
particular security considerations. So the only reason* (there's one reason we
might want an error argument, actually) for the arguments to exist is for our
sake in implementing codegen.

More concretely, the #[catch] attribute separates types into three categories:

T: FromRequest
T: Transient via error =
Status via status =

I argue that in reality, these should all be one:

T: FromError

trait FromError<'r> {
    fn from_error(request: &'r Request<'_>, error: Option<ErasedError>);
}

Then we can recover all of the obvious implementations:

impl<'r> FromError<'r> for Status {
    fn from_error(request: &'r Request<'_>, error: Option<ErasedError>) {
        error.as_ref().map_or(Status::InternalServerError, |e| e.status())
    }
}

Implementing Custom Errors

Consider the following complete program:

use rocket::*;
use rocket::request::{self, Request, FromRequest};
use rocket::outcome::Outcome;
use rocket::http::Status;

#[derive(Debug, transient::Transient)]
struct MyError<'r>(&'r str);

#[derive(Debug)]
struct MyGuard;

#[rocket::async_trait]
impl<'r> FromRequest<'r> for MyGuard {
    type Error = MyError<'r>;

    async fn from_request(req: &'r Request<'_>) -> request::Outcome<Self, Self::Error> {
        Outcome::Error((Status::InternalServerError, MyError(req.uri().path().as_str())))
    }
}

#[get("/")]
fn index(guard: MyGuard) -> &'static str {
    "Hello, world!"
}

#[catch(default, error = "<foo>")]
fn error<'a>(foo: &MyError<'a>) -> &'a str {
    foo.0
}

#[launch]
fn rocket() -> _ {
    rocket::build()
        .mount("/", routes![index])
        .register("/", catchers![error])
}

There are a few issues here:

I needed to depend on transient directly or else the derive wouldn't work
since it references transient::. This shouldn't need to be the case.
The error catcher fails at runtime because, for whatever reason, the error
type MyError is not meeting the specialization requirements. I would
expect that if my type doesn't meet the requirements to be an error, then
I wouldn't be allowed to use it as typed error at all, and my program would
fail to compile.
Covariance needs to be declared: can we figure it out automatically, i.e,
via by calling rustc, or for references syntactically (&T)?

This also begs the question: what exactly is required to be implemented of error
types? Transient does not appear to be sufficient. This further supports the
idea of a Rocket-specific Error trait that has all of the requirements
built-in. And if Error is a Rocket trait, then we can re-export the derive
to allow users to do the following, completely relegating transient to an
implementation detail.

#[derive(rocket::Error)]
struct MyError<'r> { .. }

As an aside, the following gives an error message that we should look to improve:

#[derive(Debug, Responder, Transient)]
struct MyType<'r>(&'r str);

#[catch(default, error = "<foo>")]
fn error<'a>(uri: &Origin, foo: &'a MyType) -> &'a str {
    foo.0
}

core/lib/src/catcher/catcher.rs

core/lib/src/router/collider.rs

core/lib/src/lifecycle.rs

core/lib/src/catcher/types.rs

core/lib/src/lifecycle.rs

examples/responders/src/main.rs

the10thWiz · 2024-09-21T19:24:33Z

@SergioBenitez I've made some significant progress, and I've found a couple ways to improve the API. I wanted to put them in writing, both so you can comment on them sooner rather than later, and for future reference.

The 'final' API

I've opted to remove Status from anywhere a TypedError is returned, rather relying on the error to provide a status. To aid in this, there are TypedError implementations for Status, (Status, T) and Custom<T> (where T: TypedError), so the status can be customized without being forced to add a status member.

The final form of the TypedError trait:

// `AsAny` is a sealed/blanket impl for a `fn as_any(&self) -> &dyn Any`
pub trait TypedError<'r>: AsAny<Inv<'r>> + Send + Sync + 'r {
    /// Generates a default response for this type (or forwards to a default catcher)
    fn respond_to(&self, request: &'r Request<'_>) -> Result<Response<'r>, Status> {
        Err(self.status())
    }

    /// A descriptive name of this error type. Defaults to the type name.
    fn name(&self) -> &'static str { std::any::type_name::<Self>() }

    /// The error that caused this error. Defaults to None.
    ///
    /// # Warning
    /// A typed catcher will not attempt to follow the source of an error
    /// more than (TODO: exact number) 5 times.
    fn source(&'r self) -> Option<&'r (dyn TypedError<'r> + 'r)> { None }

    /// Status code
    fn status(&self) -> Status { Status::InternalServerError }
}

It should be fairly obvious what these methods get used for.

One change, is that .source() will be called multiple times (currently up to 5). I'm preventing loops by simply placing a reasonable upper limit on the number of times it can be called, since I don't think there is a better option. (I considered checking for pointer equality with a previous value, but I'm not confident we can rely on this, since ZSTs don't have a meaningful address, and false positives might be possible).

Remaining decisions

Is TypedError too onerous of a trait bound for error types? I don't think so. Pretty much every std error is already a typed error, and Status can be used if you don't want to deal with an actual type. The derive macro has very few requirements (we only support a single lifetime parameter), so any type you control can pretty trivially implement TypedError. The one exception would be third-party errors (such as sqlite::Error, which obviously doesn't implement TypedError), but we could provide some kind of Debug<E> impl, that enables any 'static to act as a TypedError. I also don't think it's a significant burden to ask rocket-specific crates to add TypedError to their error types, and wrap any error types they don't control.

With this in mind, if it isn't too much of a burden, we might be able to eliminate the specialization trick, since each of these error types would be required to implement TypedError. This would also allow Responder to return a Result<Response, Self::Error>, since it provides all the functionality we would require.

The main alternative would be to require some kind of HasStatus trait, with a single fn status(&self) -> Status method, and specialize on TypedError. However, I don't think there is much value in allowing types to implement HasStatus without TypedError - the only additional req is Transient, which is trivially derived with the TypedError derive.

I'm also considering removing respond_to from TypedError, since it's pretty unlikely it actually gets called. Any catcher (even an untyped one) prevents the respond_to method from getting called.

Potential issues

The respond_to method is tried after untyped catchers, so an untyped catcher can prevent respond_to from ever getting called. Because of this, a default catcher mounted at the root will prevent respond_to from ever getting called, for any error.

We likely need to provide error wrappers for the contrib db crates, since there really isn't a good way to implement TypedError for the DB error types.

the10thWiz · 2024-11-22T16:47:59Z

There are a few major updates with this. It's a partial rewrite, based on the latest version of main, along with a few other changes along the way.

First, the TypedError's source() method now takes a parameter, so that it can return multiple errors. The only use-case in current code is Either<A, B>, where the error type is (A, B).

Second, we now use a git dependency for transient, since there are a few changes that have not yet been merged into upstream. There are also some other issues I need to sort out with the CI for transient, but that shouldn't prevent reviewing this PR.

Third, TypedError is no longer optional, and the specialization trick is no longer used. We provide two primary options for implementing TypedError: the derive macro, which can be applied to almost any type, and Debug<E>, which provides a default implementation for any 'static type.

Overall, I'm happy with this PR. From updating the tests, and a personal project I'm working on at the same time, there are only a couple of pain points. Since I'm deep in this project, I can't say with any level of certainty how easy it is to identify what type you should use in the catcher. I hope it's fairly easy with the error_name parameter in the trace logs. Also, a number of examples in Rocket itself use Result as a responder, in a way that no longer works, since Result now converts the Err variant into a TypedError that can be caught by a catcher. I rewrote the examples to use Either, but it did make them more verbose.

@SergioBenitez Please review this when you get a chance.

the10thWiz self-assigned this Jun 26, 2024

the10thWiz modified the milestone: 0.6.0 Jun 26, 2024

the10thWiz linked an issue Jun 26, 2024 that may be closed by this pull request

Typed Error Catchers, Fairings #749

Open

the10thWiz requested a review from SergioBenitez June 26, 2024 03:07

SergioBenitez requested changes Aug 9, 2024

View reviewed changes

the10thWiz added 15 commits November 17, 2024 18:35

Apply basic design to the core (and http) library

b11d407

Add FromError and update Fairings

ba1a7ff

Update sync_db_pools for new API

3eb4154

Update db_pools

368bd06

Update websockets

0c5f576

Update rocket_dyn_templates

fded2a8

Add todo notes

35cc4fa

Fixing more compile issues

e0fbae6

Complete initial implementation

ed5618a

Update with working implementation

88636e2

Add tests, and update collition logic

32e25a1

Add better tests

ddc8417

Update derive docs

2a63797

Updating docs

aa845ad

Clean up initial implementation

0b72867

the10thWiz force-pushed the typed-catchers branch from c263a6c to 0b72867 Compare November 22, 2024 05:31

the10thWiz marked this pull request as ready for review November 22, 2024 05:31

Fix minor issues

1a64607

the10thWiz added 4 commits November 22, 2024 12:56

Update s2n-quic-h3 to fix compatibility issue

4ad4c08

Remove old dependency

443d381

Re-add display for error to tracing

10b9faa

Fix more small things

b3806e6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Typed catchers #2814

Typed catchers #2814

the10thWiz commented Jun 26, 2024 •

edited

Loading

SergioBenitez left a comment

the10thWiz commented Sep 21, 2024

the10thWiz commented Nov 22, 2024

Typed catchers #2814

Are you sure you want to change the base?

Typed catchers #2814

Conversation

the10thWiz commented Jun 26, 2024 • edited Loading

TODO

SergioBenitez left a comment

Choose a reason for hiding this comment

Feature-Completeness

Usability

#[catch]

Implementing Custom Errors

the10thWiz commented Sep 21, 2024

The 'final' API

Remaining decisions

Potential issues

the10thWiz commented Nov 22, 2024

the10thWiz commented Jun 26, 2024 •

edited

Loading

`#[catch]`