Distinguish TimeoutErrors for open and read timeouts #718

coberlin · 2017-08-09T17:02:31Z

In faraday/adapter/rack.rb, TimeoutError is raised for both open and read timeouts:

timeout  = env[:request][:timeout] || env[:request][:open_timeout]
response = if timeout
  Timer.timeout(timeout, Faraday::Error::TimeoutError) { execute_request(env, rack_env) }
else ... end

According to https://stackoverflow.com/questions/10322283/what-is-timeout-and-open-timeout-in-faraday, open_timeout applies to opening the TCP connection and timeout applies to reading the response.
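To illustrate why a single timer cannot tell the two cases apart, here is a minimal sketch using Ruby's standard `Timeout` module. `SketchTimeoutError` and `run_with_timer` are made-up names for illustration, not Faraday code, and the "phases" are simulated with `sleep`.

```ruby
require 'timeout'

# Hypothetical stand-in for Faraday::Error::TimeoutError.
class SketchTimeoutError < StandardError; end

# One timer around the whole call, similar in spirit to the snippet above.
# Returns :ok on success or the class of the timeout error that was raised.
def run_with_timer(limit)
  Timeout.timeout(limit, SketchTimeoutError) { yield }
  :ok
rescue SketchTimeoutError => e
  e.class
end

connect_result = run_with_timer(0.05) { sleep 1 } # pretend the connect phase hangs
read_result    = run_with_timer(0.05) { sleep 1 } # pretend the read phase hangs

# Both phases surface as the same class, so the caller cannot distinguish them.
puts connect_result == read_result # prints "true"
```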

It would be nice to have separate exception types for these timeouts. Then we could determine whether or not to retry the request. Does adding something like Faraday::Error::OpenTimeoutError and Faraday::Error::ResponseTimeoutError and using those here make sense?


iMacTia · 2017-08-10T11:09:11Z

Hi @coberlin, I believe this might be a nice addition, I'm just worried about backwards compatibility.
However, a possible solution might be to have OpenTimeoutError and ResponseTimeoutError inherit from TimeoutError, so that existing rescues will keep working as expected.
It's definitely worth some testing 😃
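A minimal sketch of that inheritance idea, assuming a hypothetical `SketchErrors` module (these are not Faraday's real classes): code that rescues the parent class keeps working, while newer code can rescue the subclasses separately.

```ruby
# Hypothetical module for illustration; NOT Faraday's real classes.
module SketchErrors
  class TimeoutError < StandardError; end
  class OpenTimeoutError < TimeoutError; end
  class ResponseTimeoutError < TimeoutError; end
end

# Existing code that only knows about the parent class.
def legacy_call
  yield
  :success
rescue SketchErrors::TimeoutError
  :timed_out
end

# Newer code that wants to tell the two cases apart.
# The more specific class must be rescued first.
def new_call
  yield
  :success
rescue SketchErrors::OpenTimeoutError
  :retry
rescue SketchErrors::TimeoutError
  :give_up
end

puts legacy_call { raise SketchErrors::OpenTimeoutError }  # prints "timed_out"
puts new_call { raise SketchErrors::OpenTimeoutError }     # prints "retry"
puts new_call { raise SketchErrors::ResponseTimeoutError } # prints "give_up"
```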

coberlin · 2017-08-15T18:18:56Z

The rack_adapter might be the wrong place for this feature. I think Rack applications don't necessarily distinguish between open and read timeouts. Perhaps this feature would work in the HTTPClient adapter or other adapters? From adapter/httpclient.rb:

    @app.call env
  rescue ::HTTPClient::TimeoutError, Errno::ETIMEDOUT
    raise Faraday::Error::TimeoutError, $!
  rescue ::HTTPClient::BadResponseError => err
    if err.message.include?('status 407')
      raise Faraday::Error::ConnectionFailed, %{407 "Proxy Authentication Required "}
    else
      raise Faraday::Error::ClientError, $!
    end
  rescue Errno::ECONNREFUSED, IOError, SocketError
    raise Faraday::Error::ConnectionFailed, $!
  rescue => err
    if defined?(OpenSSL) && OpenSSL::SSL::SSLError === err
      raise Faraday::SSLError, err
    else
      raise
    end

::HTTPClient::TimeoutError has 3 subclasses: ConnectTimeoutError, ReceiveTimeoutError, and SendTimeoutError; see e.g. http://www.rubydoc.info/gems/httpclient/2.1.5.2/HTTPClient/TimeoutError

Faraday has Faraday::Error::ConnectionFailed already. Is that appropriate for ConnectTimeoutError? Faraday::Error::TimeoutError could be subclassed into Faraday::Error::ReceiveTimeoutError and Faraday::Error::SendTimeoutError.

iMacTia · 2017-08-16T09:06:03Z

> Faraday has Faraday::Error::ConnectionFailed already. Is that appropriate for ConnectTimeoutError?

This makes sense, but it wouldn't be backwards compatible. We have to keep in mind that people are already catching Faraday::Error::TimeoutError in their applications, so switching to ConnectionFailed would break those cases.
What we want to do, instead, is define 2 subclasses of Faraday::Error::TimeoutError whose names should be as generic as possible:

Faraday::Error::OpenTimeoutError
Faraday::Error::ReadTimeoutError

Next step is to go into each adapter and map the adapter exceptions accordingly. E.g. for the HTTPClient:

HTTPClient::ConnectTimeoutError ==> Faraday::Error::OpenTimeoutError
HTTPClient::ReceiveTimeoutError ==> Faraday::Error::ReadTimeoutError
HTTPClient::TimeoutError ==> Faraday::Error::TimeoutError (this will also catch SendTimeoutError, which I'm not sure has a corresponding mapping or a specific setting in Faraday)

Finally, tests should be added where possible :)
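The mapping above could look roughly like the following sketch. It uses stand-in classes (`FakeHTTPClient`, `SketchFaraday`) so it runs without the httpclient or faraday gems; `wrap_timeouts` and `mapped_class` are hypothetical helpers, not part of any adapter.

```ruby
# Stand-ins mirroring the HTTPClient timeout hierarchy (hypothetical).
module FakeHTTPClient
  class TimeoutError < RuntimeError; end
  class ConnectTimeoutError < TimeoutError; end
  class ReceiveTimeoutError < TimeoutError; end
  class SendTimeoutError < TimeoutError; end
end

# Stand-ins for the proposed Faraday classes (not Faraday's real API).
module SketchFaraday
  class TimeoutError < StandardError; end
  class OpenTimeoutError < TimeoutError; end
  class ReadTimeoutError < TimeoutError; end
end

# Most specific rescues first; the parent class catches the rest,
# including SendTimeoutError.
def wrap_timeouts
  yield
rescue FakeHTTPClient::ConnectTimeoutError => e
  raise SketchFaraday::OpenTimeoutError, e.message
rescue FakeHTTPClient::ReceiveTimeoutError => e
  raise SketchFaraday::ReadTimeoutError, e.message
rescue FakeHTTPClient::TimeoutError => e
  raise SketchFaraday::TimeoutError, e.message
end

# Helper that reports which wrapped class a given adapter error becomes.
def mapped_class(error_class)
  wrap_timeouts { raise error_class, 'timed out' }
rescue SketchFaraday::TimeoutError => e
  e.class
end

puts mapped_class(FakeHTTPClient::ConnectTimeoutError)
puts mapped_class(FakeHTTPClient::ReceiveTimeoutError)
puts mapped_class(FakeHTTPClient::SendTimeoutError)
```

Because every wrapped class descends from the parent timeout class, a caller that rescues only the parent would still see all three cases.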

mistersourcerer · 2017-10-05T13:28:02Z

Hey guys.

We had this discussion a "little" time ago (#324).

I'm giving it another try (https://github.com/mistersourcerer/faraday/tree/718_mrsrcr_timeout-wrapping-2nd-chance), will try and open a new PR as soon as I have some progress on it.

iMacTia · 2017-10-05T15:35:54Z

Hi @mistersourcerer, thanks for the nudge, I was totally unaware that discussion took place.
I'm a bit confused, as I see the PR closed but the change in the code. Happy to know you got your change merged somehow in the end 😄
Your help would be appreciated in this case as I think you're already comfortable with Timeout testing from your previous work (even though we're talking about 3 years ago!).
I hope my explanation on the OpenTimeoutError and ReadTimeoutError is clear, but if that's not the case then please let me know.
Take your time and open a pull request once you're done 👍

mistersourcerer · 2017-10-05T16:00:15Z

Hey @iMacTia.

If I remember correctly, we didn't manage to solve the situation back then. But I'm not sure exactly why.
The main problem was to write a test that failed consistently among all the adapters. So, I don't think my code was merged at all at the time.
Anyways, I have an idea for this some years after haha, let's see how it goes.

Right now, the tests for EMSynchrony are failing on Travis, but not locally. Trying to figure it out. I'm even thinking of opening an "early" PR so maybe we can discuss this.

And your explanation is crystal clear, seems the perfect way to go with it.

Thanks for the awesome work on this, man.

iMacTia · 2017-10-05T16:09:43Z

Thank you @mistersourcerer!

RE your changes: I'm not really sure of what happened, but I see @mislav finally merged your changes here: f73d13e

So rejoice, Errno::ETIMEDOUT is already wrapped under Faraday::Error::TimeoutError on most (if not all) adapters 😄

coberlin · 2017-10-05T19:18:08Z

Thanks for working on this @mistersourcerer!

Looking at your commit here, I wonder if for backwards compatibility, we need 2 new subclasses: OpenConnectionError < ConnectionError for Net::HTTP and OpenTimeoutError < TimeoutError for HttpClient?

iMacTia · 2017-11-13T13:53:19Z

There appears to be some confusion around this issue.
The reason is that a decision was taken on #438 to handle "open timeout" errors as ConnectionFailed. That is arguably the best decision, but the reality is that someone decided to go down that route.
Now, this doesn't affect only the Rack adapter but also all other adapters, and their behaviour is probably not even consistent.
I'm planning to standardise them on the same behaviour with v1.0 and I'll keep this issue as a reference.

iMacTia · 2017-11-13T15:39:47Z

Follow-up to my previous comment.

Basically, we're currently raising a Faraday::ConnectionFailed error in case of an open timeout, while we raise a Faraday::TimeoutError for a read timeout. Although different adapters are currently behaving in different ways, this seems to be the most common behaviour.
This was decided something like 3 years ago, but here we're discussing having a Faraday::TimeoutError for the former case as well (with proper subclasses to distinguish between open and read).

On one side I understand that would be closer to reality, but if I analyse the issue from an implementation point of view, I find it hard to justify this change.
If I call a service and I get back a ConnectionFailed, I know that my call can't possibly have been processed. I probably didn't reach the server, or couldn't resolve the hostname, or something else happened.
If I get back a TimeoutError, then my request might have been processed, or partially processed, and I might have missed the response. That's a completely different case and requires double-checking with the server I was calling to find out what happened.

Making the open timeout a sub-category of TimeoutError means moving a simple situation (request not processed) into a more complex domain, and surely requires additional checks to decide what to do: was it an open timeout or a read timeout?
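The practical difference between the two errors can be sketched as a retry policy. This is only an illustration under stated assumptions: `RetrySketch` and `call_with_retry` are hypothetical names, not Faraday's API or its retry middleware.

```ruby
# Hypothetical stand-ins for the two error families discussed here.
module RetrySketch
  class ConnectionFailed < StandardError; end
  class TimeoutError < StandardError; end
end

# Retries only when the request cannot have reached the server.
# A TimeoutError is deliberately not rescued: the request may have been
# (partially) processed, so the caller has to decide what to do.
def call_with_retry(max_attempts: 3)
  attempts = 0
  begin
    attempts += 1
    yield attempts
  rescue RetrySketch::ConnectionFailed
    retry if attempts < max_attempts
    raise
  end
end

# An open timeout (treated as ConnectionFailed) is retried until it succeeds.
result = call_with_retry do |attempt|
  raise RetrySketch::ConnectionFailed, 'open timeout' if attempt < 3
  "ok after #{attempt} attempts"
end
puts result # prints "ok after 3 attempts"

# A read timeout propagates after the first attempt.
read_attempts = 0
begin
  call_with_retry do
    read_attempts += 1
    raise RetrySketch::TimeoutError, 'read timeout'
  end
rescue RetrySketch::TimeoutError
  puts "not retried, attempts: #{read_attempts}" # prints "not retried, attempts: 1"
end
```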

We need to:

Decide how to bubble-up open timeouts
Standardise all adapters to the same behaviour

@coberlin @erik-escobedo @mislav @mistersourcerer would like to hear your thoughts after considering the above 😄

coberlin · 2017-11-13T20:13:14Z

Going with ConnectionFailed for open timeout errors makes sense to me and would provide what I was hoping to get by distinguishing the open timeout errors from the other timeout errors. For adapter consistency, this would mean, for example, that Net::HTTP is ok as it is, but HTTPClient would change, with ConnectTimeoutErrors mapping to ConnectionFailed instead of to TimeoutError.

iMacTia · 2017-11-13T20:41:52Z

That's OK, once we decide we'll standardise all adapters to the same behaviour (in v1.0 obviously, as this will be backward incompatible)

philsturgeon · 2017-11-14T18:55:35Z

Throwing a use-case into the ring:

At work we've been suffering from some Open Timeouts due to Nginx + Kubernetes failing to route to hanging pods (or something). Anyway, NetHTTP used to throw OpenTimeout and ReadTimeout errors, and that was really handy for us debugging which was which.

Now that we've switched to Typhoeus we sadly have all timeouts munged together, and it's tough for us to tell if our work on the nginx + kuber problems has improved things, or if we're just now successfully making more requests to an increasingly struggling system. Either way the number of timeouts is about the same, and without getting them separated we're kinda stuck guessing.

I don't think just adding Faraday::OpenTimeoutError is enough, we should have Faraday::OpenTimeoutError and Faraday::ReadTimeoutError extending from Faraday::TimeoutError IMO.

iMacTia · 2017-11-15T10:22:34Z

@philsturgeon and what about the other proposed solution, would that help as well?

Open timeout -> Faraday::ConnectionFailed
Read timeout -> Faraday::TimeoutError

That should be the behaviour on all adapters, but unfortunately some are not behaving as expected (e.g. Typhoeus).

philsturgeon · 2017-11-15T12:47:03Z

I feel like those are different things.

ConnectionFailed seems like "I have no idea how to talk to this server", like an invalid DNS/IP etc.

OpenTimeout is "I know where this server is, I'm just waiting for it to do a thing"

iMacTia · 2017-11-15T14:25:45Z

> OpenTimeout is "I know where this server is, I'm just waiting for it to do a thing"

I disagree with that, I'd rather say:

Open Timeout: I'm trying to contact the server, but I can't reach it (Note: connection not established or "opened" yet).
Read Timeout: I've established a connection with the server but I'm waiting for it to do a thing (reading the output).

A faulty firewall, proxy, or load balancer is just a simple example of how you might get an open timeout, but in all these cases the connection to the server has not started yet. That's the most important bit for me. "ConnectionFailed" to me simply means: I couldn't connect to the server. And it perfectly suits these cases.

If you still think that a specific Faraday::OpenTimeoutError should exist, then I'd suggest it inherit from ConnectionFailed rather than TimeoutError, but I agree that would be a bit confusing and I'm not sure how it would help in practice.
Please see my previous comment on this issue (#718) for how this might actually help to manage the error.

Does it make sense? I would like to find a solution that fits everyone

philsturgeon · 2017-11-16T14:06:13Z

I accept your more accurate definitions for open timeout but I come to a different conclusion.

You consider an open timeout to be a connection failure, as the amount of time you're willing to wait for that connection is considered part of the connection. "Failed to make a connection in 5s" certainly makes sense if you explain it like that, but that's not how a lot of people think.

For many, open timeout just means it has not happened yet. That makes it less of a definitive statement than most connection failures, which is "The server is down" or "This DNS is garbage".

I suppose it doesn't much matter, as connection failures and open timeouts should both be retried, whereas a read timeout might be considered grounds to back off?

iMacTia · 2017-11-16T15:23:28Z

I agree. We can argue as much as we want about how to interpret it, but the practical upshot of my point is what you said as well: if you get an Open Timeout you can retry the request; if you get a Read Timeout you have to be VERY careful, as your request might have been processed (entirely or partially). Coincidentally, the practical meaning of an open timeout matches that of a failed connection, hence I would make it inherit from there.

Today people are catching ConnectionFailed and TimeoutError exceptions, and the logic behind that very probably reflects what we said before. If we introduce the new exception as a subclass of ConnectionFailed, then chances are high that most (if not all) applications won't need any change.

I understand (and agree) from a semantic point of view though that an OpenTimeout is just another type of Timeout.

But hey, what if we call it ConnectionTimedOut instead?

philsturgeon · 2017-11-16T16:05:50Z

There would be some confusion around open_timeout: X being the name of the property that says how long to wait until throwing a ConnectionTimedOut.

iMacTia · 2017-11-16T16:13:18Z

Good point 😞

philsturgeon · 2017-11-16T16:18:38Z

Call it ConnectionOpenTimeout? It makes it clear it's a connection problem and keeps it clear that it's an opening timeout. I think this name keeps understanding in line with the "Failed to make a connection in X seconds" meaning, even though some people might still wonder why this timeout is not a TimeoutError. 😅

iMacTia · 2017-11-16T16:46:58Z

Sounds good to me 👍!
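For reference, a minimal sketch of the naming agreed on here, assuming a hypothetical `NamingSketch` module; it does not claim that Faraday ships such a class. The point is that existing `ConnectionFailed` rescues keep treating the open timeout as retryable, while new code can single it out.

```ruby
# Hypothetical hierarchy illustrating the proposal; not Faraday's real classes.
module NamingSketch
  class ConnectionFailed < StandardError; end
  class TimeoutError < StandardError; end
  class ConnectionOpenTimeout < ConnectionFailed; end
end

# Existing application code: unchanged, still handles the new class.
def existing_handler
  yield
  :ok
rescue NamingSketch::ConnectionFailed
  :safe_to_retry
rescue NamingSketch::TimeoutError
  :check_with_server
end

# New code can log or count open timeouts separately if it wants to.
def detailed_handler
  yield
  :ok
rescue NamingSketch::ConnectionOpenTimeout
  :open_timeout
rescue NamingSketch::ConnectionFailed
  :connection_failed
end

puts existing_handler { raise NamingSketch::ConnectionOpenTimeout } # prints "safe_to_retry"
puts detailed_handler { raise NamingSketch::ConnectionOpenTimeout } # prints "open_timeout"
puts detailed_handler { raise NamingSketch::ConnectionFailed }      # prints "connection_failed"
```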

ragav0102 · 2019-10-07T18:50:56Z

@iMacTia Is anyone working on this change? I don't mind taking this up for v2.0.

iMacTia · 2019-10-08T10:34:27Z

Hi @ragav0102, thanks for the support!
No one is working on this yet, as we're still pushing to get v1.0 out of the door.

We'd definitely appreciate the help, but we don't have a plan yet for v2.0 so I can't tell when it will be released, so your changes may need to wait months before they can be used.

If you need this in one of your projects, then that's probably not feasible.
If you were just passing by and would like to contribute, I'd suggest you pick something scheduled for v1.0, as it will be released much sooner 😄

ragav0102 · 2019-10-09T04:51:55Z

Got it!

The versions of each of these had fallen a reasonable way behind mainline, which could cause issues when trying to depend on this code in more up-to-date codebases. There were 2 tests that I updated to make them more robust. The timeout one I made the queries explicit that were being mocked. There was another test that only worked up to the year 2020, so updating that to work until the year 3000. The newer versions of Faraday now use a ConnectionFailed error to indicate a timeout when opening the connection (lostisland/faraday#718), so updated accordingly. Restforce 5 requires ruby 2.5, so updated that requirement as well. Also updated the tests to point to the latest version of the API at time of writing.

FangjianLu · 2024-08-01T05:48:18Z

Hi, I have a dumb question: I think it makes sense that a read timeout should raise Faraday::TimeoutError, but why is Faraday::TimeoutError a subclass of Faraday::ServerError? The Faraday::ServerError doc says "Represents 5xx status responses", but I guess there's no 5xx for a read timeout?

iMacTia added the feature label Aug 10, 2017

iMacTia added the help wanted label Aug 15, 2017

coberlin referenced this issue Oct 5, 2017

Handle net http open timeout error.

d1bbabf

This was referenced Nov 13, 2017

Handle all connection timeout messages in Patron #687

Merged

List of features for the next big releases #620

Closed

iMacTia added this to the v1.0 milestone Nov 13, 2017

escoberik mentioned this issue Nov 13, 2017

Faraday::Adapter::Test stubs now support entire urls (with host) #741

Merged

iMacTia modified the milestones: v1.0, v2.0 Feb 28, 2019

iMacTia mentioned this issue May 16, 2019

Net::OpenTimeout is a Faraday::ConnectionFailed and not a Faraday::Ti… #980

Closed

binarycode mentioned this issue Jan 30, 2020

wrap connection and timeout exceptions socketry/async-http-faraday#10

Merged

sigalsax mentioned this issue Mar 1, 2020

Decode JWT token fixes cyberark/conjur#1380

Merged

adam-harwood mentioned this issue Feb 18, 2021

Upgrade json, restforce, and webmock to latest versions. adam-harwood/salesforce_bulk_query#1

Merged

iMacTia modified the milestones: v2.0, v3.0 Jul 31, 2021

iMacTia modified the milestones: v3.0, v2.0 Aug 16, 2021

iMacTia mentioned this issue Aug 16, 2021

Faraday raises different timeout errors depending on adapter #1310

Closed

iMacTia modified the milestones: v2.0, v3.0 Jan 2, 2022

iMacTia mentioned this issue Dec 20, 2022

Error class is different according to adapters lostisland/faraday-net_http#29

Closed

github-actions bot mentioned this issue Jun 5, 2024

ruby3.2-faraday/2.9.1 package update wolfi-dev/os#21387

Merged

github-actions bot mentioned this issue Jun 18, 2024

ruby3.2-faraday/2.9.2 package update wolfi-dev/os#22199

Merged

github-actions bot mentioned this issue Jul 8, 2024

ruby3.2-faraday/2.10.0 package update wolfi-dev/os#23317

Merged

github-actions bot mentioned this issue Jul 31, 2024

ruby3.2-faraday/2.10.1 package update wolfi-dev/os#25340

Merged

github-actions bot mentioned this issue Aug 26, 2024

ruby3.2-faraday/2.11.0 package update wolfi-dev/os#27078

Merged

This was referenced Sep 18, 2024

ruby3.2-faraday/2.12.0 package update wolfi-dev/os#28665

Merged

ruby3.2-faraday/2.12.0 package update wolfi-dev/os#28669

Merged
