Make health probe server more general purpose #1079

jklina · 2023-09-17T17:15:18Z

Howdy and thanks for the great library! 👋 I was looking into a way of adding an endpoint for monitoring metrics on job servers and saw that there was already an existing server used for health checks that uses the Rack interface. I did a quick spike that makes the health check server more general purpose and allows users to configure their own Rack apps for running on instances. Just wanted to get some input before pursuing it further and cleaning things up.

The health check and catchall logic are moved into simple Rack middleware that can be composed by users however they like and be used to preserve existing health check behavior while transitioning to a more general purpose utility server.

Additionally, this allows users to use WEBrick for the probe server in order to have a fully Rack compliant option.

All in all, this pattern will allow users to add whatever functionality they like to GoodJob's web server by composing Rack apps and passing them in through GoodJob's configuration. For example:

config.good_job.middleware = Rack::Builder.app do
  use GoodJob::Middleware::MyCustomMiddleware
  use GoodJob::Middleware::PrometheusExporter
  use GoodJob::Middleware::Healthcheck
  run GoodJob::Middleware::CatchAll
end
config.good_job.middleware_port = 7001
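
As a rough illustration of the composition described above, a health check middleware and a catch-all app could look something like the following. This is a minimal sketch in plain Ruby, not the code in this PR: the class names `HealthcheckSketch` and `CatchAllSketch`, the `/status` path, and the response bodies are all assumptions chosen for illustration.

```ruby
# Hypothetical middleware: answers a status path itself and passes
# everything else to the next app in the stack.
class HealthcheckSketch
  def initialize(app, path: "/status")
    @app = app
    @path = path
  end

  def call(env)
    if env["PATH_INFO"] == @path
      [200, { "content-type" => "text/plain" }, ["OK"]]
    else
      @app.call(env)
    end
  end
end

# Hypothetical terminal app: anything that reaches it is treated as unknown.
class CatchAllSketch
  def self.call(_env)
    [404, { "content-type" => "text/plain" }, ["Not found"]]
  end
end

# Composed by hand here; Rack::Builder's `use`/`run` produces the same nesting.
app = HealthcheckSketch.new(CatchAllSketch)

status, _headers, body = app.call("PATH_INFO" => "/status")
puts "#{status} #{body.join}"
status, _headers, body = app.call("PATH_INFO" => "/nope")
puts "#{status} #{body.join}"
```

Because each piece only needs to respond to `call(env)`, users could reorder or drop pieces without touching the server itself.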

This could help resolve:

* bensheldon#750
* bensheldon#532

bensheldon

I have mixed feelings:

* I'm worried that folks will want more and more out of the probe server (it's single-threaded and blocking), and that's not where I want to spend my development time. The extreme reaction (which I realize you're not proposing, but is my fear) is that if someone needs a web server, they should be running GoodJob in Puma, not recreating Puma in GoodJob.
* I'm not sure how defensive to be if folks are running arbitrary Rack apps. For example, should GoodJob include the ActionDispatch::Executor middleware in case someone adds custom middleware that touches Rails autoloaded objects or Active Record, even though GoodJob doesn't need it itself? Otherwise there might be deadlocks or leaked database connections, which I imagine will be a support burden.
* I recognize that being able to add Prometheus metrics or other custom health checks without having to spin up yet another process would be nice (though maybe if you're already running kube and Prometheus that shouldn't be a dealbreaker). But I get it, that would be helpful for advanced systems.
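
To make the executor concern concrete, here is a minimal sketch of the general idea: a middleware that runs the downstream app inside an executor block. The names `ExecutorWrapSketch` and `RecordingExecutor` are hypothetical, and the stand-in executor only records events. In a Rails app the executor would presumably be `Rails.application.executor`, and the real ActionDispatch::Executor is more careful than this sketch.

```ruby
# Hypothetical middleware that runs each request inside an executor block.
# `executor` is anything that responds to `wrap` and yields to a block.
class ExecutorWrapSketch
  def initialize(app, executor)
    @app = app
    @executor = executor
  end

  def call(env)
    @executor.wrap { @app.call(env) }
  end
end

# Stand-in executor for illustration only; it records enter/exit events.
class RecordingExecutor
  attr_reader :events

  def initialize
    @events = []
  end

  def wrap
    @events << :enter
    yield
  ensure
    @events << :exit
  end
end

executor = RecordingExecutor.new
inner = lambda do |_env|
  executor.events << :app
  [200, {}, ["OK"]]
end

app = ExecutorWrapSketch.new(inner, executor)
response = app.call({})
p executor.events
```

The point of the wrapper is that setup and cleanup happen around every request, even when the user's app raises.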

So maybe my objections here could be overcome fully by defensive naming:

* Leave the name as ProbeServer, implying that the purpose is probing the process (I think "Utility" is too vague).
* "Middleware" is a bit overloaded in the background job space, as it's also used to describe job customization (though not in GoodJob). We should keep it namespaced as ProbeServer::Middleware::...
* Let's call the configuration for the Rack app: config.good_job.probe_server_app =

Otherwise I like the PR 😄 I think it cleaves the behaviors in the right place and allows flexibility without necessitating it.

shouichi · 2023-09-25T05:53:12Z

Is GoodJob::HttpServer meant to be Rack-compliant? The current implementation doesn't seem to be. Rack specifies various mandatory fields, and some Rack apps assume those fields exist, so some apps won't work.
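
As a rough way to see the gap being described, the sketch below checks an env hash for a handful of keys that the Rack SPEC expects a server to provide. The key list is an illustrative subset only (the SPEC is the authoritative source), and `missing_env_keys` and the sample env hashes are hypothetical.

```ruby
require "stringio"

# An illustrative subset of environment keys that the Rack SPEC expects a
# server to provide. The SPEC itself is the authoritative list.
EXPECTED_ENV_KEYS = %w[
  REQUEST_METHOD
  SERVER_NAME
  QUERY_STRING
  rack.url_scheme
  rack.errors
].freeze

# Returns the expected keys that are absent from the given env hash.
def missing_env_keys(env)
  EXPECTED_ENV_KEYS.reject { |key| env.key?(key) }
end

# An env like one a very small hand-rolled server might build (assumed).
sparse_env = { "REQUEST_METHOD" => "GET", "PATH_INFO" => "/status" }

# The same env with the missing keys filled in.
fuller_env = sparse_env.merge(
  "SERVER_NAME" => "localhost",
  "QUERY_STRING" => "",
  "rack.url_scheme" => "http",
  "rack.errors" => StringIO.new
)

p missing_env_keys(sparse_env)
p missing_env_keys(fuller_env)
```

Middleware that reads any of the absent keys without a fallback would likely fail on the sparse env.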

jklina · 2023-09-25T13:16:24Z

@shouichi probably not. The more I think about this the more I'm worried it'll open a can of worms. I'm thinking of just closing this for now and if there's more demand down the line, I'd be happy to revisit and see it through making its limitations very visible.

bensheldon · 2023-09-26T01:52:50Z

oh, the Rack spec is kind of a lot: https://github.com/rack/rack/blob/main/SPEC.rdoc

I do like the idea of allowing it to be extensible, but agree that I don't want to set the expectation that anything more than a trivial rack middleware will work.

shouichi · 2023-09-26T02:11:31Z

I tried a "trivial" example from https://www.rubydoc.info/gems/rack/Rack/Builder but it didn't work.

app = Rack::Builder.new do
  use Rack::CommonLogger
  map "/ok" do
    run lambda { |env| [200, {'content-type' => 'text/plain'}, ['OK']] }
  end
end

run app

The spec is a lot, and implementing it in GoodJob might be a rabbit hole. I guess it's better to use an existing server (e.g., webrick).

Other than the spec, error handling is not trivial work. I recently opened #1083 but there are potentially more corner cases. Search for Errno in webrick/puma and you'll see a lot of them.

bensheldon · 2023-12-02T17:45:24Z

I think I do like the idea of pluggable middleware; it would be nice if any Rack middleware would work. I think that has me lean towards reintroducing webrick.

Would it be possible to keep the array of middleware and add an additional option that was like (ugh, I don't like the name):

config.good_job.probe_server_server = :good_job # or webrick

And then folks could choose, and we'd just have a warning in the readme like "the default :good_job is very barebones and likely won't work if you get complicated."

jklina · 2023-12-04T13:34:43Z

I'm wondering, if Webrick is going to become a dependency of GoodJob anyway, do we keep the current homespun server or nix it completely? Removing the homespun server would be one less configuration option to worry about (I feel like there might be some confusion about selecting servers) and anything that'd work with the homespun server should work with Webrick.

If that's the case maybe we break this into two parts:

1. Reverting "Replace Webrick with custom simple http server" (#1030)
2. Finishing up this PR for use with Webrick

bensheldon · 2023-12-04T15:42:04Z

> if Webrick is going to become a dependency of GoodJob anyway

Because it would be optional, I wouldn't add Webrick to the gemspec, and instead simply require it, if configured, and rescue the LoadError if it hasn't been added.
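
The optional-dependency approach described here could be sketched roughly as follows. This is an assumption-laden illustration, not GoodJob's implementation: `resolve_handler`, the `:builtin` fallback name, and the library names passed in are all hypothetical.

```ruby
# Hypothetical helper: try to load an optional library and fall back to a
# default handler name when it is not installed.
def resolve_handler(requested, library:, fallback: :builtin)
  return fallback if requested.nil? || requested == fallback

  require library
  requested
rescue LoadError
  warn "#{library} could not be loaded; falling back to #{fallback}"
  fallback
end

# A library name that should not exist, to exercise the fallback path.
p resolve_handler(:webrick, library: "surely_not_an_installed_library_xyz")

# "json" stands in for an optional dependency that happens to be installed.
p resolve_handler(:demo, library: "json")
```

Deferring the `require` until the handler is actually requested keeps the dependency out of the gemspec, at the cost of discovering a missing gem only at boot time.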

I hear you about complexity though 😰

jklina · 2024-01-05T21:12:34Z

I went ahead and took a stab at implementing the suggestions. I'll keep my eyes open for any further suggestions during review!

bensheldon · 2024-01-06T01:37:01Z

I like this! I just pushed up a few tweaks and I will accept this.

* Make health probe server more general purpose

  This removes the health check logic from the ProbeServer and renames the ProbeServer to UtilityServer that accepts any Rack based app. The health check and catchall logic are moved into simple Rack middleware that can be composed by users however they like and be used to preserve existing health check behavior while transitioning to a more general purpose utility server.

  All and all this pattern will allow users to add whatever functionality they like to GoodJob's web server by composing Rack apps and using GoodJob's configuration to pass in users' Rack apps. IE:

  ```
  config.good_job.middleware = Rack::Builder.app do
    use GoodJob::Middleware::MyCustomMiddleware
    use GoodJob::Middleware::PrometheusExporter
    use GoodJob::Middleware::Healthcheck
    run GoodJob::Middleware::CatchAll
  end
  config.good_job.middleware_port = 7001
  ```

  This could help resolve:

  * bensheldon/good_job#750
  * bensheldon/good_job#532

* Use new API
* Revert server name change

  We decided to leave the original ProbeServer name better sets expectations. See: bensheldon/good_job#1079 (review) This also splits out middleware testing into separate specs.

* Restore original naming

  This also helps ensure that the existing behavior and API remain intact.

* Appease linters
* Add required message for mock
* Make test description relevant
* Allow for handler to be injected into ProbeServer
* Add WEBrick WEBrick handler
* Add WEBrick as a development dependency
* Add WEBrick tests and configuration
* Add idle_timeout method to mock
* Namespace server handlers
* Warn and fallback when WEBrick isn't loadable

  Since the probe server has the option to use WEBrick as a server handler, but this library doesn't have WEBrick as a dependency, we want to throw a warning when WEBrick is configured, but not in the load path. This will also gracefully fallback to the built in HTTP server.

* inspect load path
* Account for multiple webrick entries in $LOAD_PATH
* try removing load path test
* For error on require to initiate test

  As opposed to manipulating the load path.

* Handle explicit nils in intialization
* Allow probe_handler to be set in configuration
* Add documentation for probe server customization
* Appease linter
* retrigger CI
* Rename `probe_server_app` to `probe_app`; make handler name a symbol; rename Rack middleware/app for clarity
* Update readme to have relevant app example
* Fix readme grammar

---------

Co-authored-by: Ben Sheldon [he/him] <[email protected]>
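
Going by the option names in the commit list above (`probe_app`, `probe_handler`), configuration might end up looking roughly like the sketch below. The Rack app and its placeholder response body are made up for illustration, and the initializer lines are left as comments because they need a Rails app with GoodJob loaded. The `:webrick` value is an assumption based on the discussion; the README is the place to check actual usage.

```ruby
# Hypothetical Rack app exposing a placeholder metrics endpoint.
metrics_app = lambda do |env|
  if env["PATH_INFO"] == "/metrics"
    [200, { "content-type" => "text/plain" }, ["placeholder_metric 0\n"]]
  else
    [404, { "content-type" => "text/plain" }, ["Not found"]]
  end
end

# In a Rails initializer this would presumably be wired up as:
#   config.good_job.probe_app = metrics_app
#   config.good_job.probe_handler = :webrick

status, _headers, body = metrics_app.call("PATH_INFO" => "/metrics")
puts "#{status} #{body.join}"
```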

bensheldon reviewed Sep 18, 2023

jklina added 9 commits December 6, 2023 20:30

Merge remote-tracking branch 'origin/main' into make-web-server-general-purpose

ff1c8ad

Use new API

7a2ee38

Revert server name change

eb63619

We decided to leave the original ProbeServer name better sets expectations. See: bensheldon#1079 (review) This also splits out middleware testing into separate specs.

Restore original naming

ea9a964

This also helps ensure that the existing behavior and API remain intact.

Appease linters

6535288

Add required message for mock

8ff54d7

Make test description relevant

a6ee6db

Allow for handler to be injected into ProbeServer

d5520aa

Add WEBrick WEBrick handler

ac2885c

jklina force-pushed the make-web-server-general-purpose branch from dac7ded to ac2885c Compare January 2, 2024 19:36

jklina added 11 commits January 2, 2024 19:45

Add WEBrick as a development dependency

f92de5b

Add WEBrick tests and configuration

911798d

Merge branch 'main' into make-web-server-general-purpose

ed08af0

Add idle_timeout method to mock

cd59f43

Namespace server handlers

54955d1

inspect load path

59efbe9

Account for multiple webrick entries in $LOAD_PATH

0239a9b

try removing load path test

c0d66ce

For error on require to initiate test

4226b27

As opposed to manipulating the load path.

Handle explicit nils in intialization

0b8375f

jklina added 5 commits January 5, 2024 17:42

Allow probe_handler to be set in configuration

be6bb19

Add documentation for probe server customization

0615186

Appease linter

3359a1d

Merge branch 'main' into make-web-server-general-purpose

7ecca37

retrigger CI

5a1fdcd

jklina marked this pull request as ready for review January 5, 2024 21:05

bensheldon added 3 commits January 5, 2024 17:31

Rename probe_server_app to probe_app; make handler name a symbol; rename Rack middleware/app for clarity

5429fd7

Update readme to have relevant app example

1c67ba6

Fix readme grammar

206660a

Merge branch 'main' into make-web-server-general-purpose

2d9a4d2

bensheldon added the enhancement label Jan 23, 2024

bensheldon merged commit eec1f4b into bensheldon:main Jan 23, 2024
20 checks passed
