-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pluginify Ops Agent #1840
base: master
Are you sure you want to change the base?
Pluginify Ops Agent #1840
Conversation
e555318
to
34cb619
Compare
56e169b
to
92d4e0a
Compare
@@ -4435,7 +4435,7 @@ func getRecentServiceOutputForImage(imageSpec string) string { | |||
}, ";") | |||
return cmd | |||
} | |||
return "sudo systemctl status google-cloud-ops-agent" | |||
return "sudo cat /var/log/google-cloud-ops-agent/health-checks.log && sudo systemctl status google-cloud-ops-agent" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why are we moving health checks to their own log file? systemctl status
is where users expect to see error messages.
@@ -27,4 +27,4 @@ ExecStartPre=@PREFIX@/libexec/google_cloud_ops_agent_engine -service=fluentbit - | |||
ExecStart=@PREFIX@/libexec/google_cloud_ops_agent_wrapper -config_path @SYSCONFDIR@/google-cloud-ops-agent/config.yaml -log_path ${LOGS_DIRECTORY}/subagents/logging-module.log @PREFIX@/subagents/fluent-bit/bin/fluent-bit --config ${RUNTIME_DIRECTORY}/fluent_bit_main.conf --parser ${RUNTIME_DIRECTORY}/fluent_bit_parser.conf --storage_path ${STATE_DIRECTORY}/buffers | |||
Restart=always | |||
# For debugging: | |||
RuntimeDirectoryPreserve=yes | |||
RuntimeDirectoryPreserve=yes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You have whitespace changes in these service files (it looks like you're removing the trailing newline).
Is there a design doc you can link me to for these changes? |
90a9278
to
6aa4435
Compare
…Please run `go run ./dockerfiles`"
…-agent/libexec/google_cloud_ops_agent_uap_plugin"
status`. Updated the test to check `health-checks.log` to confirm health check statuses.
off commands that restart. Also added designated signal handlers for each go routines; the signal handler responds to SIGTERM and cancels the context on trigger.
now causes Start() to return an RPC error. Context will be cancelled on config generation failures, resulting in GetStatus() returning an inactive status.
start up more than 3 times, the plugin will stop restarting the command.
b8ee43e
to
ff2299a
Compare
output then the systemd service is also not present. Added the case to the conflicting installation check.
Description
Related issue
How has this been tested?
Checklist: