Pluginify Ops Agent #1840

Draft
wants to merge 18 commits into
base: master

Conversation

XuechunHou
Contributor

Description

Related issue

How has this been tested?

Checklist:

  • Unit tests
    • Unit tests do not apply.
    • Unit tests have been added/modified and passed for this PR.
  • Integration tests
    • Integration tests do not apply.
    • Integration tests have been added/modified and passed for this PR.
  • Documentation
    • This PR introduces no user visible changes.
    • This PR introduces user visible changes and the corresponding documentation change has been made.
  • Minor version bump
    • This PR introduces no new features.
    • This PR introduces new features, and there is a separate PR to bump the minor version since the last release already.
    • This PR bumps the version.

@XuechunHou XuechunHou added the kokoro:force-run Forces kokoro to run integration tests on a CL label Nov 30, 2024
@stackdriver-instrumentation-release stackdriver-instrumentation-release removed the kokoro:force-run Forces kokoro to run integration tests on a CL label Nov 30, 2024
@XuechunHou XuechunHou force-pushed the xuechun-go-yaml branch 2 times, most recently from 56e169b to 92d4e0a Compare December 2, 2024 17:56
@XuechunHou XuechunHou added the kokoro:force-run Forces kokoro to run integration tests on a CL label Dec 2, 2024
@stackdriver-instrumentation-release stackdriver-instrumentation-release removed the kokoro:force-run Forces kokoro to run integration tests on a CL label Dec 2, 2024
@@ -4435,7 +4435,7 @@ func getRecentServiceOutputForImage(imageSpec string) string {
}, ";")
return cmd
}
- return "sudo systemctl status google-cloud-ops-agent"
+ return "sudo cat /var/log/google-cloud-ops-agent/health-checks.log && sudo systemctl status google-cloud-ops-agent"
Member

Why are we moving health checks to their own log file? systemctl status is where users expect to see error messages.

@@ -27,4 +27,4 @@ ExecStartPre=@PREFIX@/libexec/google_cloud_ops_agent_engine -service=fluentbit -
ExecStart=@PREFIX@/libexec/google_cloud_ops_agent_wrapper -config_path @SYSCONFDIR@/google-cloud-ops-agent/config.yaml -log_path ${LOGS_DIRECTORY}/subagents/logging-module.log @PREFIX@/subagents/fluent-bit/bin/fluent-bit --config ${RUNTIME_DIRECTORY}/fluent_bit_main.conf --parser ${RUNTIME_DIRECTORY}/fluent_bit_parser.conf --storage_path ${STATE_DIRECTORY}/buffers
Restart=always
# For debugging:
- RuntimeDirectoryPreserve=yes
+ RuntimeDirectoryPreserve=yes
Member

You have whitespace changes in these service files (it looks like you're removing the trailing newline).

@quentinmit
Member

Is there a design doc you can link me to for these changes?

@quentinmit quentinmit marked this pull request as draft December 3, 2024 21:20
…-agent/libexec/google_cloud_ops_agent_uap_plugin"
status`. Updated the test to check `health-checks.log` to confirm health
check statuses.
off commands that restart.

Also added a designated signal handler for each goroutine; the signal
handler responds to SIGTERM and cancels the context when triggered.
now causes Start() to return an RPC error. Context will be cancelled on
config generation failures, resulting in GetStatus() returning an
inactive status.
start up more than 3 times, the plugin will stop restarting the command.
@XuechunHou XuechunHou added the kokoro:force-run Forces kokoro to run integration tests on a CL label Dec 19, 2024
@stackdriver-instrumentation-release stackdriver-instrumentation-release removed kokoro:force-run Forces kokoro to run integration tests on a CL labels Dec 19, 2024
output then the systemd service is also not present. Added the case to
the conflicting installation check.
3 participants