Get AEE runner from OpenstackVersion #469

Merged
marios merged 1 commit into openstack-k8s-operators:main from the aeeversionrunner branch on May 30, 2024

@fao89 fao89 commented May 27, 2024

@fao89 fao89 requested a review from cescgina May 27, 2024 10:14
@@ -5,6 +5,7 @@ metadata:
  namespace: openstack
spec:
  customContainerImages:
    ansibleeeImage: {{ ansibleee_runner_img | default('quay.io/openstack-k8s-operators/openstack-ansibleee-runner:latest')}}
Suggested change
-    ansibleeeImage: {{ ansibleee_runner_img | default('quay.io/openstack-k8s-operators/openstack-ansibleee-runner:latest')}}
+{% if not skip_patching_ansibleee_csv is defined or not skip_patching_ansibleee_csv | bool %}
+    ansibleeeImage: {{ ansibleee_runner_img | default('quay.io/openstack-k8s-operators/openstack-ansibleee-runner:latest')}}
+{% endif %}

@cescgina cescgina left a comment

We need something a bit more complex here. With the variable skip_patching_ansibleee_csv we can skip setting the ansible-runner image and rely on the operator default, which is what we used when testing the downstream beta. We need some Jinja templating here to retain that functionality. I left a suggestion inline, but we'd still need to test it.
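
To make the intent of the suggested condition easier to check, here is a minimal Python sketch of when the `ansibleeeImage` line would be emitted. The function names are hypothetical, and `ansible_bool` only approximates Ansible's `bool` filter for common inputs; the real filter's behavior may differ between Ansible versions.

```python
# Hypothetical helper names; this is an illustration, not code from the PR.
# Strings that the approximation treats as true (assumption based on the
# commonly documented behavior of Ansible's `bool` filter).
TRUTHY = {"yes", "on", "1", "true"}


def ansible_bool(value):
    """Approximate Ansible's `bool` filter for common inputs."""
    if value is None or isinstance(value, bool):
        return bool(value)
    if isinstance(value, str):
        return value.lower() in TRUTHY
    return value == 1


def sets_runner_image(template_vars):
    """Mirror the suggested condition:
    not skip_patching_ansibleee_csv is defined
    or not skip_patching_ansibleee_csv | bool
    """
    if "skip_patching_ansibleee_csv" not in template_vars:
        # Variable undefined: keep setting the image explicitly.
        return True
    return not ansible_bool(template_vars["skip_patching_ansibleee_csv"])
```

Under these assumptions, the image is set when the variable is undefined or false-like, and omitted (falling back to the operator default) only when it is true-like.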

@fao89 fao89 force-pushed the aeeversionrunner branch from 4cc4fcd to 7229889 Compare May 27, 2024 10:47
Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/aa82c5d2399f40f7ab30ee20efa398fa

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 32m 24s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 37m 33s

fao89 commented May 27, 2024

recheck

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/4f766037a37f405a83b7f58b44fef989

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 29m 41s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 31m 29s

@fao89 fao89 force-pushed the aeeversionrunner branch from 7229889 to 3bfd3d7 Compare May 27, 2024 17:02
Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/c3604e9bd59e4e37abb7b237c37ea1ca

data-plane-adoption-osp-17-to-extracted-crc FAILURE in 1h 20m 22s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 37m 07s

fao89 commented May 27, 2024

recheck

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/4f91e7fb34d8412c9415f6342792a064

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 32m 15s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 29m 00s

@marios marios left a comment

Looks OK, but I wonder whether the non-ceph job failure is related. I don't think so, but the job has not passed on this patch. Perhaps we have a new issue/cix there; @cescgina, do you know?

I hit this issue when developing the multinode (it was rabbit related at that time, but it might be something else in this case):

https://logserver.rdoproject.org/69/469/3bfd3d7b768a9a5cd46985a4bbf5a371bf2195ad/github-check/data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph/a056775/controller/data-plane-adoption-tests-repo/data-plane-adoption/tests/logs/test_minimal_out_2024-05-27T21:02:14EDT.log

        * TASK [dataplane_adoption : wait for cell1 Nova compute EDPM services version updated] ***
        * fatal: [localhost]: FAILED! => {"attempts": 20, "changed": true, "cmd": "set -euxo pipefail\n\n\nPODIFIED_MARIADB_IP=10.217.5.136\nPODIFIED_CELL1_MARIADB_IP=10.217.5.132\nPODIFIED_DB_ROOT_PASSWORD=\"12345678\"\n\n# The CHARACTER_SET and collation should match the source DB\n# if the do not then it will break foreign key relationships\n# for any tables that are created in the future as part of db sync\nCHARACTER_SET=utf8\nCOLLATION=utf8_general_ci\n\noc exec openstack-cell1-galera-0 -c galera -- mysql -rs -uroot -p$PODIFIED_DB_ROOT_PASSWORD  -e \"select a.version from nova_cell1.services a join nova_cell1.services b where a.version!=b.version and a.binary='nova-compute';\"\n", "delta": "0:00:00.246207", "end": "2024-05-27 21:50:31.918957", "msg": "", "rc": 0, "start": "2024-05-27 21:50:31.672750", "stderr": "+ PODIFIED_MARIADB_IP=10.217.5.136\n+ PODIFIED_CELL1_MARIADB_IP=10.217.5.132\n+ PODIFIED_DB_ROOT_PASSWORD=12345678\n+ CHARACTER_SET=utf8\n+ COLLATION=utf8_general_ci\n+ oc exec openstack-cell1-galera-0 -c galera -- mysql -rs -uroot -p12345678 -e 'select a.version from nova_cell1.services a join nova_cell1.services b where a.version!=b.version and a.binary='\\''nova-compute'\\'';'", "stderr_lines": ["+ PODIFIED_MARIADB_IP=10.217.5.136", "+ PODIFIED_CELL1_MARIADB_IP=10.217.5.132", "+ PODIFIED_DB_ROOT_PASSWORD=12345678", "+ CHARACTER_SET=utf8", "+ COLLATION=utf8_general_ci", "+ oc exec openstack-cell1-galera-0 -c galera -- mysql -rs -uroot -p12345678 -e 'select a.version from nova_cell1.services a join nova_cell1.services b where a.version!=b.version and a.binary='\\''nova-compute'\\'';'"], "stdout": "57", "stdout_lines": ["57"]}

This comes from the following task:

- name: wait for cell1 Nova compute EDPM services version updated
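
As a rough illustration of what the failing check measures, the sketch below runs the same self-join against an in-memory SQLite table. The table contents and the version value 66 are made up for the example, the `nova_cell1` schema prefix is dropped, and `binary` is quoted so the statement runs under SQLite. The task presumably retries until the query prints nothing; the `57` in the log suggests at least one `nova-compute` row still differed from some other row.

```python
import sqlite3

# The query from the failing task, adapted for SQLite (assumption: same
# semantics as on MariaDB for this simple join).
QUERY = """
select a.version from services a join services b
where a.version != b.version and a."binary" = 'nova-compute';
"""


def mismatched_versions(rows):
    """Return the query output for a list of (binary, version) rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute('create table services ("binary" text, version integer)')
    conn.executemany("insert into services values (?, ?)", rows)
    result = [row[0] for row in conn.execute(QUERY)]
    conn.close()
    return result


# Hypothetical data: a compute service still reporting an older version.
stale = mismatched_versions([("nova-compute", 57), ("nova-conductor", 66)])
# Hypothetical data: every service reports the same version.
updated = mismatched_versions([("nova-compute", 66), ("nova-conductor", 66)])
```

Note that the query filters only `a.binary`, so a `nova-compute` row is reported when its version differs from any row in the table, not just from other compute services.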

marios commented May 28, 2024

The same issue was found on another pull request, so it looks like a cix:

https://logserver.rdoproject.org/62/462/429efde0f496a6073d322512c0ec554120e4e7fa/github-check/data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph/a738c36/controller/data-plane-adoption-tests-repo/data-plane-adoption/tests/logs/test_minimal_out_2024-05-27T13:54:14EDT.log

fao89 commented May 28, 2024

recheck

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/7c70998ed34747b5865eec235ace0d37

data-plane-adoption-osp-17-to-extracted-crc RETRY_LIMIT in 3s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 32m 20s

marios commented May 29, 2024

recheck

Mainly I want to see whether this NODE_FAILURE is a real thing (only 2 from yesterday: https://review.rdoproject.org/zuul/builds?job_name=data-plane-adoption-osp-17-to-extracted-crc&skip=0).

I don't think we really need to re-run here: it was tested in testproject and seen to do the right thing. I think we can force-merge it today. The non-ceph issue is being tracked in https://issues.redhat.com/browse/OSPCIX-307, which may be fixed now (openstack-k8s-operators/ci-framework#1757).

@cescgina cescgina left a comment

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/d0549132a8c242f19cd12fd3a27250ee

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 28m 55s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph POST_FAILURE in 2h 35m 24s

fao89 commented May 29, 2024

recheck

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://review.rdoproject.org/zuul/buildset/269abd60e7424481a2c3ce775ec66ae0

✔️ data-plane-adoption-osp-17-to-extracted-crc SUCCESS in 2h 32m 41s
data-plane-adoption-osp-17-to-extracted-crc-minimal-no-ceph FAILURE in 2h 32m 36s

@marios marios merged commit 83f8cb0 into openstack-k8s-operators:main May 30, 2024
1 of 2 checks passed