FE HA validation framework #22

dandrushko · 2025-07-14T07:40:26Z

No description provided.

balazsbme · 2025-07-14T11:21:54Z

roles/fe_ha/defaults/main.yml

@@ -0,0 +1,6 @@
+---
+#################### TEST CONFIGURATION VARIABLES ####################


Let's move this parameter and follow the decision we made after Bruno's feedback to create a hierarchy of config params.
See the docs here: https://github.com/OpenNebula/one-infra/wiki/Development-and-Testing-Post%E2%80%90deployment-Validation-tools#variables

Let's discuss to what extent this is applicable to the validation framework and where we should store role-specific variables.

Okay, let's discuss and conclude in chat.

I think the role-specific variables can follow the same hierarchy, together with a control flag for skip/execute. As I understood it, we agreed on this decision in PR #20.

balazsbme · 2025-07-14T11:25:38Z

roles/fe_ha/tasks/main.yml

+- name: Wait for leader failover
+  pause:
+    seconds: 20 
+    prompt: "Waiting for OpenNebula to elect a new leader. Press Enter to continue after 1 minutes, or Ctrl+C to abort."


I think we should not wait for user input here. Instead, look for a log message or the zone state (or check the leader election state directly somehow) and wait with a specified timeout.

OK, makes sense. I will remove the prompt here and just wait for 20 seconds.
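The timeout-based wait suggested above could be sketched roughly as follows. This is only an illustration, not the change made in this PR: it assumes zone ID 0 and that the output of `onezone show` contains the word `leader` once a server has been elected. The registered variable name is hypothetical.

```yaml
# Sketch only: poll the zone state instead of pausing for user input.
# Assumptions: zone ID 0, and `onezone show` output that contains the
# word "leader" once a new leader has been elected.
- name: Wait for leader failover
  ansible.builtin.command:
    cmd: onezone show 0
  register: fe_ha_zone_status   # hypothetical variable name
  changed_when: false           # read-only check, never reports "changed"
  until: "'leader' in fe_ha_zone_status.stdout"
  retries: 12                   # give up after 12 attempts
  delay: 5                      # seconds between attempts
```

With these values the task fails after roughly one minute without a leader, so a stalled election shows up as a test failure rather than a fixed sleep that may be too short or too long.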

balazsbme · 2025-07-14T11:29:04Z

roles/fe_ha/tasks/main.yml

+  pause:
+    seconds: 20 
+    prompt: "Waiting for OpenNebula to elect a new leader. Press Enter to continue after 1 minutes, or Ctrl+C to abort."
+  run_once: true 


run_once should not be used, as recommended by Michal in the one-deploy coding style: https://github.com/OpenNebula/one-deploy/wiki/code_style#4-be-careful-with-run_once

Because of this, in the connectivity matrix I also use logic that makes sure the task runs only once, on the first frontend. In this case we may have to find another option, because we might get different behaviour depending on whether the leader status check happens to run on the "initial_leader" or on any other node.

I think this is a slightly different case, i.e. we don't have a parallel operation here, we just need the output of the onezone command from one of the nodes. Let's discuss if you see any risk here.

Yeah, as I understand it, run_once will just run on a random node of the group. And the test would fail earlier anyway if we did not run these on the FE nodes... So I guess it is fine. I would prefer a consistent approach to run_once, but we can live with it.
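For comparison, pinning the check to one explicit frontend instead of relying on run_once might look like the sketch below. Note that Ansible documents run_once as executing on the first host of the current batch, not on a random one. The group lookup mirrors the commented-out `when` expression in the diff; the task name, the registered variable, and zone ID 0 are assumptions.

```yaml
# Sketch only: run the status check on one known frontend instead of
# using run_once. The group expression mirrors the one in the diff;
# the task and variable names are hypothetical.
- name: Get zone status from the first frontend
  ansible.builtin.command:
    cmd: onezone show 0           # assumes zone ID 0
  register: fe_ha_zone_status     # hypothetical variable name
  changed_when: false             # read-only check
  when: inventory_hostname == groups[frontend_group | d('frontend')][0]
```

On every other host the task is skipped and the registered variable only holds a skipped result, so later tasks would need to read it through `hostvars` of the first frontend.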

balazsbme · 2025-07-14T11:32:45Z

playbooks/fe-ha.yml

Shouldn't we include this in the main "validation.yml" so it runs together with the other tests? In that case we can also use the same HTML report, with no need for a new file just for this test case.

The majority of the latest deployments are non-HA, so IMO this role should be conditionally included only for HA deployments.

Okay, then we should figure out what the condition should be. I think the easiest option is still to put the params for this test case under "validation.fe_ha" and control it with "validation.run_fe_ha". That would be pretty consistent, and we can disable it by default.
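A minimal sketch of that proposal is shown below. Only `validation.fe_ha` and `validation.run_fe_ha` come from the discussion; the parameter under `fe_ha`, the file locations in the comments, and the task name are hypothetical.

```yaml
# Sketch only. First document: defaults following the proposed hierarchy
# (file location is an assumption).
validation:
  run_fe_ha: false          # disabled by default, enable for HA deployments
  fe_ha:
    failover_timeout: 60    # hypothetical parameter, in seconds
---
# Second document: conditional include in the main validation playbook's
# task list (placement is an assumption).
- name: Run FE HA validation
  ansible.builtin.include_role:
    name: fe_ha
  when: validation.run_fe_ha | d(false) | bool
```

Because the flag defaults to false, non-HA deployments skip the role entirely, and HA deployments opt in by setting `validation.run_fe_ha: true`.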

tinova · 2025-07-14T16:33:02Z

roles/fe_ha/tasks/main.yml

+  #when: hostvars[groups[frontend_group | d('frontend')][1]]['ansible_host'] == ansible_host    
+  run_once: true
+
+- name: Save a new Leader node


Let's use lowercase "leader" for consistency.

FE HA validation framework

fde4816

dandrushko requested review from tinova and balazsbme July 14, 2025 07:40

balazsbme requested changes Jul 14, 2025

tinova requested changes Jul 14, 2025

tinova approved these changes Jul 14, 2025

tinova requested changes Jul 14, 2025
