multi-master explorer support for high availability #44
Labels
No labels
prio_critical
prio_low
type_bug
type_contact
type_issue
type_lead
type_question
type_story
type_task
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
lhumina_code/hero_compute#44
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Description
Currently hero_compute supports only a single master (explorer) node. If the master goes down, the explorer is unavailable and workers cannot register or have their VMs managed remotely. We need multi-master support for high availability.
Current Architecture
Proposed Architecture
What Already Works
What Needs To Be Built
Phase 1: State replication between explorers
Phase 2: Worker multi-master awareness
Phase 3: Consistency and health
Design Considerations
proxy.
Acceptance Criteria