Adding a Worker Node to Single-Node OpenShift (SNO)

OKD Homelab Series - This article is part of a series.

Part 1: Installing OKD 4.22 Single-Node on Bare Metal: A Homelab Guide That Actually Works

Part 2: This Article

Part 3: The Good, the Bad, and the Ugly: Lessons from an AI-Assisted Infrastructure Failure

Single-Node OpenShift gets you a working cluster on one machine. The moment you actually start deploying things, two limits show up:

Anti-affinity blocks replicas. A PerconaXtraDBCluster with size: 2 never fully schedules: the pods require different hosts, so the second one sits Pending forever.
Maintenance is scary. Reboot the SNO node and the whole cluster goes dark.

Both go away as soon as you add a worker. Here’s the actual flow on OKD 4.22 SCOS, no Assisted Installer, no agent ISO.

The shape of the trick

SNO doesn’t ship a worker-ignition by default — there’s no machine-api operator wired to spit out a worker bootstrap image. What you do have:

A working API server at https://api.okd.example.com:6443
A machine-config-server (MCS) at https://api-int.okd.example.com:22623
Existing worker MachineConfigPool with all the desired config baked in

So the recipe is:

Boot the new host on a plain SCOS live ISO.
Run coreos-installer install from the live session, embedding an ignition URL whose config points at the running MCS’s worker endpoint.
The new host pulls config, pivots, joins the cluster as a worker.

That’s it. Three commands, mostly.

Pre-flight on the new box

Anything that boots SCOS will work — bare metal, Proxmox VM, NUC. Mine is a Beelink mini PC.

4+ cores, 16 GB RAM, 250 GB SSD minimum
Able to reach the SNO over the network (the new node will pull from api-int.okd.example.com:22623)
DNS resolves api, api-int, and *.apps of your cluster (point the box at Technitium, as you did for the SNO)
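The DNS requirement is the one most worth checking before you install anything. A minimal sketch, meant for the new box's live shell: it assumes the okd.example.com domain used in this article, and it uses test.apps as a stand-in for the *.apps wildcard (any label under the wildcard should resolve). The function names are mine, not part of any tool.

```shell
# Print the three names the worker needs to resolve.
# "test.apps" is a hypothetical label standing in for the *.apps wildcard.
cluster_names() {
  printf '%s\n' "api.$1" "api-int.$1" "test.apps.$1"
}

# Resolve each name with getent and report every failure
# instead of stopping at the first one.
check_dns() {
  rc=0
  for name in $(cluster_names "$1"); do
    if getent hosts "$name" >/dev/null 2>&1; then
      echo "ok    $name"
    else
      echo "FAIL  $name"
      rc=1
    fi
  done
  return "$rc"
}
```

Paste the functions into the shell and run check_dns okd.example.com. Any FAIL line means the ignition fetch is likely to hang later.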

Step 1: ignition file from the running cluster

From your workstation with the SNO kubeconfig:

oc -n openshift-machine-api get secret \
   worker-user-data -o jsonpath='{.data.userData}' \
   | base64 -d > worker.ign

That single JSON file is what makes the new node “a worker for this cluster.” It is a small stub: it points at the MCS and embeds the cluster CA, so Ignition can fetch the full worker config from the MCS on first boot.
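Before serving the file, it is worth confirming that it really points where you expect. A rough sketch using plain grep so it needs nothing extra installed; it assumes the stub stores its merge target under a "source" key with an https URL, which is how the stubs I have seen look. jq would be sturdier if you have it. The function name is hypothetical.

```shell
# Print every https URL that a stub ignition file references via "source".
# The embedded CA is also a "source", but it is a data: URL, so the
# https filter skips it.
ign_sources() {
  grep -Eo '"source"[[:space:]]*:[[:space:]]*"https://[^"]*"' "$1" \
    | grep -Eo 'https://[^"]*'
}
```

Running ign_sources worker.ign should print a URL on your api-int host, port 22623. If it prints nothing, or a hostname the new box cannot resolve, fix that before going further.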

Step 2: serve the ignition
#

The new node will fetch the ignition during install. The easiest way is python3 -m http.server on your workstation, run from the directory that holds worker.ign:

python3 -m http.server 8080 --bind 10.20.30.40

Now http://10.20.30.40:8080/worker.ign is reachable from the new node.

Step 3: install
#

Boot the new node on the SCOS live ISO (same one you used for SNO). At the live shell:

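sudo coreos-installer install /dev/sda \
   --ignition-url=http://10.20.30.40:8080/worker.ign \
   --insecure-ignition \
   --copy-network

--copy-network carries your live-session network config (DNS, static IP if you set one) into the installed disk so the box comes back with the same identity. --insecure-ignition is needed because coreos-installer refuses a plain-HTTP ignition URL unless you pass that flag or supply a hash of the file.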
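If you would rather have the installer verify the file than trust plain HTTP, coreos-installer also accepts --ignition-hash with a digest of the config. A small sketch for computing that digest on the workstation, assuming a Linux box with sha512sum from coreutils; the function name is mine.

```shell
# Emit the digest of an ignition file in the "sha512-<hex>" form
# that coreos-installer's --ignition-hash option takes.
ign_hash() {
  printf 'sha512-%s\n' "$(sha512sum "$1" | cut -d' ' -f1)"
}
```

Run ign_hash worker.ign on the workstation, then pass the printed value as --ignition-hash=sha512-... in place of --insecure-ignition on the new node. The install then fails loudly if the file changed in transit.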

Reboot. Pull the ISO. Wait.

Step 4: approve the CSR

The new node will phone home, request a kubelet certificate, and sit at Pending until you approve the CSR:

oc get csr | grep Pending
oc adm certificate approve <csr-name>
# wait 30s, second CSR appears for kubelet-serving
oc get csr | grep Pending
oc adm certificate approve <csr-name>
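Copying CSR names by hand gets old quickly, so here is a sketch that approves whatever is pending in one go. It assumes the CONDITION column is the last one in oc get csr output, and that blanket approval is acceptable, which is a homelab assumption; on anything shared, check the requestor first. Both function names are hypothetical.

```shell
# Read `oc get csr --no-headers` output on stdin and print the names of
# requests whose last column (CONDITION) is exactly "Pending".
pending_csrs() {
  awk '$NF == "Pending" { print $1 }'
}

# Approve every CSR that is pending right now.
# Assumption: approving all of them is fine on a homelab cluster.
approve_pending() {
  oc get csr --no-headers | pending_csrs | while read -r name; do
    oc adm certificate approve "$name"
  done
}
```

Call approve_pending once for the client certificate, wait about 30 seconds, and call it again for the kubelet-serving request.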

Within 2 minutes:

$ oc get nodes
NAME                       STATUS   ROLES                         AGE
master-0.okd.example.com   Ready    control-plane,master,worker   2d
node6                      Ready    worker                        90s

Removing the master’s worker role (optional)

SNO masters are both control-plane and worker so they can run pods at all. Once you have a real worker you can drop the worker role from the master:

oc label node master-0.okd.example.com node-role.kubernetes.io/worker-

I keep mine dual-roled — homelab, no real reason to be ascetic about it.

Things that bit me

Wrong DNS — the new node needs api-int.okd.example.com to resolve to the SNO. Without it, ignition fetch hangs.
api-int and api point at the same SNO. There’s only one. The split exists because in a real OCP cluster they go to different load balancers; for SNO they’re the same A record.
The worker sits in “Provisioning, then SchedulingDisabled” for a few minutes while the MCO applies the rendered config. That’s normal. Leave it alone and it joins.
CSR approval is a separate step. If you blink past oc get csr you’ll wonder why your node never goes Ready.

Now anti-affinity works, replicas: 2 schedules, and you can reboot one host without losing the cluster.
