Thoughts on AI infrastructure, Kubernetes, SAP, cloud, and automation — from the field.
Deploying OpenShift on bare-metal servers is a multi-step orchestration problem: authentication, cluster creation, static networking, ISO booting via Redfish, host discovery, and post-install operators. Here's a complete walkthrough of an Ansible automation framework that handles all of it, including idempotent state management and Day-2 operations.
Enterprise AI isn't just about buying H100s. It's a multi-layer architecture problem involving resiliency, redundancy, load balancing, GPU scheduling, and months of decisions that nobody documents. Here's the full stack from the perspective of an enterprise AI architect working with Cisco UCS and OpenShift.