Seasoned IT Infrastructure Leader with 22+ years of experience in data center computing, enterprise architecture, and technical solutions delivery. Specialized in designing and implementing high-performance AI training & inference infrastructure for enterprise customers.
Expert in architecting scalable AI factories on Cisco UCS platforms with NVIDIA GPUs, OpenShift/Kubernetes clusters, high-speed RDMA networking, and hybrid cloud architectures.
Proven track record in infrastructure automation using Python, Ansible, and Terraform, and in developing GenAI solutions, including LangChain agents and fine-tuned Llama 2 models.
Recognized for bridging deep technical expertise with customer-focused delivery, helping organizations accelerate their AI journey through optimized compute, storage, and networking solutions.
End-to-end solutions across AI infrastructure, cloud platforms, SAP, and enterprise automation.
Design and deploy high-performance AI training & inference clusters with NVIDIA GPUs, OpenShift, and RDMA networking on Cisco UCS platforms.
Architect and deliver hybrid cloud solutions across AWS, Azure, and GCP, including Azure Stack, ExpressRoute, BGP, and VPN Gateway.
Develop LangChain agents, Webex bots, and fine-tuned Llama 2 models for autonomous Day-2 operations and domain-specific AI use cases.
Build and maintain K8s/OpenShift clusters across cloud and on-prem using Kubespray, NVIDIA GPU Operator, and custom automation modules.
Automate compute provisioning at scale with Terraform, Ansible, and Python, covering UCS, Fabric Interconnects, and OpenShift clusters.
End-to-end SAP HANA TDI design and sizing, HA/DR implementation, OS/DB migration, and S/4HANA deployment on Cisco UCS and cloud platforms.

Deep dive into designing high-performance AI training clusters with NVIDIA GPUs and OpenShift.

How I built a Webex bot and Python LangChain agents for autonomous Day-2 infrastructure operations.

Lessons learned from deploying SAP HANA HA/DR on Azure with ARM templates and ExpressRoute.