Lead Infrastructure Engineer
by G42 in Artificial Intelligence
The Lead Infrastructure Engineer role at Inception, a G42 company, is responsible for leading the architecture, deployment, optimization, and operation of hybrid Azure and on-premises infrastructure environments that support AI-powered domain-specific and industry-agnostic solutions. The role operates within Information Technology Operations and plays a central role in transforming data and compute infrastructure into real-world applied AI solutions. The position encompasses ownership of Azure cloud infrastructure, hybrid cloud architecture, Infrastructure as Code (IaC), enterprise networking, security controls, identity and access management using Entra ID, and data encryption standards to ensure regulatory compliance and data protection. The role drives infrastructure engineering projects and modernization initiatives using DevOps and DevSecOps practices, container orchestration platforms such as Kubernetes and OpenShift, and automation through scripting languages including PowerShell and Python. The engineer is responsible for disaster recovery and business continuity planning, including risk assessment, mitigation strategies, and testing for mission-critical systems. The role also supports high-performance computing (HPC) environments, including GPU clusters using NVIDIA and AMD technologies and SLURM workload management, enabling advanced AI workloads. The position requires deep involvement in incident management, troubleshooting high-priority incidents using IT Service Management tools such as ServiceNow, Jira, and Sentinel, while adhering to ITIL change management processes. The engineer collaborates with engineering, product, and business teams to remediate issues, identify trends, and drive improvements across cloud, security, and networking layers, while also managing RFPs and architecting enterprise-level solutions considering cost, risk, and compliance. The role includes leadership responsibilities such as managing team deliverables, KPIs, professional development, and fostering a culture of continuous improvement, diversity, equity, inclusion, and respect. The environment emphasizes learning, innovation, AI infrastructure advancement, and alignment with international compliance standards including ISO 27001, SOC 2, HIPAA, and GDPR within Azure and hybrid environments.