Senior ML DevOps & IT Support

Flatgigs

Job Overview

Location

Dubai, Dubai, United Arab Emirates

Employment Type

Full-time

Work Arrangement

On-site

Sector

Information Technology & Software

Experience Level

Senior (5-8 years)

Application Deadline

May 6, 2026

About the Company

Flatgigs operates as a strategic execution partner, specializing in addressing critical talent and operational challenges for startups aiming for growth within the MENA region.

Headquartered in Dubai, UAE, the company focuses on connecting high-growth businesses with specialized, sector-specific talent. Their core mission is to facilitate measurable ROI, accelerate revenue milestones, and build sustainable, long-term value for their clients by filling essential talent gaps.

Job Description

Flatgigs is seeking a highly skilled Senior ML DevOps and IT Support professional to join their dynamic team in Dubai. This is an on-site position focused on building, securing, and maintaining the essential systems that power the company's artificial intelligence infrastructure.

This role bridges the critical areas of MLOps, cloud networking, and IT Service Management. You will be entrusted with the ownership of both the hardware and software foundations for machine learning workloads, while also serving as the primary point of contact for all internal IT operations. Based in the vibrant Dubai office, your responsibilities will include ensuring that both the production GPU environments and the internal team have the robust support and reliable connectivity necessary to operate with maximum efficiency.

Key responsibilities include managing and scaling multi-cluster GPU environments, providing hands-on IT support for users and devices, and designing secure cloud network architectures across Azure and GCP. You will also support MLOps pipelines, administer internal IT infrastructure, set up monitoring and alerting systems, and apply cloud security best practices. Collaboration with AI engineers to optimize infrastructure for machine learning tasks is also a core component of this role.

To apply for this role, click the Apply button on this page and follow the instructions.

Required Skills

IT AdministrationSystems EngineeringDevOpsGPU ManagementHigh-Compute EnvironmentsIT SupportAzureGoogle Cloud Platform (GCP)MLOpsNetworking ProtocolsCloud SecurityInfrastructure as Code (Terraform)KubernetesContainerization

Key Responsibilities

  • Manage and scale multi-cluster GPU environments and high-compute AI workloads
  • Provide hands-on IT support (user access, device management, and internal systems)
  • Design and maintain secure cloud network architectures across Azure and GCP
  • Support MLOps pipelines and the deployment of machine learning models
  • Administer internal IT infrastructure, including IAM, SSO, and MDM
  • Set up monitoring and alerting for GPU health, system performance, and security
  • Apply cloud security best practices across infrastructure and endpoint layers
  • Troubleshoot and resolve system, deployment, and connectivity issues
  • Collaborate with AI engineers to optimize infrastructure for machine learning tasks

Qualifications

  • 4+ years of experience in IT Administration, Systems Engineering, and DevOps
  • Strong experience managing GPU workloads and high-compute server environments
  • Expert-level comfort leading IT support as a core daily responsibility
  • Practical experience with Azure and Google Cloud Platform (GCP)
  • Solid understanding of MLOps workflows or deploying AI models in production
  • Knowledge of networking protocols (TCP/IP, DNS, VPNs) and cloud security
  • Proficiency in Infrastructure as Code (Terraform) or system automation scripting
  • Hands-on experience with Kubernetes and containerized environments
  • Strong problem-solving mindset with the ability to support a technical team independently

How to Apply

The AI and machine learning sector in Dubai is rapidly expanding, creating a high demand for specialized infrastructure management. This role is pivotal in building, securing, and maintaining the core systems that power advanced AI capabilities. You will leverage expertise in MLOps, cloud networking, and IT service management to ensure the seamless operation of GPU environments and AI workloads. Your impact will directly influence the speed and efficiency of AI development and deployment, contributing significantly to the company's technological advancement and competitive edge in this burgeoning market.

Posted Date

April 21, 2026