Full Time

Principal Fintech Infrastructure Resilience & Site Reliability Engineering (SRE) Lead

  • Remote
  • Specialism : Principal Fintech Infrastructure Resilience & Site Reliability Engineering (SRE) Lead
  • Post Date: May 12, 2026
  • Expires In : 92 Days
  • Apply Before: August 12, 2026
Job Overview

Principal Fintech Infrastructure Resilience & Site Reliability Engineering (SRE) Lead – Abu Dhabi, United Arab Emirates

Job Type: Full-Time

Experience Level: Principal (10–15 years)

Functional Area: Infrastructure | Reliability Engineering | Fintech Systems


Role Overview

In fintech, system failure is not an option. This role is for a highly technical leader who can ensure continuous availability, performance, and resilience of financial systems operating at scale.

You will build and manage site reliability frameworks that guarantee uptime, optimize performance, and proactively prevent system failures in high-transaction environments.


Core Responsibilities

  • Design and implement SRE practices for fintech platforms
  • Define and monitor service-level indicators (SLIs), service-level objectives (SLOs), and service-level agreements (SLAs)
  • Build automated monitoring and alerting systems for real-time system health
  • Implement incident response and disaster recovery strategies
  • Optimize infrastructure for scalability, latency, and cost efficiency
  • Collaborate with engineering teams to embed reliability into system design
  • Conduct post-incident reviews and continuous improvement initiatives

Technical Stack

  • Cloud: AWS, Azure, Google Cloud
  • Tools: Kubernetes, Docker, Terraform
  • Monitoring: Prometheus, Grafana, ELK stack
  • Programming: Python, Go
  • Architecture: Distributed systems, microservices

Required Qualifications

  • Degree in Computer Science, Engineering, or related field
  • 10+ years in infrastructure engineering or SRE roles
  • Strong expertise in distributed systems and cloud-native architectures
  • Experience in high-availability financial systems
  • Deep understanding of performance optimization and system reliability

Preferred Expertise

  • Experience with real-time payment or trading systems
  • Knowledge of chaos engineering practices
  • Familiarity with security and compliance in financial infrastructure
  • Strong leadership and incident management skills

Strategic Importance

You will ensure fintech platforms operate with near-perfect reliability, safeguarding financial transactions and maintaining user trust at scale.

Are you excited about this opportunity?

Don’t miss the chance to make a difference in the fintech and FX industry!

Apply now by clicking on the “Apply Now” button below.

Let’s shape the future of finance together!

#EmploySolutionJobs #FXCareers
