Diogo Figueiredo

Staff SRE • Platform Engineer • Product Leader
Lisboa, Portugal

Staff SRE with 13+ years building and scaling infrastructure for products serving millions of users. I specialize in platform transformations — taking systems from frequent outages to 99.95% uptime, reducing cloud costs, and building observability that enables teams to move faster with confidence. Deep expertise in AWS, Terraform, Kubernetes, Nomad, and Datadog.

Currently at Dashlane leading observability, SLO frameworks, and disaster recovery. Previously led infrastructure and product at Telpark (4M+ users, 30+ services, 45 engineers) and Etleap.

Dashlane
Staff SRE
Jan 2025 – Present
  • Leading company observability initiative, designing unified monitoring architecture and established consistent instrumentation
  • Reducing AWS infrastructure costs through architecture audit and rightsizing initiatives
  • Driving automated dependency/update workflows to reduce security risk surface and free engineering time
  • Establishing SLO framework across platform services, defining reliability targets and error budgets
  • Establishing disaster recovery strategy for critical components reducing RTO with automated failover
Terraform AWS Observability SRE
Premium Minds — Telpark
SRE & Product Manager
Apr 2012 – Dec 2024
  • Owned infrastructure while leading product for Spain/Portugal's #1 parking app (4M+ users), driving practices across 7 teams and 40+ engineers
  • Eliminated recurring outages caused by undetected database issues and deployment failures, delivering 99.95% uptime
  • Architected migration from static EC2 instances to Nomad-orchestrated cluster with IaC Terraform, enabling dynamic scaling and zero-downtime deployments across 30+ services
  • Unified CI/CD pipelines across 30+ microservices through standardised Jenkins workflows
  • Replaced basic CloudWatch monitoring with comprehensive Datadog stack (metrics, APM, alerting), shifting to proactive incident detection
  • Led development of the #1 car parking app in Spain and Portugal, serving over 4 million users
Terraform AWS Nomad Kubernetes Datadog CI/CD Product
Etleap
SRE
Feb 2021 – May 2024
  • Led infrastructure for B2B data pipeline platform: AWS architecture, CI/CD pipelines, observability, and developer tooling
  • Reduced CI pipeline duration by 50% (70 to 35 min) and eliminated flaky test noise through test parallelisation, Docker optimisations and intelligent retry mechanisms
  • Unified fragmented monitoring tools into single Datadog platform, building standardised dashboards for self-serve debugging
  • Established Terraform standards and practices that scaled infrastructure ownership from sole SRE to full engineering team
AWS Terraform Datadog CI/CD DevOps
Limetree
CTO
Jun 2012 – Oct 2014
Designed architecture and led development of mobile platform for family memory sharing. Built backend, frontend, and mobile apps; managed AWS infrastructure and deployment automation.
AWS Architecture Mobile
MyOut
Founder & CEO
Jan 2011 – May 2014
Built cultural events aggregator for Portugal. Developed full stack including ElasticSearch-powered search, Redis caching, and web crawlers.
ElasticSearch Redis Full Stack
Amazon Web Services (AWS) Product Development DevOps Management Site Reliability Engineering Terraform Kubernetes Nomad Datadog CI/CD Observability
Instituto Superior Técnico
MSc, Information Systems and Computer Engineering
2010 – 2012
Utrecht University
MSc, Computer Science — Erasmus
2011