Milad Jahandideh MJ

Milad Jahandideh

Site Reliability Engineer | Tech Lead

About

I'm a Site Reliability Engineer (SRE) and Tech Lead with more than 7 years of experience in building, scaling, and maintaining high-availability cloud infrastructures. My expertise spans across Kubernetes, OpenStack, Ceph, and DevOps automation, with a deep focus on reliability, performance, and operational excellence.

At ArvanCloud, I lead SRE and systems engineering initiatives, driving improvements in system scalability, monitoring, and resilience across large distributed systems.

Experience

Site Reliability Engineer / Tech Lead
ArvanCloud.ir · Full-time
Nov 2020 – Present

Infrastructure as a Service (IaaS) with OpenStack, Ceph, and Kubernetes.

  • Participate in project planning, task estimation, and prioritization to maintain alignment with organizational goals.
  • Collaborate with cross-functional teams to design and implement scalable, resilient infrastructure solutions.
  • Improve system reliability by building robust monitoring and alerting stacks using Prometheus, Grafana, and custom alerting rules.
  • Promote SRE best practices, driving a culture of reliability, automation, and continuous improvement.
  • Design and maintain CI/CD pipelines using GitLab CI, improving deployment speed and consistency.
  • Contribute to Infrastructure-as-Code initiatives using Ansible, enabling repeatable and automated deployments.
  • Implement Load Balancer as a Service (LBaaS) using the OpenStack Octavia project.
  • Deploy, operate, and optimize multiple Kubernetes clusters.
  • Manage dozens of microservices on Kubernetes using Helm charts, ensuring efficient rollouts and lifecycle management.
  • Led a VPC project connecting three OpenStack clusters using VXLAN overlays and BGP EVPN routing, leveraging OVN and Open vSwitch to deliver unified networking and secure inter-cluster communication.
Linux System Administrator
Mahsan.co · Full-time
Dec 2018 – Nov 2020
  • Implemented VMware ESXi virtualization infrastructure to facilitate code development and testing for developers.
  • Managed deployment and maintenance of applications on a large-scale Linux server environment.
  • Automated repetitive tasks using Ansible and Shell Scripting.
  • Containerized monolithic applications and optimized them to run on LXC.
  • Implemented ELK Stack to collect and analyze logs from thousands of servers centrally.
  • Utilized Zabbix for server monitoring.
  • Implemented UI testing automation using Selenium.
  • Provided Linux-related technical support and assistance to developers.
Embedded Systems Developer
Adeeco · Full-time
Dec 2017 – Nov 2018
  • Worked on embedded systems for industrial applications.

Education

MS – Technology and Innovation Management
Iran University of Science and Technology
BS – Information and Communications Technology
Shamsipour Technical and Vocational College
AS – Electronic Engineering
Technical and Vocational University
Diploma – Electronics
Vocational School

Projects

Melec.ir
Founder & Webmaster · Electronics & Microcontroller education community

Skills