Hello, I'm

Maxim Murygin

Site Reliability Engineer / Software Engineer

Amsterdam, Netherlands


About Me

Software Engineer / Site Reliability Engineer with a 15+ year history of driving platform evolution and engineering efficiency. Expertise lies in architecting, deploying, and maintaining high-scale environments that support critical business workloads. Adept at technical leadership, improving testing culture, and guiding engineering teams through complex migrations and foundational platform shifts.

Experience

Senior Site Reliability Engineer

Booking.com Nov 2024 - Present Amsterdam, Netherlands

Engaged by leadership to stabilize a highly critical legacy system (a distributed job scheduling platform), focusing on toil reduction and stability improvement, while simultaneously leading the design and architectural vision for the new Function-as-a-Service (FaaS) platform.

  • Architected and spearheaded the development of the company's first Function-as-a-Service (FaaS) platform on Kubernetes, directly unblocking a top-priority company initiative: the buildout of the LLM agents platform
  • Prevented major waste of engineering resources by challenging the proposed multi-component solution for distributed job triggering and advocating for a no-code architecture that improved the overall design
  • Led an initiative to reduce operational toil and enhance platform stability through automation tools, eliminating significant recurring manual work
  • Helped discover and mitigate one of the biggest security breaches in platform history; conducted postmortem and drove cross-functional improvements for company-wide security incident handling
Java Python Golang Kubernetes Puppet

Senior Site Reliability Engineer

Booking.com Aug 2022 - Oct 2024 Amsterdam, Netherlands

Led the delivery and reliability engineering for a high-scale Private Cloud Platform (OpenStack), establishing a secure and resilient foundation.

  • Unblocked the migration of key booking services to the Private Cloud by staying in close contact with stakeholders, translating their needs into requirements for the team, and contributing to the solution through both design and code
  • Executed a high-impact, cross-functional initiative (across 5 teams) to optimize CPU pinning and platform configurations, resulting in a 10% performance increase for critical workloads
  • Played a major role in architecture decisions, including leading the design of the multi-regional resilience model to ensure all services could withstand cloud failures
  • Championed a shift to SLO-based alerting and mercilessly eliminated false positives, resulting in a quantifiable reduction in out-of-hours alert volume
  • Acted as a principal technical consultant for Engineering/Product Leadership, quickly diagnosing and delivering complex, time-sensitive solutions that prevented critical path blockers for major product initiatives
Python OpenStack Puppet Terraform Linux

Site Reliability Engineer

Booking.com May 2020 - Jul 2022 Amsterdam, Netherlands

SRE in Core Infrastructure. Built from scratch and maintained an integration layer between the OpenStack-based Private Cloud and internal services.

  • Designed and played a major role in implementing an internal platform that comprises 8000+ VMs and provides a working environment for 2000+ developers
  • Guided the adoption of IaC with Terraform, developed many internal Terraform modules, and helped the AWS team set up and use a private Terraform registry
  • Drove high-risk, foundational architectural work to align VM and Baremetal lifecycles, a complex change that was deployed with zero service degradation or lost bookings
  • In collaboration with Risk and Compliance, built a comprehensive list of controls to certify the environment as SOX compliant. Ensured platform compliance by successfully leading the first two audit sessions with auditors
  • Onboarded 6 new team members and promoted pair programming by example
Terraform OpenStack Golang Python Puppet Graphite Grafana PostgreSQL Linux

Team Lead / Site Reliability Engineer

Rubius Aug 2016 - Mar 2020 Tomsk, Russia

Team Lead managing a cross-functional team while handling SRE responsibilities. Led a team of 8 engineers, managed stakeholder relationships, and drove technical decisions. Maintained and enhanced web services for data processing and for generating training sets for Machine Learning tasks.

  • Set up a solid architecture that survived 100x growth in scale over the next 5 years
  • Led successful migration from monolith to microservices architecture
  • Moved operations from manual processes to Infrastructure as Code with Terraform
  • Grew team from 6 to 9 engineers
GCP Terraform Kubernetes Docker Stackdriver Python MySQL PostgreSQL Microservices

Senior Backend Developer

Rubius Jun 2015 - Jul 2016 Tomsk, Russia

As the Lead Developer of the outsourced team, I was responsible for building backend architecture and optimizing critical requests.

  • Implemented real-time monitoring of production performance
  • Gathered and optimized critical production requests
  • Implemented stress tests to prevent performance degradation
Node.js Python Linux Docker GCP MySQL

Backend Developer

Rubius Oct 2013 - May 2015 Tomsk, Russia

I was responsible for the backend development of the enterprise project management system.

  • Carried out significant refactoring to make the system testable
  • Demonstrated the value of testing and proper test writing techniques to other developers, rapidly increasing test coverage to nearly 90%
.NET C# SQL Server

R&D Intern

Tomsk Polytechnic University Sep 2010 - Jun 2013 Tomsk, Russia

Developed a bacterial population monitoring system by constructing asymptotic solutions of the Fisher-Kolmogorov equation, followed by modeling in MATLAB.

C++ MATLAB

Skills

Core Languages

Python Golang JavaScript Bash

Data Layer

MySQL PostgreSQL Kafka Message Queues

Compute

Containers (Kubernetes) Virtualization (OpenStack) Baremetal (Linux)

Cloud Platforms

Amazon Web Services (AWS) Google Cloud Platform (GCP)

Architecture

Test Driven Development Domain Driven Design

Education

Tomsk Polytechnic University

Engineer's Degree

2006 - 2012

Recommendations

Maxim is easy to work with and stands out as a highly responsible person with good organizational and communication skills. During my 4-year work relationship with him, he was a reliable business partner in a crisis and an engineering architect solving complex technical problems. He successfully managed a team of developers and customers across different regions around the world. Maxim would be a great asset in any tech company!

Eldar Khaliullin, Principal Software Engineer at Magic Leap

Maxim is a very diligent and responsible Team Lead and a strong development expert. He is proactive and learns quickly, a good team player, attentive to detail and people. I would definitely recommend Maxim to anyone.

Sergey Dorofeev, Co-founder at Rubius

Get In Touch

I'm open to discussing new opportunities and interesting projects.