About
Site Reliability Engineer (SRE) & Technology Lead with 13+ years of IT experience (4+ years in SRE). Proven expertise in automation, monitoring, incident response and infrastructure management to ensure performance, availability and scalability.
Strong leader who has mentored SRE teams, delivered mission-critical financial applications, and partnered with global stakeholders to improve reliability and reduce operational costs.
Experience
- Led an 8-member SRE team, mentoring engineers and improving delivery efficiency.
- Reduced MTTR by 30% using proactive monitoring (Splunk, Grafana, Control-M).
- Conducted chaos engineering experiments, achieving 99.98% uptime and improved resilience.
- Partnered with developers to optimise applications and reduce infrastructure costs by 15%.
- Implemented structured incident management, reducing repeat incidents by 20%.
- Delivered 10+ full-cycle software projects across banking, finance and government sectors.
- Led development teams and improved delivery timelines through Agile practices.
- Enhanced production support and log analysis, improving operational efficiency by ~25%.
- Collaborated with onsite teams across Muscat, Brunei and Singapore for requirements and delivery.
Highlights
Migrated legacy jobs to a standard pipeline model, cutting release time and failures.
Introduced reusable Terraform modules and remote state, standardising environment setup.
Helped move workloads to Kubernetes with proper config, secrets and rollout strategies.
Improved dashboards, alerts and on-call playbooks to reduce noise and MTTR.
Skills
Certifications
- AWS Certified Solutions Architect – Associate (Planned / Achieved)
- Certified Kubernetes Administrator (CKA) (Optional placeholder)
- Any other cloud / DevOps certifications you hold.
Links
- GitHub: github.com/revanthkodam23
- LinkedIn: linkedin.com/in/revanth-kodam-68917019
- Email: revanth.kodam@outlook.com