Designing, implementing and maintaining a cloud infrastructure platform that balances high availability, reliability, complexity, security, scalability and cost according to each stage of the product (using Infrastructure as Code and GitOps)
Participating in the design and implementation of the CI/CD system
Performing root cause analysis of errors, and investigating and resolving technical issues in all environments
Optimizing the performance and cost of the whole system
Designing, implementing, maintaining and optimizing the observability systems (including monitoring, logging, distributed tracing, continuous profiling and APM)
Supporting the development team in designing, implementing and maintaining infrastructure for new services and applications
Researching, designing and implementing toolchains and workflows that enable self-service capabilities for the development team and improve the stability and scalability of the system
Writing system documentation and training other team members
Collaborating with the security team to patch infrastructure vulnerabilities and implement security components