Overview:
At Finteqhub, we strive to empower teams to take ownership of their tools, processes, and workflows, ensuring everyone can deliver their best work. We are looking for a dedicated Senior DevOps Engineer to support and guide teams while driving system reliability and observability improvements.
About Product:
Finteqhub
А PCI DSS certified payment gateway for online businesses, providing integration with payment systems via a single software platform
Learn more
Key responsibilities:
- Team Enablement and Support
Act as a mentor and guide, empowering the team to adopt and effectively use existing tools and processes. Focus on building their autonomy in areas such as CI/CD, observability, and Kubernetes workflows, reducing reliance on your role over time. - Facilitating SLOs and SLIs
Collaborate with the team to define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs), ensuring alignment with system and business requirements. Provide guidance and expertise to help the team measure, track, and improve system reliability, while making the process an integral part of the team’s operations. - System and Architecture Understanding
Support the team in understanding system architecture and communication patterns, enabling better decision-making in design and implementation. - Observability and Monitoring
Assist the team in leveraging the observability stack (e.g., Grafana, OpenTelemetry, ELK, Jaeger) to gather actionable insights and improve system performance. Collaborate to enhance monitoring workflows and establish comprehensive metrics coverage that the team can independently maintain. - Cross-Functional Collaboration
Foster a collaborative culture by working closely with development and operations teams to align tools and processes with business goals. Provide recommendations and insights for teams using AWS, Kubernetes, and related infrastructure, ensuring a shared understanding of best practices.
Requirements:
Core Experience
- Minimum 4 years of hands-on experience with Continuous Integration and Deployment (CI/CD) tools and techniques, such as Argo CD, Argo Workflows, or GitHub Actions.
- Minimum 4 years of experience with Containerization concepts and Kubernetes.
- Ability to work on on-call duty.
Infrastructure Management
- Minimum 2 years of experience with Infrastructure as Code tools, such as Terraform or CloudFormation.
- Minimum 2 years of experience with Source Control Management tools, such as Git or Subversion.
Programming & Automation
- Minimum 2 years of experience in a current programming language, such as Python or Golang.
Monitoring, Logging, and SRE Practices
- Experience working with monitoring and logging tools.
- Improving monitoring and alerting workflows.
- Proven experience in defining and implementing Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to enhance system reliability and performance.
- Demonstrated ability to collaborate with cross-functional teams to optimize monitoring, logging, and alerting processes for improved system insights and response times.
Nice to have:
- Familiarity with issue tracking software, such as JIRA.
- Understanding of software development life cycle (SDLC) and experience developing enterprise-grade software.
- Knowledge of enterprise security and hardening best practices.
- Hands-on experience with databases, such as PostgreSQL or DynamoDB.