Overview:
The role requires focusing on designing, scaling, and securing distributed environments while treating infrastructure as an internal product. This position serves as a critical bridge between software development and the core infrastructure of Game Aggregator.
Rather than focusing solely on pipeline maintenance and cluster management, this role centers on building abstract internal tooling, establishing engineering guardrails, and acting as a high-level technical advisor to software development teams. The position requires deep expertise in optimizing distributed systems, designing scalable cloud architectures, and mentoring engineering teams on cloud-native best practices.
About Product:
SOFTSWISS Game Aggregator
A fast and cost-effective solution that allows you to integrate and manage casino gaming content easily
Purpose of the role:
You will be responsible for designing and evolving the internal platform that powers our Game Aggregator, enabling development teams to build, deploy, and operate services efficiently at scale. Acting as a bridge between software engineering and infrastructure, you will create reusable platform capabilities, establish engineering standards, and guide teams in adopting cloud-native best practices. Your work will improve system reliability, security, scalability, and developer experience across distributed, high-load environments.
Key responsibilities:
- Design and maintain scalable, multi-tenant Kubernetes platforms using reusable Helm templates to enable developer self-service.
- Own pipeline reliability, cost, and speed. Embed automated release gates, secure secret architectures, and software vulnerability scanning (SCA).
- Build custom tools to extend platform capabilities and standardize shared environments.
- Maintain distributed tracing, logging, and metrics systems. Guide development teams on service instrumentation and establishing SLIs/SLOs/SLAs.
- Consult development teams on system architecture, resource optimization, and modern traffic routing patterns.
Required Experience:
- 5+ years in a DevOps, SRE, or Platform Engineering role supporting high-load distributed systems.
- Strong grasp of multi-cloud patterns, multi-region architectures, and cloud cost optimization.
- Production-grade Kubernetes administration, including advanced scheduling policies, network isolation, and complex Helm templating.
- Deep Linux administration and cross-stack troubleshooting. Experience configuring L4/L7 traffic layers (Load Balancers, Ingress, API Gateways, Service Meshes).
- Mastery of declarative IaC and proficiency in Python, Go, or Bash.
- Ability to independently gather requirements, set delivery goals, and build maintainable solutions.
- Intermediate or higher English and Russian (B1+).
Nice to have:
- Practical experience scaling and maintaining infrastructure for AI-powered applications, LLM frameworks, or MLOps pipelines.
Main Advantages
- Private health insurance
- Sports benefits
- Comprehensive Mental Health Program
- Free English lessons (online)
- Local language courses
- Paid time off
- Maternity leave support
- Referral program rewards
- Upskilling, internal workshops, and participation in professional conferences and corporate events