Senior Site Reliability Engineer
- Responsibilities
- Lead initiatives to improve reliability, scalability, and performance across our live game infrastructure.
- Serve as subject matter expert and mentor to junior and mid-level engineers.
- Daily operation and maintenance of hosted/cloud data-center environments.
- Installation, configuration, and patching of system and game software.
- Define, monitor, and improve SLIs/SLOs to maintain 99.9%+ uptime.
- Own incident response and root cause analysis processes; create and maintain runbooks and playbooks.
- Evaluate and implement new technologies, conducting POCs and driving them to production.
- Maintain accurate, up-to-date documentation for systems, workflows, and processes.
- Lead capacity planning, scaling strategies, and disaster recovery efforts.
- Continuously optimize the reliability, observability, and cost efficiency of critical infrastructure.
- Requirements
- Previous experience as a Site Reliability Engineer, Platform Engineer or similar
- Proven experience designing and operating large-scale, high-availability systems.
- Strong Linux administration skills.
- Experienced with containerization and orchestration technologies.
- Experience in CI/CD pipelines, automated deployment, and infrastructure as code.
- Solid understanding of network security principles.
- Hands-on experience with both bare-metal and cloud (preferably AWS).
- Proficient in automation tools such as Ansible and Terraform.
- Skilled with observability tools like Open Telemetry, Prometheus, Mimir, and Grafana.
- Deep understanding of scalability, profiling, debugging, and performance testing.
- Strong grasp of web stack fundamentals (REST, HTTP, CDN, caching).
- Experience setting up monitoring, metrics, and proactive alerting for production systems (Go, Java, C++).
- Proficient scripting in Shell and Python.
- Excellent communication and documentation skills in English.
- Willingness to relocate to Frankfurt am Main, Germany.
- Pluses
- Experience with Zero Trust Networks, WireGuard, Nomad, MaaS, Foreman.
- Knowledge of capacity forecasting and cost optimization for large-scale systems.
Apply for this Position
Please apply directly online and, if applicable, upload your materials as specified on the job posting. Fields marked with a * are required.
We are Crytek.
Crytek is an independent video game developer and publisher based in Frankfurt, Germany.
Crytek pushes the boundaries of the possible to make the impossible a reality. We want to create the most fun gaming experiences around, and if we have to blow up computer system requirements or push genre boundaries to do so, then we will. With almost two decades of experience in the games industry, Crytek takes its unique combination of experience and skills and continue to make an impact via innovative, fun, cutting-edge games and technology.