Design, develop, and maintain Kubernetes Operators and Custom Resource Definitions (CRDs). Engineer custom scheduler plugins and optimize GPU workloads for AI infrastructure. Requires 3-5 years of Go experience and strong Kubernetes internals knowledge.
Job Description
Your mission
- Kubernetes Operator Development: Design, develop, and maintain production-grade Kubernetes Operators using frameworks like Operator SDK or Kubebuilder to automate application lifecycle management and infrastructure orchestration.
- Custom Resource Design: Architect and implement Custom Resource Definitions (CRDs) and controllers to extend the Kubernetes API.
- Scheduler Plugin Engineering: Build and optimize custom scheduler plugins to refine workload placement strategies, addressing business-specific requirements such as cost-efficiency, latency reduction, or resource constraints.
- GPU Optimization: Develop, deploy, and manage DaemonSets that enable and execute optimizations on NVIDIA and AMD GPUs.
- Cross-Functional Collaboration: Partner with other engineering teams to integrate custom solutions into their Kubernetes-based workflows.
- Innovation & Ecosystem Engagement: Research emerging trends in the Kubernetes ecosystem (e.g., KEPs, upstream projects) and prototype innovative solutions to address evolving infrastructure challenges.
- 3-5 years of software engineering experience in Go
- Good grasp of Kubernetes internals, including controller patterns, reconciliation loops, and API extension via CRDs
- Familiarity with designing custom scheduling logic using Kubernetes scheduler plugins and extension points
- Solid understanding of GPU architectures and how to expose, manage, and optimize them in Kubernetes clusters
- Ability to collaborate across different teams to embed custom functionality into cluster workflows
- Experience with CI/CD pipelines and working in modern software delivery workflows
- Proficiency in Bash and/or Python for basic scripting and automation tasks
- Build something big: Help build and scale a fast-growing AI infrastructure startup
- Pay & perks: Competitive compensation with a performance-based incentive, subsidized Deutschlandticket, and access to a discount portal
- Work your way: Flexible hours with hybrid and remote-friendly options
- Fast lanes, no red tape: Flat hierarchies and rapid decision-making mean ideas ship quickly
- Global team: Work with a diverse, international team across Germany and the USA
- Modern headquarters: Well-stocked office near the Heidelberg Hauptbahnhof, available on a hybrid basis or as a place to connect during our quarterly team workshops
- Top setup: Your choice of high-quality hardware and equipment
- Relocation support: We’ll help make your move to join us as smooth as possible
turbalance is an innovative, emerging startup that transforms AI laws. We are a team of passionate problem-solvers who believe in what we’re building. We constantly push boundaries and embrace our inner nerds as we find new ways to tackle complex challenges. You will find a dynamic work environment here, with flat or even non-existent hierarchies and the chance to take on responsibility from day one.