Luxoft Poland company
Project Description:
Our Client is one of the largest TV providers in North America.
We are looking for an experienced Site Reliability Engineer (DevOps) to help us build and maintain infrastructure for a variety of customer-facing applications. You will work with a wide range of technologies to ensure the application infrastructure is available, well-maintained, and secure. The primary infrastructure consists of Kubernetes container orchestration, deployed in both cloud (AWS) and on-prem environments, as well as other
AWS resources such as databases, messaging systems, DNS, networking, and more. You will collaborate with development teams to ensure the infrastructure meets application requirements. Additionally, you will work on automating the infrastructure deployment process and leveraging Infrastructure as Code (IaC) practices.
Responsibilities:
Responsibilities include:
• Deploy new infrastructure components such as Kubernetes clusters, databases, Lambda functions, and others.
• Perform system upgrades and patch management.
• Manage and troubleshoot Kubernetes deployments using Helm and Kustomize.
• Implement and enforce Kubernetes security best practices, including OIDC and RBAC.
• Provide monitoring capabilities using Prometheus and a wide range of exporters.
• Offer technical guidance and support to development and DevOps teams.
• Maintain up-to-date infrastructure documentation.
• Oversee the architecture of the entire system and provide recommendations for improvements.
• Stay current with evolving industry best practices, preparing and presenting improvement proposals.
Mandatory Skills Description:
• Kubernetes: Proficient in deploying, configuring, and managing Kubernetes clusters (e.g., using kubeadm, EKS, AKS, GKE).
• Containerization: Expertise with Docker for containerization, including Dockerfile creation, container orchestration, and image optimization.
• Helm: Experience with Helm for Kubernetes package management, including creating and managing Helm charts.
• Infrastructure as Code (IaC): Familiarity with IaC tools (e.g., Terraform,
CloudFormation) for provisioning infrastructure.
• Configuration Management: Proficiency in tools like Ansible, Puppet, or Chef for automating system configurations and deployments.
• Monitoring & Logging: Experience with monitoring tools like Prometheus, Grafana, ELK stack for observability and alerting.
• Logging & Troubleshooting: Skills in setting up centralized logging systems and debugging Kubernetes-related issues (e.g., kubectl, K9s, Fluentd).
• Service Meshes: Familiarity with service mesh tools like Istio or Linkerd to manage microservices traffic in Kubernetes.
Cloud Platforms:
• Cloud Providers: Hands-on experience with at least one cloud provider (e.g., AWS, Google Cloud, Azure) for Kubernetes clusters and cloud-native services.
• Managed Kubernetes Services: Experience managing Kubernetes clusters in managed services like AWS EKS, GKE, or Azure AKS.
• Cloud-native Architecture: Familiarity with cloud-native architectures such as microservices, serverless, and event-driven systems.
Security and Best Practices:
• Kubernetes Security: Deep understanding of Kubernetes security features like RBAC, Network Policies, Pod Security Policies, and OIDC.
• Secrets Management: Experience with tools like Kubernetes Secrets and AWS Secrets Manager for managing secrets.
• Compliance & Auditing: Familiar with auditing Kubernetes environments, performing security scans, and following compliance standards
Automation & Scripting:
• Scripting: Proficiency in scripting languages like Bash, Python, or Go for
automation tasks.
• Automation Frameworks: Experience with automation tools and frameworks (e.g., Jenkins pipelines, Ansible, GitOps with ArgoCD or Flux).
Version Control & Collaboration:
• Git: Strong experience with Git for version control, including branching strategies and code reviews.
• Collaboration Tools: Familiar with project management tools like Jira, Confluence, or Slack to collaborate across teams.
Soft Skills:
• Problem-Solving: Excellent troubleshooting and problem-solving skills to diagnose and resolve issues in complex, distributed environments.
• Communication: Ability to communicate complex technical concepts to non-technical stakeholders and work in cross-functional teams.
• Collaboration: Strong team player who collaborates well with developers, QA, and operations teams.
Nice-to-Have Skills Description:
Desired Qualifications:
• Certifications: Kubernetes certifications (e.g., CKAD, CKA) are a plus.
• Experience with Serverless: Familiarity with serverless technologies (e.g., AWS Lambda, Azure Functions) and how they integrate with Kubernetes.
• Disaster Recovery: Experience in designing and implementing disaster recovery plans for Kubernetes clusters.
• Multi-cloud Environments: Experience working with hybrid or multi-cloud environments.
Bonus Skills:
• Service Deployment & Management: Experience with managing and scaling large-scale microservices-based applications using Kubernetes.
• Data Management: Familiarity with Kubernetes-based storage solutions and distributed databases (e.g., StatefulSets, persistent volumes).
• Custom Kubernetes Controllers: Experience writing custom controllers or operators using Go
Languages:
English: C1 Advanced
What do we offer our employees?
💰 Tax-Deductible Costs on a contract of employment for all development roles
👩⚕️ LuxMed health & dental care, life insurance
🏋🏽♀️ MyBenefit program (sports card, well-being program etc.)
🏠 Remote work (or hybrid)
🖋 Stable employment based on an employment contract (or B2B)
🏦 Special offer of banking services
💻 Equipment such as a laptop and monitor
🌎 Support for relocated employees
🎯 Full coverage relocation package for Senior positions
🎓 LuxTalent platform (webinars, training, courses with certificates)
...and even more!