Senior DevOps Engineer - Cloud Infrastructure & Security

careflow • United State
Remote
Apply
AI Summary

Lead cloud infrastructure, security, and reliability for a growing platform. Improve deployment processes, monitor systems, and troubleshoot issues. Enjoy a fully remote, flexible schedule with weekend coverage.

Key Highlights
Own and improve cloud infrastructure
Ensure platform security and reliability
Collaborate cross-functionally to drive initiatives
Key Responsibilities
Manage and maintain GCP environment
Design and implement scalable infrastructure
Monitor system health and performance
Establish centralized logging and monitoring
Lead incident response and root cause analysis
Implement security best practices
Improve deployment pipelines and automation
Establish service-level objectives and operational standards
Partner with teams to support company initiatives
Technical Skills Required
Google Cloud Platform Infrastructure as Code Terraform Docker Kubernetes Python TypeScript Node.js
Benefits & Perks
Fully remote
Flexible schedule with weekend coverage
Nice to Have
Experience with healthcare technology
Ability to read and contribute to application code

Job Description


About The Role

We are looking for an experienced DevOps Engineer to own and improve our cloud infrastructure, security, observability, and operational reliability. This role is responsible for ensuring our platform remains secure, scalable, performant, and highly available as we continue to grow.

The ideal candidate is someone who enjoys wearing multiple hats—building infrastructure, improving deployment processes, monitoring production systems, and troubleshooting issues across the stack. As a bonus, we would love someone who is comfortable diving into the application codebase to diagnose and resolve bugs when needed.

This is a fully remote position. We are particularly interested in candidates who can provide weekend coverage on Saturdays and take another day off during the week in exchange.

What You'll Do

Cloud Infrastructure & Operations

  • Manage and maintain our Google Cloud Platform (GCP) environment.
  • Design, implement, and improve infrastructure for scalability, reliability, and cost efficiency.
  • Manage networking, compute resources, databases, storage, and cloud services.
  • Monitor system health and proactively address performance bottlenecks.

Monitoring, Logging & Observability

  • Build and maintain centralized logging and monitoring solutions.
  • Create dashboards and alerts for system health, application performance, and business-critical workflows.
  • Establish operational metrics and usage tracking across the platform.
  • Lead incident response and root cause analysis efforts.
  • Monitor and manage spend

Security & Compliance

  • Implement and maintain security best practices across infrastructure and applications.
  • Manage identity and access controls, secrets management, and environment security.
  • Conduct security reviews and vulnerability remediation.
  • Assist with compliance initiatives and audit readiness.

CI/CD & Automation

  • Improve deployment pipelines and release processes.
  • Automate infrastructure provisioning and operational workflows.
  • Enhance development environments and deployment reliability.
  • Reduce manual operational tasks through automation.

Reliability Engineering

  • Improve uptime, resiliency, backup strategies, and disaster recovery processes.
  • Establish service-level objectives and operational standards.
  • Drive improvements in platform stability and performance.

Cross-Functional Support

  • Partner with engineering, product, and leadership teams to support company initiatives.
  • Provide technical guidance on infrastructure and operational considerations.
  • Participate in an on-call and operational support rotation.

Bonus Responsibilities

  • Troubleshoot and fix application-level issues when needed.
  • Contribute code improvements and bug fixes across the platform.
  • Assist with performance optimization and debugging efforts.

What Success Looks Like

Within your first 90 days, you will:

  • Gain ownership of our GCP infrastructure and environments.
  • Establish visibility into system performance, reliability, and usage metrics.
  • Improve monitoring, alerting, and incident response processes.
  • Identify and address security and operational risks.
  • Reduce infrastructure-related issues and deployment friction.
  • Become a trusted technical resource for platform reliability and operational excellence.

Position Details

  • Role: DevOps Engineer
  • Employment Type: Full-Time
  • Location: Fully Remote
  • Schedule: Flexible, with availability to provide Saturday coverage and take another weekday off
  • Reports To: Lead Architect

This role is ideal for someone who enjoys both infrastructure ownership and hands-on problem solving, and wants to have a significant impact on the reliability, security, and scalability of a growing software platform.

Requirements

Required Qualifications

  • 5+ years of DevOps, Site Reliability Engineering, Cloud Engineering, or related experience.
  • Strong hands-on experience with Google Cloud Platform (GCP).
  • Experience building and maintaining CI/CD pipelines.
  • Strong understanding of infrastructure monitoring, logging, and alerting systems.
  • Experience with cloud security best practices.
  • Experience managing production environments and incident response.
  • Strong Linux administration skills.
  • Experience with Infrastructure as Code tools (Terraform preferred).
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Strong troubleshooting and problem-solving abilities.
  • Excellent written and verbal communication skills.
  • Ability to work independently in a fully remote environment.

Nice-to-Have Qualifications

  • Experience working in startup or high-growth environments.
  • Experience with healthcare technology or regulated environments.
  • Ability to read and contribute to application code.
  • Experience with Python, TypeScript, Node.js, or similar technologies.
  • Experience building internal tooling and automation.
  • Experience with data pipelines and analytics infrastructure.

Similar Jobs

Explore other opportunities that match your interests

Staff Software Engineer - Cloud Foundations

Devops
•
5h ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

GitHub

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Prodapt

United State

Staff PLM Architect

Devops
•
20h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

Agility Robotics

United State

Subscribe our newsletter

New Things Will Always Update Regularly