Lead cloud infrastructure, security, and reliability for a growing platform. Improve deployment processes, monitor systems, and troubleshoot issues. Enjoy a fully remote, flexible schedule with weekend coverage.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
About The Role
We are looking for an experienced DevOps Engineer to own and improve our cloud infrastructure, security, observability, and operational reliability. This role is responsible for ensuring our platform remains secure, scalable, performant, and highly available as we continue to grow.
The ideal candidate is someone who enjoys wearing multiple hats—building infrastructure, improving deployment processes, monitoring production systems, and troubleshooting issues across the stack. As a bonus, we would love someone who is comfortable diving into the application codebase to diagnose and resolve bugs when needed.
This is a fully remote position. We are particularly interested in candidates who can provide weekend coverage on Saturdays and take another day off during the week in exchange.
What You'll Do
Cloud Infrastructure & Operations
- Manage and maintain our Google Cloud Platform (GCP) environment.
- Design, implement, and improve infrastructure for scalability, reliability, and cost efficiency.
- Manage networking, compute resources, databases, storage, and cloud services.
- Monitor system health and proactively address performance bottlenecks.
- Build and maintain centralized logging and monitoring solutions.
- Create dashboards and alerts for system health, application performance, and business-critical workflows.
- Establish operational metrics and usage tracking across the platform.
- Lead incident response and root cause analysis efforts.
- Monitor and manage spend
- Implement and maintain security best practices across infrastructure and applications.
- Manage identity and access controls, secrets management, and environment security.
- Conduct security reviews and vulnerability remediation.
- Assist with compliance initiatives and audit readiness.
- Improve deployment pipelines and release processes.
- Automate infrastructure provisioning and operational workflows.
- Enhance development environments and deployment reliability.
- Reduce manual operational tasks through automation.
Interested in remote work opportunities in Devops? Discover Devops Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
- Improve uptime, resiliency, backup strategies, and disaster recovery processes.
- Establish service-level objectives and operational standards.
- Drive improvements in platform stability and performance.
- Partner with engineering, product, and leadership teams to support company initiatives.
- Provide technical guidance on infrastructure and operational considerations.
- Participate in an on-call and operational support rotation.
- Troubleshoot and fix application-level issues when needed.
- Contribute code improvements and bug fixes across the platform.
- Assist with performance optimization and debugging efforts.
Within your first 90 days, you will:
- Gain ownership of our GCP infrastructure and environments.
- Establish visibility into system performance, reliability, and usage metrics.
- Improve monitoring, alerting, and incident response processes.
- Identify and address security and operational risks.
- Reduce infrastructure-related issues and deployment friction.
- Become a trusted technical resource for platform reliability and operational excellence.
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
- Role: DevOps Engineer
- Employment Type: Full-Time
- Location: Fully Remote
- Schedule: Flexible, with availability to provide Saturday coverage and take another weekday off
- Reports To: Lead Architect
Requirements
Required Qualifications
- 5+ years of DevOps, Site Reliability Engineering, Cloud Engineering, or related experience.
- Strong hands-on experience with Google Cloud Platform (GCP).
- Experience building and maintaining CI/CD pipelines.
- Strong understanding of infrastructure monitoring, logging, and alerting systems.
- Experience with cloud security best practices.
- Experience managing production environments and incident response.
- Strong Linux administration skills.
- Experience with Infrastructure as Code tools (Terraform preferred).
- Experience with containerization technologies such as Docker and Kubernetes.
- Strong troubleshooting and problem-solving abilities.
- Excellent written and verbal communication skills.
- Ability to work independently in a fully remote environment.
- Experience working in startup or high-growth environments.
- Experience with healthcare technology or regulated environments.
- Ability to read and contribute to application code.
- Experience with Python, TypeScript, Node.js, or similar technologies.
- Experience building internal tooling and automation.
- Experience with data pipelines and analytics infrastructure.
Similar Jobs
Explore other opportunities that match your interests
Staff Software Engineer - Cloud Foundations
GitHub
Prodapt