Product Engineer, Web Data Infrastructure

coffeespace • San Francisco Bay Area
Remote • Visa Sponsorship
AI Summary

Join a small, high-ownership engineering team and take end-to-end ownership of the company's flagship scraping product. Tackle difficult technical problems involving browser automation, anti-bot systems, and content extraction. Operate with exceptional autonomy in a team that values builders who can identify opportunities, make product decisions, and ship without layers of management.

Key Highlights

  • Own a product already trusted by 100k+ developers
  • Solve genuinely difficult technical problems
  • Operate with exceptional autonomy

Key Responsibilities

  • Own the core scraping product end-to-end
  • Improve success rates across complex websites
  • Build and iterate on structured extraction capabilities

Technical Skills Required

TypeScript, Node.js, Playwright, Puppeteer, headless browsers, rendering systems, proxy infrastructure, anti-bot mitigation, Datadog, Sentry, OpenTelemetry

Benefits & Perks

  • $180K-$290K base
  • 0.01%-0.15% equity
  • Remote work
  • Visa sponsorship available
  • San Francisco office available

Nice to Have

  • Experience with LLM-powered applications
  • Strong intuition for what makes web data useful for AI systems

Job Description


Job Title: Product Engineer, Scraping

Salary: $180K-$290K base + 0.01%-0.15% equity

Location: Remote (Americas time zones preferred)

Visa sponsorship available for exceptional candidates

San Francisco office available for candidates who prefer working in person


Company Description

The company is a venture-backed developer infrastructure startup building one of the fastest-growing web data platforms in the AI ecosystem.


In less than two years, the company has become a critical part of the modern AI stack, helping developers transform the messy web into clean, structured, LLM-ready data through a simple API. Thousands of engineering teams rely on the platform to power search, agents, retrieval systems, automation workflows, and AI products at scale.


With exceptional growth, strong product-market fit, and a highly technical team of fewer than 40 people, this is an opportunity to own a product already loved by developers while helping define the next generation of web data infrastructure.


Job Description

Join a small, high-ownership engineering team and take end-to-end ownership of the company's flagship scraping product.


This role sits at the intersection of product engineering, developer experience, and large-scale web infrastructure. You'll be responsible for making the product the most reliable way for developers to transform any URL into clean, structured, AI-ready data.


You'll tackle some of the hardest problems on the modern web, from JavaScript-heavy applications and anti-bot protections to content extraction quality, latency optimization, structured data generation, and developer usability.


Unlike traditional infrastructure roles, you'll own both the product and the implementation. There is no PM writing specifications for you. You'll identify problems, formulate hypotheses, ship solutions, measure outcomes, and continuously improve the experience for developers.


Why this role is remarkable

  • Own a product already trusted by 100k+ developers and directly influence one of the fastest-growing tools in the AI infrastructure ecosystem
  • Solve genuinely difficult technical problems involving browser automation, anti-bot systems, content extraction, structured data generation, and LLM-ready data pipelines
  • Operate with exceptional autonomy in a team that values builders who can identify opportunities, make product decisions, and ship without layers of management
  • Join a company with strong growth, meaningful equity, direct founder access, and a clear opportunity to shape the future of AI data infrastructure


What you will do

  • Own the core scraping product end-to-end, including reliability, extraction quality, latency, response formats, and developer experience
  • Improve success rates across complex websites by solving challenges involving JavaScript rendering, browser automation, dynamic content, anti-bot systems, and long-tail edge cases
  • Build and iterate on structured extraction capabilities, helping developers generate usable data without extensive post-processing
  • Run rapid product experiments, instrument outcomes, analyze usage patterns, and make informed decisions based on real developer behavior
  • Work closely with founders while gathering product insights directly from GitHub issues, community discussions, support channels, and internal usage


The ideal candidate

  • 3-12 years of experience as a product engineer, founding engineer, or early engineer at a startup
  • Strong TypeScript and Node.js expertise with experience shipping production-grade developer-facing products
  • Deep understanding of web scraping, crawling, browser automation, or data extraction systems at scale
  • Experience with technologies such as Playwright, Puppeteer, headless browsers, rendering systems, proxy infrastructure, or anti-bot mitigation
  • Strong product instincts with a track record of independently owning product direction, developer experience, and feature prioritization without relying on dedicated PMs
  • Familiarity with LLM-powered applications and strong intuition for what makes web data useful for AI systems
  • Experience operating production APIs and using observability tooling such as Datadog, Sentry, OpenTelemetry, or similar platforms


Less likely to be a fit

  • Engineers whose scraping experience is limited to internal tooling or isolated side projects
  • Pure infrastructure or systems engineers who prefer not to engage with product decisions or developer experience
  • Academic researchers without evidence of commercial product ownership
  • Candidates from large technology companies without meaningful startup experience or examples of operating in high-autonomy environments


Next steps

  1. Apply via this LinkedIn job post.
  2. We'll review applications and reach out directly if there's a strong match.
  3. If this specific opportunity isn't the right fit, we may also introduce qualified candidates to other high-signal startup opportunities we're actively recruiting for, always with your permission.


A quick note on authenticity

This is a real, active role that we're supporting in close partnership with the hiring team. We do not post speculative opportunities and work directly with founders and hiring managers on current hiring needs.

