Senior Full-Stack Engineer for AI Model Benchmarking Platform

benchstack ai • United States
Visa Sponsorship
AI Summary

Build and maintain a platform for AI model benchmarking, working with a small team in San Francisco. 2-6 years of experience in full-stack or platform engineering required. Competitive salary and equity offered.

Key Highlights
Build and maintain AI model benchmarking platform
Work with a small team in San Francisco
Competitive salary and equity offered
Key Responsibilities
Build and maintain the platform that runs LLM benchmarks end to end
Work across the full stack as a generalist
Perform code and architecture reviews for other engineers
Technical Skills Required
Python, Django, React, TypeScript, cloud infrastructure
Benefits & Perks
Competitive salary ($150K - $215K)
Equity
Visa sponsorship available
Nice to Have
Interest in the AI/ML domain and its challenges

Job Description


Our client is a pre-seed San Francisco team building the platform that benchmarks the world's leading AI models on rigorous, domain-specific tasks. They work with every major foundation model lab and top financial institutions, and their work has been featured in the Wall Street Journal, Washington Post, and Bloomberg. Backed by Pear VC, the team is small (~15 people) and growing fast — early enough that you'll own massive chunks of the platform and help shape the engineering culture from day one.



The role

This is a pure individual-contributor seat with real autonomy. You'll build the infrastructure that runs LLM evaluations at scale (distributed systems, cloud infra, and full-stack features), taking fuzzy, open-ended problems and shipping solutions independently.

What you'll do

  • Build and maintain the platform that runs LLM benchmarks end to end: Python libraries, web platform, cloud infrastructure, and tooling.
  • Work across the full stack as a generalist: Python/Django backend services and React/TypeScript frontend features.
  • Take open-ended problems and independently design, build, and ship solutions.
  • Perform code and architecture reviews for other engineers.
  • Collaborate closely with the research team to ensure infrastructure meets their evaluation needs.

What we're looking for

  • 2–6 years of experience in full-stack or platform engineering.
  • Production Python expertise in a professional setting.
  • React/TypeScript frontend experience.
  • Experience building systems of meaningful scope in production.
  • A sign of excellence: a top company, interesting research, a strong project, or a top-tier CS program.
  • Hands-on every day: this is not a management or delegation role.

Bonus points

  • 0-to-1 product building at big tech, a VC-backed startup, or as a founder.
  • Interest in the AI/ML domain and its challenges.

Compensation and benefits

  • Base: $150K – $215K
  • Equity: Competitive
  • Visa sponsorship available for exceptional candidates

Location and work model

  • San Francisco, CA
  • On-site, 5 days per week
