Member of Technical Staff - ML/RLOps

adaptive ml France
Visa Sponsorship
Apply
AI Summary

Build foundational technology for Adaptive ML's Reinforcement Learning Operations platform. Focus on high-performance software engineering and large-scale RL research for LLM deployment. Combine Rust programming, GPU optimization, and systematic empirical research to develop specialized models.

Key Highlights
Work on internal LLM Stack, Adaptive Harmony
Combine large-scale engineering with rigorous empirical research
Opportunity to shape research efforts and product as company grows
Key Responsibilities
Build foundational technology powering Adaptive with focus on high-performance software engineering and large-scale RL research
Contribute to product roadmap by identifying promising trends and high-impact findings
Report clearly on work to distributed collaborative team with bias for asynchronous written communication
Write high-quality software in Rust with focus on performance and robustness
Profile dedicated GPU kernels in CUDA or Triton, optimizing across latency/compute-bound regimes
Identify and resolve bugs in large distributed systems at intersection of software and hardware correctness
Conduct research on large language models or diffusion models using reinforcement learning for personalization
Reproduce results from RL, LLM, and diffusion literature
Own research agenda with bias for at-scale, systematic empirical research
Technical Skills Required
Rust Python Triton CUDA Distributed systems Reinforcement learning Large language models Diffusion models
Benefits & Perks
Comprehensive medical (health, dental, and vision) insurance
401(k) plan with 4% matching
Unlimited PTO
Mental health, wellness, and personal development stipends
Visa sponsorship
Nice to Have
M.Sc./Ph.D. in computer science
Contributions to relevant open-source projects
Track record of publications at top-tier machine learning venues

Job Description


About The Team

Adaptive ML is a frontier AI startup building a Reinforcement Learning Operations (RLOps) platform that enables enterprises to specialize and deploy LLMs into production with measurable impact.

We provide the core infrastructure to tune, evaluate, and serve specialized models at scale — pioneering task-specific LLM development and running production-ready workflows that serve millions of requests while optimizing for both cost and performance across distributed systems.

Our tightly-knit team was previously involved in the creation of state-of-the-art open-access large language models. We raised a $20M seed led by Index Ventures and ICONIQ in early 2024, and we're already live in production with customers including Manulife, AT&T, Deloitte, across travel and financial services — with much more to be announced soon.

Our Technical Staff develops the foundational technology that powers Adaptive ML in alignment with requests and requirements from our Commercial and Product teams. We are committed to building robust, efficient technology and conducting at-scale, impactful research to drive our roadmap and deliver value to our customers.

About The Role

This is an open-role, describing a generic position in our Technical Staff. If any of the below seems like a fit, please apply!

As a Member of Technical Staff, you will contribute to building the foundational technology that powers Adaptive ML, primarily by working on our internal LLM Stack, Adaptive Harmony. We believe that generative AI is best approached as a “big science”--combining large-scale engineering with rigorous empirical research. As such, we emphasize scalability and systematic, empirical demonstrations in our approach. We are looking for self-driven, business-minded, and ambitious individuals interested in supporting real-world deployments of a highly technical product. As this is an early role, you will have the opportunity to shape our research efforts and product as we grow.

This is an in-person role based at our Paris or New York office.

Examples of tasks our Technical Team pursue on a daily basis:

  • Develop robust software in Rust, interfacing between easy-to-use Python recipes and high-performance, distributed training code running on hundreds of GPUs;
  • Profile and iterate GPU inference kernels in Triton or CUDA, identifying memory bottlenecks and optimizing latency—and decide how to adequately benchmark an inference service;
  • Develop and execute an experiment analyzing nuances between DPO and PPO in a fair and systematic way;
  • Build data pipelines to support reinforcement learning from noisy and diverse user' interactions across varied tasks;
  • Experiment with new ways to combine adapters and steer the behavior of language models;
  • Build hardware correctness tests to identify and isolate faulty GPUs at scale.

Your Responsibilities

Generally,

  • Build the foundational technology powering Adaptive, with a focus on high-performance software engineering and large-scale RL research;
  • Contribute to our product roadmap, by identifying promising trends and high-impact findings;
  • Report clearly on your work to a distributed collaborative team, with a bias for asynchronous written communication.

On the engineering side,

  • Write high-quality software in Rust, with a focus on performance and robustness;
  • Profile dedicated GPU kernels in CUDA or Triton, optimizing across latency/compute-bound regimes for complex workloads;
  • Identify and resolve bugs in large distributed systems, at the intersection of software and hardware correctness.

On the research side,

  • Conduct research on large language models or diffusion models, systematically exploring how reinforcement learning can be used to personalize models;
  • Reproduce results from the RL, LLM, and diffusion literature, distinguishing the noise from the groundbreaking;
  • Own a research agenda, with a bias for at-scale, systematic empirical research.

Nearly all members of our Technical Staff hold a position that is a blend of engineering and research.

Your (ideal) background

The background below is only suggestive of a few pointers we believe could be relevant. We welcome applications from candidates with diverse backgrounds; do not hesitate to get in touch if you think you could be a great fit,even if the below doesn't fully describe you.

  • A M.Sc./Ph.D. in computer science, or demonstrated experience in software engineering, preferably with a focus on machine learning;
  • Strong programming skills, especially regarding distributed problems where performance is key;
  • Contributions to relevant open-source projects, such as efficient implementations of models and RL;
  • A track record of publications at top-tier machine learning venues (e.g., NeurIPS, JMLR);
  • Passionate about the future of generative AI, and eager to build foundational technology to help machines deliver more singular experiences.

Benefits

  • Comprehensive medical (health, dental, and vision) insurance;
  • 401(k) plan with 4% matching (or equivalent);
  • Unlimited PTO — we strongly encourage at least 5 weeks each year;
  • Mental health, wellness, and personal development stipends;
  • Visa sponsorship if you wish to relocate to New York or Paris.


Similar Jobs

Explore other opportunities that match your interests

Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

mistral

France

Senior Frontend Engineer

Programming
6d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

mistral

France

Software Engineer

Programming
1w ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Not Applicable

bitstack

France

Subscribe our newsletter

New Things Will Always Update Regularly