Machine Learning Engineer (Infrastructure) Opportunity

tech talent partners company


Machine Learning Engineer (Infrastructure) in UNITED KINGDOM

Visa sponsorship & relocation | Posted 1 year ago

Machine Learning Engineer (Infrastructure)

£150-300K salary

Equity

London based


*This is not a straightforward ML Engineer role; it is more closely aligned with the infrastructure end of the ML spectrum. Please read the details below carefully to ensure your skills align:


A well-funded London-based AI start-up is looking for a Machine Learning Engineer to spearhead the scaling of their fine-tuning and inference frameworks for LLMs.


The engineer will think in terms of tokens per second as the key metric for language models, will have created multi-instance clusters for parallel training across GPUs/TPUs (ideally using PyTorch and Kubernetes), and will have experience serving large machine learning models at scale, including distributed computing.


The team need to move from an environment that currently trains 7B-parameter models to one that can train models 10x that size. This is where you step in.


To summarise, we're looking for profiles that demonstrate experience with:


  • Cluster creation across GPUs
  • Serving large ML models at scale
  • Distributed computing


Key technologies: Python, PyTorch, Google Cloud, SkyPilot, DeepSpeed, Ray, Slurm, Kubernetes clusters.


This is an exciting opportunity to join an exceptional team of researchers and engineers, all working towards building a safer future for AI. If you consider yourself a close match, please apply with your CV and we'll be in touch.


*Sponsorship and relocation packages are available for applicants coming from anywhere in the world. This role is based in their central London office.



