We are promoting this job opportunity as provided by a third party, the employer. In case of your interest in this job opportunity and for more details please click on "Apply" button below, which will take you to the employer's website
Machine Learning Engineer - Infrastructure | Research focused - LLM / AI Safety
An opportunity to Join a Leading AI Innovation Team as a Machine Learning Infrastructure Engineer. We are working with a London-based start-up dedicated to engineering cutting-edge AI systems poised to revolutionize industries worldwide.
The Role
As a Machine Learning Infrastructure Engineer, you will play a crucial role in developing a robust framework for rapid training and experimentation of large language models.
Please note that this role requires in-person presence in London, but this role offers visa sponsorship and relocation support. Candidates should possess a minimum of 2-3 years of professional experience in a similar capacity.
Responsibilities
- Develop the core inference engine used to serve large machine learning models to customers at scale and across distributed systems.
- Contribute significantly to the internal automated pipeline enabling high throughput training runs and rapid experimentation achieving top hardware efficiency optimisation.
- Collaborate in defining and steering our evolving inference and training stack working with researchers, founders and advisors to develop the next generation of high availability LLM’s.
- Build and grow our engineering organisation, setting a high bar of excellence that propels us forward.
Qualifications
We are seeking candidates with exceptional ML engineering evidenced by:
- Experience in creating and managing high-performance computing clusters across GPU/TPU, preferably in PyTorch.
- Proficiency in efficient serving of large machine learning models at scale, including quantization and distributed computing, leveraging libraries such as deepspeed.
- Strong software engineering acumen with expertise in software design/architecture, particularly in Python.
- Understanding of the latest AI research and ability to efficiently implement these systems.
- Prior experience at a leading machine learning company (OpenAI, DeepMind, Meta, Anthropic, HuggingFace, etc.).
Nice To Have
- Experience as an early engineer at a fast-growing startup.
- Interest in and consideration of the impacts of AI technology.
Interested? Apply directly through LinkedIn, or send your CV to george@eu-recruit.com
Key Words: Machine Learning / LLM / Large Language Model / PyTorch / High Performance Computing / HPC / GPU / TPU / Deepspeed / AI / OpenAI / Distributed Systems
By applying to this role, you understand that we may collect your personal data and store and process it on our systems. For more information please see our Privacy Notice content/uploads/2020/12/Privacy -Notice.pdf
Send Me Alerts About Jobs Like This.
Please enter your email address to continue setting up an email alert for similar jobs to this one. By entering your email address and clicking apply you will sign up to Jobs4 and agree to our terms and conditions .
This page can't load Google Maps correctly.
#J-18808-Ljbffr