At Demand.io, our mission is to be the world’s leading source of e-commerce knowledge. By aligning AI systems with communities of enthusiasts, creators, and shoppers, we create knowledge flywheels that operate at unprecedented scale and accuracy, powering our consumer shopping experiences.
Founded in 2009, we’re a self-funded, profitable company running multiple e-commerce consumer products at scale. Our portfolio includes SimplyCodes (simplycodes.com), an AI-powered social savings platform, and Product.ai (product.ai), an upcoming AI assistant for shopping. Our products are used by millions of shoppers to facilitate thousands of purchase decisions daily, driving over $1 billion in annual e-commerce transaction volume to our retail partner network.
We believe in aligned incentives for employees and stakeholders, operating with a unique model grounded in small team size comprised of top 1% talent, individual empowerment, AI-first thinking, rapid iteration, and outsized compensation tied to company performance.
What we're looking for
We’re hiring a Senior AI Engineer to join our small but growing team. This position will offer you the opportunity to play a lead role in designing, deploying, fine-tuning, and scaling our commerce-oriented large language models and data systems. We’re looking for candidates with direct experience designing and deploying generative AI systems in large production environments. You'll be joining a strong AI team developing and fine-tuning models and large scale data systems to power next-generation AI consumer experiences. We have a dedicated Nvidia A100 GPU server for always-on access to fast training and inference resources. Experience with retrieval augment generation (RAG) systems, vector databases, and knowledge graphs / graph databases are preferred.
What you’ll do:
- Leverage artificial intelligence, machine learning, and deep learning techniques to build powerful systems capable of parsing, categorizing, and organizing e-commerce content at scale.
- Use state-of-the-art transformers like LLMs to build AI-powered chatbots that can assist customers with their shopping needs, answering their questions and guiding them through the purchasing process.Help build an advanced LLM Ops pipeline used to deploy, monitor, integrate and train the models in an ongoing fashion.
- Develop and refine RAG systems, combining neural network-based language models with information retrieval techniques to enhance the accuracy and relevance of generated text.
- Design, build, deploy, and scale robust graph databases, utilizing platforms like Neo4j, Amazon Neptune, and TigerGraph, and integrate these databases into our RAG systems to enrich the depth and precision of our generative AI systems.
- Perform data engineering tasks including the development and maintenance of data crawlers, data ingestion pipelines, data pipelines, and managing workflows using tools such as Dataflow and Airflow.
About you:
- Bachelor's or master’s degree in Engineering, Computer Science, Artificial Intelligence, Data Science, or equivalent practical experience.
- Demonstrated expertise in multiple AI/ML architectures, particularly in transformers (like GPT and BERT), Large Language Models (OpenAI’s GPT and LLaMA), CNNs and RNNs as well as fine-tuning experience using libraries such as Axolotl, Huggingface Autotrain, or PyTorch Trainer.
- Ability to select the appropriate supervised or unsupervised learning strategy for a given problem.
- Experience with RAG or similar retrieval-augmented NLP systems.Experience with machine learning and related libraries such as TensorFlow, PyTorch, Scikit-learn, Pandas, NumPy, LangChain, LlamaIndex, and Hugging Face, NLTK, etc.
- Proficiency implementing and optimizing vector similarity search solutions using databases including but not limited to Pinecone, Faiss, ScaNN, Vald, Vespa, Qdrant, Chroma, Deep Lake, Supabase, Milvus, and Weaviate, with an understanding of their underlying indexing and query mechanisms.
- Experience in designing, building, deploying, and scaling knowledge graphs and graph databases is preferred, using platforms such as Neo4j, Amazon Neptune, TigerGraph, OrientDB, ArangoDB, Microsoft Azure Cosmos DB, and JanusGraph.
- Prior experience with LLM Ops stack including GPU clusters, MLflow, Docker, Kubernetes, Kubeflow, and monitoring tools such as ELK, Grafana, and Prometheus.
- In-depth knowledge and expertise with Python. Familiarity with JavaScript or TypeScript is a plus, as is proficiency in both object-oriented and functional programming paradigms.
- Deep understanding of building large-scale, low-latency distributed systems and a track record of architecting scalable APIs in high-demand production environments.
- Dynamic problem solver with a strong customer focus, adept at driving teams to build user-focused solutions.
About the job
- Starting cash compensation: $200,000 - $350,000 DOE.
- Stock options: 0.25% to 0.50% initial grant.
- Eligibility for our Equity Partners program, a profit-sharing system tied to individual and company performance.
- Opportunities for career growth, leadership, and skill expansion. We sponsor your ongoing career development, including education, courses, certifications, and books.Opportunity to work from the ground level to build and deliver an exciting upcoming AI consumer product, based on fine-tuned open source foundation language models and next-generation RAG + knowledge graph data system. We’ve procured our own A100 GPU server for continuous access to high-speed training and inference resources.
- Premium health coverage including comprehensive PPO and HMO options, along with full dental and vision coverage, paid 100% for all your dependents.
- Our Santa Monica HQ is a newly completed state-of-the-art technology development facility offering prodigious open space, open work setup, large recreation & break room with free food, silence / focus facilities, podcasting studio, and sweeping views from the ocean to downtown.
- Sponsored access to premium AI services, including ChatGPT Plus, Gemini Advanced, Perplexity Pro, GitHub Copilot, Midjourney Pro, Anthropic Claude, Notion, and more.
- Full coverage of your home internet and mobile phone plans.
- Regular team-building events, dinners and activities.401K program.
- Unlimited PTO.Relocation assistance available.
- H1B sponsorship available.
About our culture
- Our core values are innovation, empowerment, and continuous growth. We seek team members who want to take on significant ownership, have the natural desire to learn and grow, and who have an innate passion for products, innovation, and serving customers.
- We lean in-office as a team. While we embrace flexibility and some hybrid work, our team members prefer to collaborate in person and we’re generally present in the office on a daily basis. Fostering a healthy face-to-face culture is part of our identity as a company.
- We seek to keep our team small, working as a team of deep generalists with a bias towards action. Every team member has leadership growth opportunity here.
- We offer above-market compensation through our profit-sharing system. We believe in aligned incentives as our means for attracting top talent and retaining staff for long-term relationships.
- Read our culture backgrounder here.
- Meet the team and learn about our products and company vision at https://demand.io.