Software Engineer (Data Engineer / Data Science) - SWE Bench Evaluation

netrolynx ai • India
Remote
Apply
AI Summary

Design and implement data pipelines for benchmark-driven evaluation of AI systems. Work with structured and unstructured datasets to ensure data quality and integrity. Collaborate with researchers to develop challenging data engineering tasks for AI benchmarking.

Key Highlights
SWE Bench-style evaluation projects
Production-like datasets and data pipelines
Python proficiency required
Collaboration with top AI researchers
Remote work flexibility
Key Responsibilities
Work with structured and unstructured datasets to support SWE Bench-style evaluation tasks, ensuring data quality and integrity.
Design, build, and validate data pipelines used in benchmarking and evaluation workflows to facilitate accurate and reproducible results.
Perform data processing, analysis, feature engineering, and validation to support various data science use cases.
Write, run, and modify Python scripts to process data and support experimental workflows locally, ensuring efficiency and reliability.
Evaluate data quality, transformations, and outputs for correctness, reproducibility, and adherence to project standards.
Create clean, well-documented, and reusable data workflows that can be integrated into benchmarking frameworks.
Participate in code reviews to maintain high standards of code quality, readability, and maintainability.
Collaborate with researchers and engineers to design challenging, real-world data engineering and data science tasks for AI evaluation systems.
Technical Skills Required
Python Data Engineering Data Science Machine Learning Data Processing Feature Engineering Data Validation Code Review Algorithmic Problem Solving
Benefits & Perks
Fully remote work
Flexible working hours
Competitive engagement terms
Continuous learning and growth
Exposure to cutting-edge AI projects

Job Description


About The Company

Based in San Francisco, California, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports its clients by accelerating frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers specializing in coding, reasoning, STEM, multilinguality, multimodality, and agents. Additionally, Turing helps enterprises transform AI from proof-of-concept to proprietary intelligence by developing reliable systems that deliver measurable impact and drive lasting results on the P&L. The company's innovative approach and commitment to excellence position it as a pioneer in the AI industry, fostering collaboration between cutting-edge research and practical enterprise solutions.

About The Role

We are seeking experienced Software Engineers (SWE Bench - Data Engineer / Data Science) to join our team and contribute to benchmark-driven evaluation projects focused on real-world data engineering and data science workflows. In this role, you will work hands-on with production-like datasets, designing and implementing data pipelines, performing data processing and analysis, and supporting experiments that evaluate the performance of advanced AI systems. The ideal candidate will possess a strong foundation in data engineering and data science, with the ability to work across various stages of data preparation, analysis, and modeling within complex codebases. This position offers an exciting opportunity to collaborate with top researchers and engineers to develop meaningful benchmarks that push the boundaries of AI technology.

Qualifications

The ideal candidate will have a minimum of three years of experience as a Data Engineer, Data Scientist, or Software Engineer with a focus on data workflows. Proficiency in Python is essential, particularly for data processing, analysis, and model-related tasks. Demonstrable experience working with structured and unstructured data, coupled with a solid understanding of machine learning and data science fundamentals, is required. Candidates should have the ability to navigate and modify complex, real-world codebases and produce clean, reusable, and well-documented code. Strong problem-solving skills, especially in algorithmic or data-intensive problems, are vital. Excellent communication skills in English, both spoken and written, are also necessary for effective collaboration within cross-functional teams.

Responsibilities

  • Work with structured and unstructured datasets to support SWE Bench-style evaluation tasks, ensuring data quality and integrity.
  • Design, build, and validate data pipelines used in benchmarking and evaluation workflows to facilitate accurate and reproducible results.
  • Perform data processing, analysis, feature engineering, and validation to support various data science use cases.
  • Write, run, and modify Python scripts to process data and support experimental workflows locally, ensuring efficiency and reliability.
  • Evaluate data quality, transformations, and outputs for correctness, reproducibility, and adherence to project standards.
  • Create clean, well-documented, and reusable data workflows that can be integrated into benchmarking frameworks.
  • Participate in code reviews to maintain high standards of code quality, readability, and maintainability.
  • Collaborate with researchers and engineers to design challenging, real-world data engineering and data science tasks for AI evaluation systems.

Benefits

Working as a freelancer with Turing offers the flexibility of a fully remote environment, allowing you to work from anywhere. You will have the opportunity to engage with cutting-edge AI projects alongside leading language model companies, expanding your expertise and professional network. Additionally, Turing provides a platform for continuous learning and growth through exposure to innovative technologies and methodologies. The role offers competitive engagement terms, flexible working hours, and the chance to contribute to impactful AI solutions that shape the future of technology.

Equal Opportunity

Turing is committed to fostering an inclusive environment where all qualified individuals have equal opportunity for employment. We value diversity and are dedicated to creating a workplace that respects and celebrates differences. We do not discriminate based on race, color, religion, gender, gender identity or expression, sexual orientation, national origin, age, disability, or any other protected status. Our goal is to ensure that every team member feels valued, supported, and empowered to contribute their best.


Similar Jobs

Explore other opportunities that match your interests

Senior Data Analyst

Data Science
•
3d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Associate

CareerXperts Consulting

India
Visa Sponsorship Relocation Remote
Job Type Part-time
Experience Level Not Applicable

hired

India

AI Model Trainer

Data Science
•
6d ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Associate

fetchjobs.co

India

Subscribe our newsletter

New Things Will Always Update Regularly