Job Title:

SwarmBench Task Engineer (Planning/Operations) - 75063

Company: Turing

Location: New Delhi, Delhi

Created: 2026-05-16

Job Type: Full Time

Job Description:

About Turing:

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, and top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Role Overview:

We are looking for a SwarmBench Task Engineer specializing in planning and operations to design and build complex, multi-agent benchmark tasks that simulate real-world planning, scheduling, and operational decision-making scenarios. This role focuses on creating constraint-rich problems that evaluate multi-agent reasoning, decomposition, and optimization capabilities in realistic environments.

What does day-to-day life look like?

- Design and develop multi-agent benchmark tasks involving:
  - Planning, scheduling, and resource allocation
  - Operational decision-making (project management, logistics, incident response, capacity planning)
- Create constraint-rich problem statements with multiple interacting variables
- Develop verification scripts to evaluate:
  - Feasibility (all constraints satisfied)
  - Completeness (all requirements addressed)
  - Optimality (efficient solutions)
- Build decomposition strategies:
  - Split tasks across specialized sub-agents (resource-based, constraint-based, conflict resolution, optimization)
- Model real-world operational scenarios with dependencies, timelines, and resource constraints
- Collaborate on improving task quality, coverage, and evaluation rigor

Requirements:

- 5+ years of experience in operations, project management, logistics, supply chain, or AI research, or a strong computer science research background
- Strong ability to formalize constraints, dependencies, and scheduling logic
- Proficiency in Python for building verification and validation scripts
- Strong structured problem-solving and decomposition skills
- Clear and precise technical writing skills
- Experience with AI coding benchmarks (e.g., SWE-bench, Terminal-bench)
- Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Nice to have:

- Experience with optimization techniques (linear programming, constraint satisfaction, scheduling algorithms)
- Background in operations research
- Experience with simulation or modeling tools
- Knowledge of AI planning systems or automated reasoning
- Project management experience or certifications (PMP, Agile, etc.)

Perks of Freelancing With Turing:

- Work in a fully remote environment.
- Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details:

- Commitments Required: 40 hours per week, with a 4-hour overlap with PST.
- Engagement Type: Contractor assignment (no medical/paid leave)
- Duration of Contract: 4 weeks (adjustable based on engagement)
