Date listed: 1 month ago
Employment Type: Contract
Remote: Yes
About Polymath
Polymath is an applied research lab focused on advancing long-horizon agent capabilities through reinforcement learning. We design and scale simulation environments where agents learn to operate safely and autonomously. We work with the world’s leading model labs to push the frontier of agent capabilities. Polymath is backed by Base10, Founders Future, Y Combinator, and other incredible investors & angels. We've raised an $8M seed round and are growing the team.
About the role
We’re looking for talented researchers currently enrolled in MS / PhD programs to collaborate on a research project focused on frontier benchmarks and environments for long-horizon AI agents. The work involves 1) identifying failure modes in frontier models, 2) developing rigorous benchmarks that evaluate how well frontier agents perform on complex, realistic tasks requiring long-horizon reasoning and tool use in dynamic environments, and 3) training autonomous agents that can reason, plan, and act over extended time horizons.
We can accommodate full-time or part-time engagements. Compensation is $200k / year, prorated to the number of hours committed. The residency is intended to culminate in a publication and, if there is a mutual fit, a transition into a full-time role. If you’re interested in joining Polymath but are not currently a student, please apply to the Member of Technical Staff role.
You’ll be a good fit if you:
Culture