You are viewing a preview of this job. Log in or register to view more details about this job.

Machine Learning Researcher

About Pinetree

Pinetree builds secure, production-grade computer-use agents that operate directly in real-world software environments such as web portals, enterprise systems, and legacy frontends. Our agents interact with systems the same way humans do, with strong guarantees around reliability, safety, and auditability. We work at the boundary between machine learning research and high-stakes enterprise deployment, with a strong focus on healthcare, insurance, and regulated workflows.

Role Overview

We are looking for a Machine Learning Researcher to work on core research problems in agentic systems, multimodal models, and learning under real-world constraints. This role is research-driven but tightly coupled to deployment. You will design, prototype, and evaluate novel ML methods that enable reliable computer use, long-horizon reasoning, and robust interaction with complex interfaces.

This is an ideal role for someone who enjoys working on open research questions but wants their work to ship and matter.

Responsibilities

Conduct research on agentic ML systems, including planning, reasoning, tool use, and long-horizon execution
Design and evaluate learning methods for computer use, including vision-language models, multimodal perception, and UI understanding
Develop benchmarks, datasets, and evaluation frameworks for agent reliability, robustness, and safety
Experiment with fine-tuning, distillation, and reinforcement learning for real-world task execution
Analyze agent failures and propose principled fixes grounded in theory and empirical results
Collaborate closely with engineers to translate research ideas into production systems
Author internal research reports and, when appropriate, external publications

Required Qualifications

Strong background in machine learning, AI, or a related field
Solid understanding of at least one of the following:
- Agentic systems and planning
- Multimodal learning and vision-language models
- Reinforcement learning or sequential decision making
- Program synthesis or tool-augmented models
Ability to design rigorous experiments and analyze results critically
Comfortable working in fast-moving, ambiguous problem spaces

Preferred Qualifications

Research experience in agents, robotics, HCI, or applied ML
Familiarity with browser automation, web agents, or UI interaction models
Publications in NeurIPS / ICLR / ICML / ACL / EMNLP or other respected conferences or journals
Experience working with high-stakes or regulated domains such as healthcare or finance

What We Offer

Opportunity to work on foundational problems in real-world AI agents
Direct ownership over research direction and impact
Close collaboration with a highly technical founding team
A culture that values rigor, clarity, and shipping real systems