Salary range - $200k - $300k | Equity - 0.25-0.5% | In-person NYC

APPLY HERE or email us at [email protected] with your resume and references to past work!

Role Overview

We’re looking for a Research Engineer to work across our open-source repos, inference API, and model training stack. You’ll operate at the intersection of applied research and engineering — shaping the models that power real-world document intelligence systems used by enterprises and developers globally.

You’ll be training and evaluating new model architectures, integrating them into production, and shipping updates across our open-source ecosystem. You’ll also help close the loop with users — investigating issues, improving benchmarks, and turning real feedback into better model performance.

Our team focuses on training small, efficient models that outperform much larger LLMs on domain-specific tasks (like OCR, structured extraction, and math recognition). We move fast, prioritize practical results, and build tools that are open, reproducible, and built to last.

Day to day, you will:

Ideal Candidate

You’ve shipped models that made it into production. You understand how to balance exploration with delivery, and how to turn research insights into products people actually use. You work autonomously and thrive in unstructured environments, but you’re also a strong collaborator — you communicate clearly, document your work, and elevate the people around you.