
- Employment
- Full-time
- Seniority
- Staff
About the role
Introducing Moonlake, AI for creating real-time interactive content
Mission: As an applied AI Research Engineer: Code agents (post training + systems)
Scope of Work:
- Agentic systems design: Tool catalogs, function calling, program synthesis/repair loops, ReAct/Reflexion/ToT/LangGraph-style control, self-verification, sandboxed execution.
- Evaluation mindset: Build task suites for multi-step coding Full-stack LLM engineer: Prompt libraries, routing, retrieval, KV-cache management, streaming, telemetry.
- Security & isolation: Docker/firejail, network egress controls, secrets hygiene, dependency pinning, supply-chain sanity.
- Strong post-training: Supervised fine-tuning + preference/trace RL (DPO/RLAIF/RLHF), dataset curation, reward shaping, safety filters.
Tech signals:
- Has shipped agents that pass real repos' test suites end-to-end.
- Has published papers in agentic systems/ code gen Contributions to agent frameworks or OSS evals (LangGraph/AutoGen/Guidance/LEAP, SWE-bench variants, EvalPlus).
- Built datasets from execution traces; can show improvements from data > params.
We are committed to being an on-site, in-person team currently based in San Mateo
731,000+ hidden jobs like this
Embedding VC and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites