The vacancy is well-detailed with clear responsibilities and compensation, but lacks company links for further information.
Job description
Chaos Labs builds financial AI products that power safer, more accessible markets. Our risk management systems, analytics, and AI platform serve hundreds of billions in value for leading protocols and exchanges, including Kraken, Aave, Ethena, and Pendle. Since our founding in 2021, we've set the industry standard for on-chain risk management.
Responsibilities
- Design and build single and multi-agent systems with planning, memory, and tool use
- Build and operate MCP servers with secure schemas and permissions
- Develop agentic workflows using LangGraph or equivalent frameworks
- Integrate LLMs via SDKs; manage prompts, structured outputs, and tool calling
- Define and run LLM evaluations for quality, correctness, latency, cost, and regressions
- Build observability and reliability infrastructure: logging, tracing, retries, state management
- Optimize performance and cost from prototype to production
- Mentor engineers and establish agentic best practices
Requirements
- 5+ years software engineering; 2+ years building production AI/ML systems
- Hands-on experience with agentic architectures and tool calling
- Practical experience with MCP servers
- Experience with LangGraph or equivalent agentic frameworks
- Experience designing and operating LLM evaluation pipelines
- Strong Python and API design skills
- Familiarity with RAG pipelines, vector databases, and embedding-based retrieval
Conditions
- Competitive compensation & equity package
- Career growth opportunities in a rapidly expanding, global technology company
- 21 vacation days + 7 sick days + 8 observed U.S. company holidays
- 100% employer-paid health coverage options for you and your dependents
- FSA / HSA options depending on selected health insurance plan
- Parental leave policy
- Wellness programs including OneMedical, Teladoc, Talkspace, and EAP
- Pre-tax commuter benefits
About Chaos Labs
Chaos Labs provides institutional-grade risk management, analytics, AI tooling, oracles, and simulations for DeFi protocols and crypto institutions, turning complex onchain and offchain data into actionable intelligence. It serves leading protocols like Aave, Jupiter, GMX, and dYdX to optimize risk, enhance capital efficiency, and secure billions in assets.