Job description
Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision.
Through our subsidiaries, Alpaca is a licensed financial services company that serves hundreds of financial institutions across 40 countries with our institutional-grade APIs. These include broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, together totaling over 9 million brokerage accounts.
Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it.
Responsibilities
### Your Role:
As the DevOps Team Lead for our Core Foundation pod, you will lead the "engine room" of Alpaca’s most critical infrastructure initiatives. You will manage a talented, globally distributed team of engineers responsible for Heavy Compute, Core Networking, Stateful Data, Observability, and Cloud/Physical Infrastructure.
### Things You Get To Do:
- **People & Tech Leadership:** Lead, mentor, and foster a healthy, high-performing globally distributed engineering team.
- **Prioritization & Planning:** Own the execution and delivery of highly critical, complex yearly roadmap items centered around large-scale foundational infrastructure upgrades, high availability, and platform resilience.
- **Change Management Ownership:** Own and drive the change management processes across engineering and product domains.
- **Support Frameworks & Methodologies:** Design, implement, and refine robust support workflows, agile planning methodologies, and deployment/rollout strategies to ensure operational excellence.
- **On-Call & Incident Management:** Manage and optimize the global on-call rotation to ensure team well-being while maintaining high availability.
Requirements
### Who you are (must-haves):
- Proven experience as an Engineering Manager, DevOps Lead, or Site Reliability Engineering Lead, with a track record of successfully managing globally distributed teams.
- Exceptional people management skills, with a deep focus on coaching, mentoring, and fostering team culture across multiple time zones.
- Deep expertise in engineering support frameworks, roadmap planning, and team prioritization methodologies.
- Proven experience owning Change Management lifecycles.
- Extensive experience managing Incident Management lifecycles and running sustainable, global on-call rotations.
- Excellent communication and organizational skills.
- A solid technical background in modern DevOps/SRE ecosystems, including Kubernetes (GKE), Infrastructure as Code (Terraform), Relational Databases (PostgreSQL), and Observability stacks (Prometheus, Grafana, Thanos).
- A strategic mindset capable of navigating shifting priorities.
Conditions
### How We Take Care of You:
- Competitive Salary & Stock Options
- Health Benefits
- New Hire Home-Office Setup: One-time USD $500
- Monthly Stipend: USD $150 per month via a Brex Card
About Alpaca
Alpaca is a global leader in brokerage infrastructure APIs, providing access to stocks, ETFs, options, fixed income, and crypto, along with embeddable finance solutions like tokenization and securities lending. It serves retail traders, institutional investors, app developers, and fintech companies worldwide through its API-first platform. The company, originally started in 2015 as a database and machine learning firm, is headquartered in San Mateo, California.