The vacancy is well-defined with clear responsibilities and salary, but lacks detailed company information and work processes.
Job description
Hibachi is hiring a remote Production Engineer responsible for alert response, incident investigation, and improving observability tooling.
Responsibilities
- Respond to production alerts, triage, investigate, and provide context.
- Trace issues across logs, databases, and codebase to identify root causes.
- Write incident summaries and runbooks.
- Improve and maintain dashboards.
- Build tooling and automation.
- Identify and close visibility gaps.
Requirements
- Experience with live production systems.
- Ability to read and trace codebases.
- Comfort writing database queries.
- Familiarity with monitoring tools like Datadog, Grafana, CloudWatch.
- Clear written communication.
- Comfort with on-call responsibilities.