CareerCross uses cookies to enhance your experience on our websites. If you continue to view our sites without changing your browser settings, then it is assumed that we have your consent to collect and utilise your cookies. If you do not want to give us your consent, then please change the cookie settings on your browser. Please refer to our privacy policy for more information.
CareerCross uses cookies to enhance your experience on our websites. If you continue to view our sites without changing your browser settings, then it is assumed that we have your consent to collect and utilise your cookies. If you do not want to give us your consent, then please change the cookie settings on your browser. Please refer to our privacy policy for more information.
| Hiring Company | Global Insurance Firm |
| Location | Tokyo - 23 Wards |
| Job Type | Permanent Full-time |
| Salary | 5 million yen ~ 11 million yen |
We're looking for a DevOps / SRE to own reliability across our critical services — defining what "good" looks like, building the systems to get there, and partnering closely with product teams to make reliability a first-class concern from day one.
What you'll do
Define and implement SLOs and SLIs for critical services; own reliability design end-to-end, from architecture review to production rollout
Drive automation of toil and operational work; continuously improve incident response, on-call practices, and post-mortem processes
Strengthen system resilience through capacity planning, chaos engineering, and fault injection testing
Build and maintain full observability coverage — metrics, logs, and traces — alongside actionable alerting and well-maintained runbooks
Lead performance analysis, load testing, and capacity management to proactively address bottlenecks before they become incidents
Implement and operate reliability design patterns such as circuit breakers, exponential backoff, bulkheads, and retry strategies
Operate and scale infrastructure, including Kubernetes and service meshes; write and maintain automation scripts in Python and Bash
Partner with product engineering teams to balance reliability targets with development velocity, embedding SRE practices early in the delivery cycle
Collaborate across teams to raise the overall engineering quality bar — through knowledge sharing, documentation, code review, and mentorship
| Minimum Experience Level | Over 3 years |
| Career Level | Mid Career |
| Minimum English Level | Business Level |
| Minimum Japanese Level | Business Level |
| Minimum Education Level | Bachelor's Degree |
| Visa Status | Permission to work in Japan required |
| Job Type | Permanent Full-time |
| Salary | 5 million yen ~ 11 million yen |
| Industry | Insurance |
| Company Type | Large Company (more than 300 employees) - International Company |