Company Overview:
Our client is a global fintech company that provides technology and services to financial institutions and corporations worldwide. Their mission is to drive innovation and improve the efficiency of the financial industry.
Responsibilities:
- Communicate with internal and external stakeholders on technical and operational tasks
- Design, build, and maintain highly scalable and available systems
- Develop and implement monitoring, alerting, and automation tools to ensure system reliability and availability
- Work with development teams to identify and resolve issues in production systems
- Troubleshoot and resolve incidents and problems in a timely and effective manner
- Participate in the on-call rotation to provide 24/7 support for production systems
- Continuously improve system performance, reliability, and security
- Document system configurations, procedures, and best practices
Qualifications:
- Currently residing in Japan
- Fluency in Japanese (N1 minimum) and fluency in English
- Bachelor's or Master's degree in Computer Science or a related field
- At least 3 years of experience in Site Reliability Engineering or a related field
- 10+ years of management experience supporting APAC clients, with an emphasis on Japan-based asset servicing and TA clients
- Proficiency in at least one programming language (e.g. Java, Python, Go)
- Experience with automation and configuration management tools (e.g. Ansible, Chef, Puppet)
- Knowledge of Linux operating systems and shell scripting
- Familiarity with cloud computing platforms (e.g. AWS, GCP, Azure)
- Excellent problem-solving and troubleshooting skills
- Strong communication and collaboration skills
- Experience with containerization technologies (e.g. Docker, Kubernetes) is a plus
- Experience with log management and analysis tools (e.g. ELK, Splunk) is a plus
- Experience with database administration (e.g. MySQL, PostgreSQL) is a plus
If this position is not ideal for you, but you are looking for a new opportunity, please contact us to discuss your options.