Job Overview

  • Date Posted
    April 24, 2024
  • Location
  • Expiration date
  • Industry
    Software Development
  • Qualification
    Professional Certificate, Bachelor Degree (B.Sc.)
  • Career Level
    Mid, Senior

Job Description

Responsible for ensuring the Production service is prioritized and that all service incidents, problems and requests for cloud hosted services are responded to and actioned.
Responsible for maintaining the reliability and security of the Cloud Hosted environments.
Improve Observability and Telemetry in the Cloud Hosted environments, using SRE methodology to define SLAs, SLOs and SLIs.
Ensure risks within the Cloud hosted environment are documented and regularly reviewed. Identified operational risk issues are captured with appropriate actions tracked to agreed timelines. Define and implement standards and procedures to adhere to current best practice and drive continual service improvement.
Responsible for ensuring Security standards are implemented and maintained in the Cloud hosted environment. This includes delivering upgrades and security updates to minimise risk and ensure stability for all cloud hosted services.
Responsible for maintaining service resilience for all cloud hosted services, including backup and disaster recovery processes. Where necessary, plan and conduct quarterly DR tests for all cloud hosted services, ensuring any findings are captured and addressed promptly.

Able to understand and use AWS including an understanding of AWS services, security and networking.
Knowledge of at least 1 programming language, preferably Python.
Knowledge of CI/CD as it applies to Cloud Hosted environments, including an understanding of some of the associated Infrastructure as Code and automation tools such as Git, Terraform, Ansible and Jenkins.
Possesses a strong service-orientated mindset and can consistently deliver a high level of service to the business.
Able to communicate effectively with both business and technical staff at all levels. This includes communicating complex technical issues to different levels of management.
Able to work proactively, own complex deliveries and provide regular updates to management and stakeholders.

Minimum job-related Experience Required:

Must have strong technical operational skills in supporting AWS Cloud Hosted environments and at least 3 years' experience in an Infrastructure support role.
Strong understanding of Infrastructure as Code technologies, ideally including Terraform and Ansible.
Preferably at least one year’s experience using SRE methodologies within a support team and an understanding of the associated Service Level metrics.
Experience of operational risk and control management processes, including an understanding of Security best practice and how to apply it safely within a Production environment.
Experience of asset management and lifecycle (EOS/EOL) process management.
Experience of planning and leading disaster recovery fail-overs of IT systems and services.
Preferably experience of working in a regulated financial services / banking organization.

Minimum Education Required:

Educated to Bachelor’s degree level or equivalent
AWS certified to at least Associate level
Industry-standard IT certification desired, e.g. Microsoft or Terraform