Senior Site Reliability Engineer - Shah Alam, Malaysia - Ideagen PLC

    Ideagen PLC
    Ideagen PLC Shah Alam, Malaysia

    2 weeks ago

    Default job background
    Permanent
    Description

    Role Purpose

    Due to our exponential growth plans the technology team have created this exciting and challenging opportunity.

    With a passion for infrastructure engineering and the ability to solve automation and reliability challenges, you will have experience working as a Site Reliability/CloudOps Engineer. Located within the globally distributed Cloud Operations team you will be working on our infrastructure platforms using cutting-edge technologies such as Kubernetes, containers, and automation across AWS and Azure.

    Responsibilities


    • Manage, monitor, and maintain our infrastructure platforms across a multi-cloud environment.

    • Manage design, development, and operational changes to cloud based infrastructure services.

    • Provide operational support and be able to co-ordinate with other teams during incidents that may impact service.

    • Work to improve the reliability, quality, performance, and scalability of our infrastructure.

    • Continually measure and optimise system performance.

    • Enable the engineering organization to innovate and deliver with greater speed and safety

    Skills and Experience

    We don't expect you to be an expert in everything but with our technology stack experience of some of the following is essential:

  • Experience in production 24/7 high-availability SaaS environments based on AWS.
  • Experience of working with orchestration and containerisation e.g. Docker, Kubernetes, EKS/AKS etc
  • Deep knowledge of AWS tools and products, that follow the AWS Well-Architected Framework.
  • Experience of working alongside development functions delivering software within an agile development environment.
  • Strong scripting skills in various languages such as Python, BASH, and/or PowerShell.
  • Working alongside development functions delivering software within an agile development environment.
  • Must be a team player, with exceptional communication skills, working well with others in the group and the rest of the engineering organization.
  • Familiarity with Cloud security and governance models.
  • Proven ability to grasp new technical concepts quickly.
  • Desirable:

  • Strong understanding of Software Development Lifecycles
  • Experience with designing & architecting distributed systems on AWS.
  • Experience of CI/CD such as Jenkins, GitLab, Azure DevOps etc.
  • Experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation
  • Knowledge on Automation tools.
  • Good to have knowledge on Grafana, Prometheus.
  • knowledge of Linux troubleshooting, including networking, file systems, security, and the kernel.
  • Experience with compliance standards based infrastructure such as ISO27001, Cyber Essentials & FedRAMP, and general regulatory compliance management.
  • Exposure to ITIL concepts and adoption.
  • Behavioral

  • Ambitious - Drive, Planning & Execution
  • Adventurous - Flexibility & Resilience and Savvy Thinking
  • Community - Collaboration & Communication
  • Options

    Sorry