Architecting high performance, highly-available (HA) and scalable technical solutions on Amazon Web Services.
Provide an efficient way to provision and modifying existing infrastructure using code.
Allow developers to create and deploy applications faster on various environments.
Provide an automated way to build, test & deploy apps to the cloud.
Provide least privilege access and shut down unnecessary services to reduce surface area for attack.
Choosing the right system to manage the data and provide optimum performance using various storage technology.
Create dashboards to understand current systems health and performance.
* Lead a team of DevOps engineers to build and maintain cloud infrastructure according to best
practices.
* Guide DevOps engineers in building tools to automate operations, enhance productivity, and
maintain and improve CI/CD
pipelines.
* Collaborate with software development and IT operations teams to ensure smooth deployment to the
production environment.
* Continuously review existing systems, gather feedback from users, and develop new plans for
improvement.
* Recruit and train DevOps engineers to ensure they are equipped with the right tools and knowledge.
* Update and maintain Jenkins pipeline. Troubleshoot the pipeline whenever there is issue.
* Update and maintain Terraform code to ensure AWS resources are deployed automatically via
infra-as-code.
* Update and maintain Ansible code to ensure application deployment on EC2 instances can be done
automatically.
* Monitor systems performance and issues via Instana. Solve any issues detected during the
monitoring process.
* Participate in root cause analysis whenever there is downtime on the systems.
* Performs senior-level responsibilities for the overall system architecture, design, installation,
configuration, technical support, and maintenance of system mainly hosted in Major Cloud Providers.
* Works with limited supervision to establish, monitor and maintain cloud environment, systems
hardware, operating systems, and related network and security infrastructure to ensure reliable
operations.
* Monitors cloud systems for optimal performance and establishes and monitors best practices,
policies, and procedures.
* Optimize network infrastructure for IaaS, SaaS, PaaS and other cloud applications.
* Recommend and plan for future growth of systems taking into consideration capacity planning,
monitoring, Disaster Recovery, and Business Continuity for operating infrastructure.
* Research connectivity, performance and related security issues to determine root cause and
implement a plan of action to resolve these issues.
* Lead and design technical infrastructure & cloud processes, integrate solutions into existing
infrastructure, consult on development projects, help deploy solutions that meet business and
technical requirements.
* Provide technical leadership to teammates through coaching and mentorship.
* Maintain the stability and performance of the cloud platform. Additionally, provide operational
support on all cloud
solutions.
* Design, develop and maintain Infrastructure-as-Code (IAC) that automates and orchestrates
Continuous Integration and Continuous Development (CI/CD) which enable agile team to deliver quality
application.
* Design, develop and implement monitoring system by using tools such as Cloudwatch, NewRelic,
Wormly and PagerDuty alerts to ensure high availability of the system. Troubleshoot incidents and
provide Post Incident Report for future system improvement.
* Drive projects assigned by management to improve cost efficiency, security and implement industry
best practices.
* Collaborate with agile team to improve engineering tools, systems and procedures.
* Mentor other engineers about DevOps/SRE practices and help build a fast growing team.
Kevin See — [email protected] — (+60) 16 225 1805