Educational requirements: High school
English requirements: Limited
Required years of skilled work experience: none
Required residence status: Temporary visa, Permanent resident, Citizen
Remote work: not accepted
*** The Role ***
You'll act as the fulcrum between Product Engineering, Quality Assurance, Operations, Configuration and DevSecOps, harmonising the efforts of all of them and helping to craft solutions to operational pain points.
You'll independently design and implement complex tasks spanning multiple components of the whole platform, not limited to infrastructure. You'll make sure they are scalable, resilient, highly available, secure, backed up, testable and tested, properly documented, monitored and alerted on.
*** Your tasks ***
- Contributing to a high quality service and delivery culture
- Caring for and nurturing your fellow Trade Legends, while not being afraid to be firm when required
- Ensuring that there is an effective process of continuous improvement
- Ongoing monitoring of the organisation's application and infrastructure architecture, checking for performance, stability and compliance
- Maintaining all pre-production and production environments
- Contributing to the DevOps strategy for the business
- Contributing to strategy and automated pipelines
- Contributing to environment architecture as a whole
- Owning and revamping CI/CD pipelines
- Making sure everything is meticulously documented
- Ensuring we stay true to our architectural principles with ongoing monitoring of the organisation's application and infrastructure architecture, checking for performance and compliance
- Routine maintenance and ad hoc investigation of environmental issues
*** Requirements ***
****** Must have ******
- Excellent understanding of and extensive experience with AWS, including services from the Compute, Containers, Database, Networking, Storage, Management & Governance and Security & Identity areas
- Proven commercial experience and in-depth knowledge of managing Kubernetes clusters at scale
- Implementing infrastructure and surrounding components in Terraform
- Monitoring of infrastructure and the applications deployed on it
- Experience with tools like Datadog, AppDynamics, Splunk, Sumo Logic, New Relic, Prometheus, Grafana, Elastic
- Building pipelines in tools like Jenkins, CircleCI, Buildkite, GitHub Actions, Harness
- Scripting in a language like Bash, Go or Python
- Writing technical documentation for coworkers and the wider team
****** Nice to have ******
- Experience with Helm
- Experience with service meshes like Istio or Linkerd
- Experience with GitOps tooling like Flux or Argo CD
- Understanding of container runtimes, including security and performance aspects