- Putrajaya Federal Territory Malaysia

Working Location
Job Description
Responsibilities
What you'll be doing
As part of IT Innovation projects, you will join the AI Factory/Innovation team as an AI DevOps Engineer. You will be responsible for designing, implementing and maintaining the infrastructures, CI/CD pipelines and environments required for the deployment and operation of AI solutions in production.
You will work in an agile mode, closely collaborating with AI developers, Data Scientists, architects and CACEIS infrastructure teams. You will play a key role in the industrialisation of AI solutions and the automation of deployment processes.
As a vibe coding expert, you use generative AI tools (GitHub Copilot, Cursor, Claude, ChatGPT) to speed up the creation of scripts, IaC configurations, CI/CD pipelines and to quickly resolve production incidents.
Targeted Profile : DevOps Engineer with expertise in MLOps and strong command of vibe coding, able to build and maintain robust AI infrastructures while automating deployments and ensuring the performance of production solutions.
· Design and implementation of Cloud architectures for AI solutions (scalable, secure, optimised)
· Deployment and management of infrastructures as Code (Terraform, CloudFormation)
· Implementation of CI/CD pipelines for applications and AI models
· Expert use of vibe coding to generate scripts, configurations and automations
· Automation of deployments and rollbacks (blue/green, canary)
· Configuration of monitoring, alerting and observability for AI models in production
· Management of environments (dev, staging, production) and access
· Optimisation of Cloud costs and performance (GPU, compute, storage)
· Support to developers on tooling and DevOps best practices
· Technical documentation of infrastructures and procedures
· Implementation of DevSecOps practices and security compliance
· Sharing of MLOps and vibe coding best practices with the team
What we're looking for
· Proven experience in vibe coding with the use of AI tools to generate scripts, configurations and diagnose incidents
· Expertise in MLOps and deployment of AI models in production
· Proficiency with Cloud platforms (AWS, Azure, GCP) and Cloud AI services
· Expertise in Infrastructure as Code (Terraform, CloudFormation, ARM Templates)
· Strong command of containerisation and orchestration (Docker, Kubernetes, Helm)
· CI/CD expertise (GitLab CI, GitHub Actions, Jenkins, Azure DevOps)
· Knowledge of MLOps tools (MLflow, Kubeflow, Weights & Biases, SageMaker Pipelines)
· Proficiency in scripting (Bash, Python) and automation
· Experience in monitoring and observability (Prometheus, Grafana, ELK, DataDog)
· Knowledge of DevSecOps security practices
· Secrets and configuration management (Vault, AWS Secrets Manager)
· Nice to have: Experience with GPUs and optimisation of AI resources
Soft Skills:
· Ability to use AI to quickly diagnose and resolve incidents
· Pragmatism: balance between automation and delivery timelines
· Rigour in the design and security of infrastructures
· Innovative mindset and continuous technology watch
· Autonomy and proactivity in identifying issues
· Strong service orientation and support to development teams
· Collaborative and pedagogical mindset
· Responsiveness when dealing with production incidents
Pay: RM10,000.00 - RM15,000.00 per month
Work Location: In person
Important Information
Never provide your bank or credit card details when applying for jobs. Do not transfer any money or complete unrelated online surveys. If you see something suspicious, Report this Job ad.