Job Description
We're Hiring: Machine Learning Engineer!
We are looking for a skilled and innovative Machine Learning Engineer to join our dynamic team in Singapore. The ideal candidate will have extensive experience in developing machine learning models, data analysis, and algorithm optimization to drive impactful solutions.
Location: Singapore, Singapore
Work Mode: Work from Office
Role: Machine Learning Engineer
We are seeking an experienced Machine Learning Engineer with 10+ years of overall experience and at least 6 years of relevant experience in Machine Learning Engineering, Data Engineering, and Big Data platforms. The ideal candidate will be responsible for designing, developing, and optimizing large-scale data and machine learning systems that support real-time and batch processing workloads across enterprise environments.
This role requires strong expertise in the Hadoop ecosystem, distributed data processing, machine learning operationalization, and scalable data platform development. The successful candidate will work closely with Data Scientists, Data Engineers, and Platform Teams to build robust data ingestion frameworks, operationalize machine learning models, and deliver high-performance AI-driven solutions capable of processing structured, unstructured, and multi-modal data.
Key Responsibilities
Design, develop, and maintain highly scalable real-time and batch processing systems using Hadoop ecosystem technologies including Spark, Kafka, Flink, Hive, Iceberg, Trino, NiFi, Ranger, and Ozone.
Build and enhance robust data ingestion, transformation, and processing frameworks capable of handling structured, semi-structured, and multi-modal data sources including image, audio, video, and unstructured documents.
Develop scalable data pipelines using Java, Python, Spark, and shell scripting to support enterprise data and AI workloads.
Collaborate closely with Data Scientists to deploy, operationalize, monitor, and maintain machine learning models in production environments.
Utilize Cloudera Machine Learning (CML) and related ML platforms to support model lifecycle management and deployment.
Design and implement data engineering solutions that support both real-time streaming and large-scale batch processing requirements.
Develop internal engineering tools, automation solutions, and full-stack applications using Python and modern web frameworks such as Flask and React.
Tune, optimize, and troubleshoot Hadoop-based applications and distributed data processing systems.
Monitor system performance, resource utilization, and platform stability while implementing improvements to maximize efficiency and scalability.
Ensure adherence to data governance, security, and access control standards using tools such as Apache Ranger.
Participate in architecture reviews, technical design discussions, and platform modernization initiatives.
Support CI/CD implementation, deployment automation, and operational excellence practices for machine learning and data engineering workloads.
Stay current with emerging technologies, machine learning frameworks, and big data platform innovations.
Job Qualifications and Requirements
Bachelor's degree in Computer Science, Information Technology, Data Science, Engineering, or a related field.
Minimum 10 years of overall software, data engineering, or platform engineering experience.
At least 6 years of hands-on experience in Machine Learning Engineering, Big Data Engineering, or Data Platform Development.
Strong expertise in Hadoop ecosystem technologies (Apache Spark, Apache Hive, Apache Kafka, Apache Flink, Apache NiFi, Apache Iceberg, etc.).
Strong programming experience in Java and/or Python.
Extensive experience developing data ingestion, transformation, and processing frameworks.
Hands-on experience with batch and real-time data processing architectures.
Strong understanding of distributed computing, scalable data platforms, and high-volume data processing systems.
Experience operationalizing machine learning models using platforms such as Cloudera Machine Learning (CML), Spark MLlib, or equivalent ML platforms.
Experience with machine learning libraries such as scikit-learn, XGBoost, and related Python-based ML frameworks.
Strong proficiency in shell scripting and automation.
Experience with performance tuning and optimization of Hadoop-based applications and clusters.
Strong problem-solving, analytical, and troubleshooting skills.
Excellent communication and stakeholder collaboration abilities.
Experience working within Agile development environments.
Ready to take your career to the next level? Apply now and be a part of our innovative journey!