Please beware of recruitment scams that are currently targeting jobseekers. Click here for further advice. 
            
            
            
        
					
						Back to jobs
					
					
					
					  
			             
					
						
						
							
						
					 
				
				
					
				
			(Senior) Data Engineer - (Customer Data Platform)
Job description
Key Responsibilities:
- Build and maintain scalable data pipelines using AWS Glue, EMR, Airflow
 - Manage data lake architecture, ensuring secure and compliant data lifecycle practices
 - Refactor and optimize PySpark jobs for performance and cost efficiency
 - Automate pipeline deployment via GitLab CI/CD
 - Collaborate with cross-functional teams to define and implement data integration strategies
 - Build internal tools including LLM-powered chatbot
 
Requirements:
- Bachelor's degree in Computer Science, Data Engineering, or related field
 - 2+ years of experience in data engineering, preferably in cloud environments
 - Strong proficiency in AWS services, PySpark, SQL, and Python
 - Experience with OpenMetadata, Airflow, GitLab CI/CD, and web scraping
 - Familiarity with LLM models, RAG architecture, and chatbot development is a plus
 - Excellent communication and collaboration skills
 
If you are interested in this opportunity, please send your CV to jennie.jiang@ambtech.com