Please beware of recruitment scams that are currently targeting jobseekers. Click here for further advice.
Back to jobs
(Senior) Data Engineer - (Customer Data Platform)
Job description
Key Responsibilities:
- Build and maintain scalable data pipelines using AWS Glue, EMR, Airflow
- Manage data lake architecture, ensuring secure and compliant data lifecycle practices
- Refactor and optimize PySpark jobs for performance and cost efficiency
- Automate pipeline deployment via GitLab CI/CD
- Collaborate with cross-functional teams to define and implement data integration strategies
- Build internal tools including LLM-powered chatbot
Requirements:
- Bachelor's degree in Computer Science, Data Engineering, or related field
- 2+ years of experience in data engineering, preferably in cloud environments
- Strong proficiency in AWS services, PySpark, SQL, and Python
- Experience with OpenMetadata, Airflow, GitLab CI/CD, and web scraping
- Familiarity with LLM models, RAG architecture, and chatbot development is a plus
- Excellent communication and collaboration skills
If you are interested in this opportunity, please send your CV to jennie.jiang@ambtech.com