(Senior) Data Engineer - (Customer Data Platform)

Job description

Key Responsibilities:

  • Build and maintain scalable data pipelines using AWS Glue, EMR, and Airflow
  • Manage data lake architecture, ensuring secure and compliant data lifecycle practices
  • Refactor and optimize PySpark jobs for performance and cost efficiency
  • Automate pipeline deployment via GitLab CI/CD
  • Collaborate with cross-functional teams to define and implement data integration strategies
  • Build internal tools, including an LLM-powered chatbot

Requirements:

  • Bachelor's degree in Computer Science, Data Engineering, or a related field
  • 2+ years of experience in data engineering, preferably in cloud environments
  • Strong proficiency in AWS services, PySpark, SQL, and Python
  • Experience with OpenMetadata, Airflow, GitLab CI/CD, and web scraping
  • Familiarity with LLMs, RAG architecture, and chatbot development is a plus
  • Excellent communication and collaboration skills

If you are interested in this opportunity, please send your CV to jennie.jiang@ambtech.com