Job Summary
We are seeking a Lead / Senior Data Engineer to design, build, and optimize our core data infrastructure. In this role, you will bridge the gap between raw data and production-ready environments, ensuring our analytics systems are scalable, reliable, and high-performing. You will collaborate closely with Data Scientists to support advanced modeling and lead data engineering initiatives.
Key Responsibilities
Data Pipelines: Build and orchestrate automated ETL/ELT pipelines for batch and real-time streaming data.
Architecture: Design and maintain scalable data lake and data warehouse structures.
Data Modeling Support: Partner with Data Scientists to optimize data pipelines, feature stores, and datasets required for statistical modeling.
Performance: Tune and optimize complex SQL queries and data workflows for large-scale datasets.
Leadership: Establish technical best practices, ensure data quality, and mentor junior engineers.
Requirements
Experience: Minimum 4+ years in Data Engineering, Big Data, or Data Architecture.
Education: Degree in Computer Science, Software Engineering, or a related quantitative field.
Technical Proficiency:
Languages & Core Libraries: Mastery of Python (including advanced Pandas and NumPy for data manipulation) and expert-level SQL.
Data Modeling & ML Support: Strong familiarity with preparing data for statistical modeling and predictive frameworks (e.g., XGBoost, Regression, Decision Trees).
Infrastructure & Orchestration: Experience with modern data stack tools (e.g., Spark, Airflow, dbt) and cloud data platforms (AWS, GCP, or Azure).
Benefits & Work Environment
Flexible Hybrid Arrangement: Balance of remote work autonomy and in-office collaboration.
Professional Growth: Lead high-impact architectural decisions alongside a talented AI and analytics team.
Competitive Compensation: Monthly salary up to RM 15,000, commensurate with experience and technical assessment.