The role of a Data Engineer is to design, build, and maintain robust data pipelines and systems that support our analytics, reporting, and machine learning workflows.
Responsibilities and Duties
Design, develop, and maintain scalable data pipelines and ETL processes across multiple data sources
Build and optimize data warehouses and data lakes to support business intelligence and advanced analytics
Develop and maintain data architecture documentation, data dictionaries, and metadata repositories
Implement data quality and governance frameworks, including monitoring, validation, and lineage tracking
Ensure adherence to data security, compliance, and privacy standards
Ensure data integrity, accuracy, and consistency across systems and reports
Qualifications and Experience
Bachelor’s degree in Computer Science, Information Systems, Engineering, Statistics, Actuarial Science, Mathematics, or a related field
At least 2 years of experience in a data-related role
Proficiency in SQL and experience with relational databases (e.g., SQL Server, PostgreSQL, MySQL)
Experience in healthcare is advantageous
Experience with version control systems (e.g., Git), CI/CD pipelines, and containerization (Docker/Kubernetes) is advantageous
Strong programming skills in Python
Experience working with visualization tools such as Power BI or Tableau
Familiarity with Apache Spark and Apache Airflow
Experience applying open-source tools in on-premises environments is mandatory
Experience with cloud data platforms such as Azure, Databricks, AWS, or Google Cloud is advantageous