
Data Engineer (Apache Spark – Python/Scala)
- Luxembourg
- Permanent contract
- Full-time
- Build and optimize data pipelines with Apache Spark (Python and/or Scala)
- Process large-scale batch and streaming datasets
- Work with REST APIs to retrieve and integrate external data
- Collaborate with data scientists and engineers in Agile teams
- Ensure data quality, testing, and monitoring
- Contribute to CI/CD and automation best practices
- Organize and manage data in on-premises object storage
- Promote data governance awareness: data lineage, metadata, PII, data contracts…
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field
- 2 to 5 years of experience as a Data Engineer in Big Data environments
- Strong skills in Apache Spark (Python and/or Scala), SQL, and data integration
- Comfortable with Git, Airflow, and CI/CD pipelines
- Experience with REST APIs and object storage (S3/MinIO)
- Previous work in on-premises environments (not cloud-based) is appreciated
- Awareness of data governance topics: data lineage, metadata, PII, data contracts…
- Fluent in French and English (minimum B2 level)
- Proactive, detail-oriented, and a strong communicator