
Property Finder
Junior Data Engineer
- Permanent
- Dubai, United Arab Emirates
- Experience 5 - 10 yrs
Job expiry date: 20/03/2026
Job overview
Date posted
03/02/2026
Location
Dubai, United Arab Emirates
Salary
Undisclosed
Compensation
Salary only
Experience
5 - 10 yrs
Seniority
Experienced
Qualification
Bachelors degree
Expiration date
20/03/2026
Job description
The Junior Data Engineer role at Property Finder is focused on building reliable, scalable data pipelines that power analytics, AI/ML, and emerging Generative AI use cases across the organization. The role contributes to the core data ecosystem by developing and maintaining batch and streaming pipelines with a strong emphasis on reliability, performance, and cost efficiency. The position involves developing SQL, Python, and Spark/PySpark transformations, supporting analytics, reporting, machine learning workloads, and integrations with internal and external systems. The role also supports GenAI initiatives through building data flows for embedding generation, vector pipelines, and data preparation for LLM training and inference, while collaborating closely with Data Science, Analytics, Product, and Engineering teams. The Junior Data Engineer upholds data quality, testing, documentation, and governance standards, supports deployments and operational stability, and contributes to scalable data platform components that enhance data availability and performance for intelligent, data-driven products within Property Finderās real estate technology ecosystem in the MENA region.
Required skills
Key responsibilities
- Build and maintain batch and streaming data pipelines with strong focus on reliability, performance, and efficient cost usage
- Develop SQL, Python, and Spark/PySpark transformations to support analytics, reporting, and machine learning workloads
- Contribute to data model design and ensure datasets meet high standards of data quality, structure, and governance
- Support integrations with internal and external systems to ensure accuracy and resilience of data flows
- Build and maintain data flows supporting Generative AI workloads such as embedding generation, vector pipelines, and LLM training and inference datasets
- Collaborate with ML and GenAI teams to enable high-quality training, inference, enrichment workflows, and retrieval pipelines
- Work with Data Science, Analytics, Product, and Engineering teams to translate data requirements into reliable data solutions
- Participate in design reviews, uphold data quality, testing, and documentation standards, and support deployments and troubleshooting of owned pipelines
- Demonstrate ownership of data platform components and contribute to team learning through code reviews, documentation, and pairing
Experience & skills
- Demonstrate 5+ years of experience working as a Data Engineer
- Apply strong SQL and Python skills with a solid understanding of Spark and PySpark
- Show experience building and maintaining production-grade data pipelines
- Demonstrate practical experience with cloud-based data warehouses and data lake architectures
- Apply experience with AWS data services including Glue, Athena, Kinesis, Lambda, and S3
- Demonstrate familiarity with orchestration tools such as Dagster, Airflow, or Step Functions
- Apply solid understanding of data modeling principles and data quality best practices
- Show experience with CI/CD pipelines or automation for data workflows
- Demonstrate exposure to or willingness to learn Generative AI workflows including embeddings, vector stores, enrichment pipelines, and retrieval workflows