The vacancy is well-structured with clear responsibilities and requirements, but could improve on compensation details and company information.
Job description
At Preply, the Data ingestion and enrichment team provides a single, trusted, and scalable data foundation. The team ensures that all analytics, machine learning, and product features are built on unified, governed, and production-grade data assets in Preply’s Lake House.
Responsibilities
- Design, build, and own Preply’s data lake.
- Develop and operate scalable, reliable batch and streaming ingestion pipelines.
- Define and implement data contracts between producers and consumers.
- Build enrichment logic that joins, standardizes, and contextualizes data.
- Instrument ingestion pipelines with strong observability.
- Apply consistent access control, classification, and privacy protections.
Requirements
- Experience building architectural patterns of a large, high-scale application.
- Solid experience working in platform or data engineering teams.
- Familiarity with cloud platforms (AWS/GCP or equivalent).
- Hands-on experience with real-time and batch data processing infrastructures.
- Expertise with orchestration tools.
About Preply
Preply is an online language learning platform that connects learners with expert tutors for personalized, one-on-one lessons in 90+ languages, supported by AI tools and a proprietary curriculum. It serves individual learners across 180+ countries and offers Preply Business for corporate language training. The platform features over 100,000 tutors and operates as a leading online tutoring marketplace.
EdTech· 1000+· Brookline, Massachusetts, United States· Founded 2012· https://preply.com