The vacancy is well-defined in tasks and requirements but lacks compensation details and comprehensive company information.
no salary info
Job description
Join Wildberries as a Data Engineer to develop solutions for data collection, validation, and preprocessing for ML models.
Responsibilities
### Responsibilities
- Develop solutions for data collection, validation, and preprocessing for model training;
- Design dataset structures for detection, segmentation, and 3D reconstruction tasks;
- Scale our solutions for large volumes of data;
- Prepare data, including preprocessing and post-processing (images, trajectories, point clouds, 3D models);
- Develop tools for automatic and semi-automatic labeling;
- Create and version datasets and develop instructions for labeling teams.
Requirements
### Requirements
- Proficient knowledge of Python, pandas, numpy;
- Understanding of the principles of building quality datasets;
- Knowledge of data augmentation methods;
- Experience working with large volumes of images and point clouds;
- Basic understanding of ML model quality metrics to assess data impact.
About Wildberries
Wildberries is Russia's largest online marketplace and e-commerce platform, offering a wide range of products through its website and app, processing over 15 million orders daily across more than 100 warehouses and sorting centers in seven countries. It provides various career opportunities including tech roles like developers and product managers, as well as logistics and service positions.