All vacancies
deeplay

Data Engineer

deeplay · remote · middle · full-time
devtechai JavaPostgreSQLClickHousegRPCPythonApache AirflowDockerAnsibleTerraform
6.0
AI Score
The vacancy is well-defined in terms of tasks and requirements but lacks compensation details and company links.
no salary info
Job description
We are a company that values equal communication, supports each other, and encourages feedback and initiative. We strive for technological leadership in the industry.
Responsibilities
• Support and develop cross-platform desktop software (Java 17+, Swing) • Maintain monolithic software, including client and server parts (using gRPC) • Organize interaction with databases (PostgreSQL, ClickHouse) via JDBC • Implement multithreading mechanisms using ExecutorService, CompletableFuture • Cover code with tests (Mockito, JUnit) • Handle analytical queries and large data volumes (~60 TB)
Requirements
### Main Requirements • Deep knowledge of PostgreSQL system catalogs, statistics, indexes, partitioning, inheritance, foreign tables, EXPLAIN ANALYZE, cost model, pg_stat_statements, pg_stat_activity, pg_locks, index strategies, lock monitoring • SQL: complex queries with CTE, window functions, lateral joins, dynamic SQL, stored procedures, triggers, functions, query rewriting, understanding execution plans ### Additional Requirements • 3+ years of experience in a similar role • Knowledge of relational database theory and understanding of logical and physical database models • Experience optimizing complex queries • Experience troubleshooting high-load systems • Experience creating and maintaining ETL processes • Strong practical experience in Python (pandas, numpy) • Experience with Apache Airflow, creating DAGs independently • Ability to take tasks independently, clarify requirements as needed, and deliver results • Skill in finding optimal solutions: ability to work in uncertainty, formulate questions, and propose solutions ### Nice to Have • Bash scripting for automating routine operations • Linux command line (confident level), systemd, cron, bash scripting • Understanding of alerting strategies • Experience with ClickHouse • Basic understanding of Docker • Basic understanding of Ansible/Terraform for automation • Python/Pandas for complex analytics • Jupyter notebooks for exploratory analysis • pgBadger for detailed analysis • Database monitoring: analyzing connection pools (PgBouncer), performance troubleshooting, analyzing wait events
Conditions
• Flexible work format and location — choose any place on the map or one of our comfortable offices in Omsk, Novosibirsk, and St. Petersburg • Corporate culture: we communicate as equals, support each other, value feedback and initiative, and maintain a light-hearted atmosphere • Opportunity to influence processes: if you see potential for improvement, you can implement your ideas to enhance efficiency and product quality • Become part of a team that finds drive in its tasks and strives for technological leadership in the industry • Accessibility of company management and an open internal policy
About deeplay
Deeplay is a software development company specializing in AI-powered, geo-distributed, real-time applications, leveraging machine learning, Big Data, and high-performance computing to provide high-availability solutions for clients. They operate an international team of developers, designers, and solution architects.
SaaS · 101-200 · Saint Petersburg, Russian Federation · https://max.ru/devs_it]
Apply to this role