All vacancies

Senior Data Engineer

· senior
datadev GreenplumTrinoApache AirflowClickHousePythonSparkSQLS3HadoopJAVAGitDBT
4.5
AI Score
The vacancy has a solid tech stack but lacks compensation details and company information, affecting overall quality.
Job description
Looking for a Senior Data Engineer with extensive experience in data technologies and systems. Must have strong skills in SQL, Python, and various data processing tools.
Requirements
### Requirements - Stack: Greenplum, Trino, Apache Airflow, ClickHouse, Python, Spark, SQL (dbt), S3, Hadoop. - General understanding of Lakehouse technology stack. - Understanding the differences between BigData/Lakehouse and regular-sized data. - Knowledge of SQL (indexes, functions, optimization, performance profiling). - Knowledge of programming languages (JAVA, Python). - Experience with relational databases (Oracle, Postgres, MySQL, MsSQL, etc.). - Ability to work with Git (knowledge of git pull/commit/push commands). - Experience with DBT, Cosmos, Ni-Fi. - Experience in Spark development. - Skills in using Hadoop ecosystem components: Yarn, Ranger, Zookeeper, Hive metastore. - Understanding of Trino features. - Understanding of data formats Iceberg, Parquet, Avro. - Understanding of working with minio or any other S3-based storage. - Experience using project management and documentation systems. - Experience in developing atypical integrations (including SAP systems). - Experience in developing near-realtime streams (Flink, Debezium). - Experience optimizing high-load streams (billions of incremental records) using Observability tools (Grafana, Victoria Metrics, Zabbix).
Apply to this role