Requirements
### Requirements
- Stack: Greenplum, Trino, Apache Airflow, ClickHouse, Python, Spark, SQL (dbt), S3, Hadoop.
- General understanding of Lakehouse technology stack.
- Understanding the differences between BigData/Lakehouse and regular-sized data.
- Knowledge of SQL (indexes, functions, optimization, performance profiling).
- Knowledge of programming languages (JAVA, Python).
- Experience with relational databases (Oracle, Postgres, MySQL, MsSQL, etc.).
- Ability to work with Git (knowledge of git pull/commit/push commands).
- Experience with DBT, Cosmos, Ni-Fi.
- Experience in Spark development.
- Skills in using Hadoop ecosystem components: Yarn, Ranger, Zookeeper, Hive metastore.
- Understanding of Trino features.
- Understanding of data formats Iceberg, Parquet, Avro.
- Understanding of working with minio or any other S3-based storage.
- Experience using project management and documentation systems.
- Experience in developing atypical integrations (including SAP systems).
- Experience in developing near-realtime streams (Flink, Debezium).
- Experience optimizing high-load streams (billions of incremental records) using Observability tools (Grafana, Victoria Metrics, Zabbix).