Cloud Data Engineer with 7 years of experience developing and optimizing ETL/ELT workflows for efficient data processing. Specializes in cost-effective data transformation solutions using AWS, Kafka, Cloudera, Iceberg, and NiFi. Adept at using ETL tools to turn large-scale data transitions into structured, manageable projects with optimized budgets and timelines. Well-versed in Big Data technologies, including Hadoop, Airflow, and Docker.
ETL streaming and batch processes
Cloud (AWS, Azure)
SQL (CTE, Stored procedures, Query tuning, Indexes, Partitioning)
Kafka
Hadoop, Impala, Hive, Oracle, PostgreSQL, MS SQL
Iceberg
NiFi
Python 3 (pandas, SQLAlchemy, NumPy, Jupyter)
Docker, Kubernetes
SAS Products (SAS DIS, SAS EG, SAS VA, etc.)
Apache Airflow (DAG Development, Installation, Administration)
MLflow (Development, Installation, Administration)
DWH Architecture
Git, SVN
Bash
Linux, Windows
Jira, Confluence