Experienced Data Engineer with a strong background in Spark, Scala, Azure Data Factory (ADF), and Databricks. Skilled in designing, building, and optimizing large-scale data pipelines and ETL processes. Passionate about data processing, cloud technologies, and performance optimization to ensure efficient data workflows.
Optimized existing pipelines, significantly reducing execution time and costs.
Implemented common logic for building time-series tables from a source dataset and a timeframe descriptor via the Dataset API, and successfully applied it to 20+ tables.
Implemented common logic for building a linear regression model from a source dataset and an interests descriptor via the Dataset API.