Summary
Overview
Work History
Education
Skills
Accomplishments
Languages
Timeline
Generic

Daniil Cheltsou

Wroclaw

Summary

Experienced Data Engineer with a strong background in Spark, Scala, Azure Data Factory (ADF), and Databricks. Skilled in designing, building, and optimizing large-scale data pipelines and ETL processes. Passionate about data processing, cloud technologies, and performance optimization to ensure efficient data workflows.

Overview

5
5
years of professional experience

Work History

Data Engineer

EPAM Systems
09.2023 - Current
  • Worked with IoT device messaging for smart home applications (e.g., thermostats, water leak detectors, smoke detectors), ensuring efficient data ingestion, processing, and storage.
  • Designed, implemented, and maintained ADF pipelines using Databricks jobs to handle large-scale data processing.
  • Developed and optimized Spark jobs in Scala to ingest, transform, and productionalize data efficiently.
  • Collaborated within a Scrum team alongside developers, QA engineers, business analysts, and solution architects.
  • Optimized existing data pipelines, significantly improving execution time and reducing costs.
  • Tech stack: Spark, Scala, Azure (ADLS Gen2, ADF, EventHub, Cosmos DB), Databricks.

Software Engineer

EPAM Systems
10.2020 - 09.2023
  • Developed and maintained backend solutions in Java using Spring, Hibernate.
  • Participated in Agile development processes, including sprint planning, grooming, reviews, and retrospectives.
  • Designed and reviewed Middle-Level Designs (MLD) from customer architects, proposing optimized solutions.
  • Debugged and troubleshot software issues, improving system stability and performance.
  • Implemented and tested APIs, using tools like Postman for validation and JUnit 5 & Mockito for unit testing.
  • Performance tuning: Analyzed response times and CPU performance using Dynatrace to identify and resolve bottlenecks.
  • Continuous Integration (CI): Configured and managed Jenkins jobs for building, testing, and deploying applications.

Junior Software Engineer

Resliv
08.2020 - 10.2021
  • Reviewed code and debugged errors to improve performance
  • Troubleshot various software issues by using debugging processes and coding techniques
  • Researched new technologies and software packages used in web application (Apache Camel, Spring Boot, Hibernate, Mockito)

Education

Java Developer - Java Web Development

EPAM Systems
07.2020

Skills

  • Sql
  • Spark
  • Scala
  • BigData
  • Azure Data Stack
  • Databricks
  • Java
  • Git

Accomplishments

    Optimized existed pipelines to significantly reduce execution time and costs as a result.

  • Improved spark daily obfuscation job aimed to prepare and provide prod data to lower environments by fully rewriting legacy code and reconsidering core logic.
  • Found performance issue on one of product pipelines by debugging and investigation spark UI and resolved it by removing extra data caching. As a result execution time was reduced from 22+h to 2h.
  • Implemented common logic for building timeseries tables based on source dataset and timeframe descriptor via Dataset API and successfully allied it on 20+ tables.

    Implemented common logic for building linear regression model based on source dataset and interests descriptor via Dataset API.

Languages

English
Upper intermediate (B2)

Timeline

Data Engineer

EPAM Systems
09.2023 - Current

Software Engineer

EPAM Systems
10.2020 - 09.2023

Junior Software Engineer

Resliv
08.2020 - 10.2021

Java Developer - Java Web Development

EPAM Systems
Daniil Cheltsou