Senior Data Engineer

Vladyslav K

Information

Available hours \ week
40 h/w
Seniority level
Senior
Years of experience
5 yrs.
Location
Ukraine
Nationality
Ukraine
Timezone
(GMT+02:00) Kyiv

Languages

English
Upper-Intermediate (B2)

About

Vladyslav is a Senior Data Engineer focused on building data pipelines and infrastructure for AI-driven products. Started as a Python developer, he moved deeper into data engineering through hands-on work with ETL/ELT systems, data lakes, and AWS cloud architecture. Most of his commercial experience has been in fintech and document processing. Vlad built the data infrastructure supporting generative AI stock analysis for global investors: AWS Glue and PySpark pipelines, a Delta Lake-based data lake on S3, and data models across PostgreSQL. Before that, he spent nearly three years at Esynergetics building OCR-powered document exchange systems, including leading an OCR team for invoice processing. Vladyslav is AWS Certified (Cloud Practitioner, 2022) and is comfortable across the full data engineering stack: Python with FastAPI and Django, Apache Spark/Pyspark, Delta Lake, various SQL and NoSQL databases, and container-based deployment with Docker.

Experience

BRIDGEWISE

Back-end developer, Data Engineer

About the Project

Bridgewise is a technology research company using proprietary AI and language models to deliver instant fundamental analyses of global stocks to investors and financial institutions. The platform integrates directly into brokerage infrastructures, generating bespoke investment strategies and multi-language stock insights for millions of investors. The data engineering work supported the platform's ability to ingest, transform, and serve massive volumes of financial data reliably, with the governance and performance required for a regulated fintech environment.

  • Fintech
  • AI
  • Machine Learning

Responsibilities

Designed and developed scalable system architectures for new services on the platform. Built and maintained ETL/ELT pipelines using AWS Glue and PySpark, processing large volumes of market and fundamental data. Architected a data lake on AWS S3 using Delta Lake for ACID reliability and data governance. Designed and optimized data models in PostgreSQL and DocumentDB for performance under production load. Handled deployment and ongoing back-end development. Worked across the full AWS stack including Athena, Glue, S3, Lambda, Step Functions, EC2, and ECS. Engagement lasted 12 months.

ESYNERGETICS

Back-end Developer, DevOps

About the Project

Esynergetics provides an AI-powered infrastructure for secure document exchange between customers and suppliers, converting PDF-based documents into machine-readable formats. The platform delivers structured JSON representations of pdfs and images, integrates into existing document management processes of enterprise clients, and provides analytics on top of the extracted data. This phase of the work focused on core architecture and services, with a DevOps angle covering deployment and infrastructure.

  • AI
  • Machine Learning
  • Management

Responsibilities

Designed and developed the core architecture, creating and implementing new services across the system. Handled deployment pipelines and ongoing back-end development. Took on knowledge sharing, mentorship, and code review responsibilities for the team. Worked on bug fixing across the codebase. The stack combined Python back-end (Django, FastAPI, aiohttp), OCR and data processing (OpenCV, Pandas, Tesseract), front-end and full-stack work (JavaScript, TypeScript, NodeJS, Angular), and AWS infrastructure (EC2, ECS, Route 53).

ESYNERGETICS

Back-end Developer, Team Lead

About the Project

Continuation of the Esynergetics platform, with a focus on the OCR invoice processing component. The team was responsible for extracting structured data from diverse invoice formats, which required continuous refinement of parsers, handling new document types, and integrating the OCR output into the broader data pipeline. This phase combined hands-on back-end development with team leadership responsibilities for the OCR group.

  • Management

Responsibilities

Led the OCR team, handling knowledge sharing, mentorship, and project coordination. Implemented and maintained parsers, modifying the parser repetition logic for better reliability. Managed the OCR invoice processing workflow, including integration of new invoice types into the existing extraction system. Handled database queries and optimisation, fixed Docker configuration issues, and resolved bugs across the stack. Engagement lasted 14 months. The work required balancing individual contribution with team leadership responsibilities.