Work experience
Hugging Face🤗
Machine Learning Research Engineer Intern
July 2025 - current
Improving data sourcing and training methods for coding agents
ETH Zurich
Research Assistant
May 2025 - July 2025
◆ Research Assistant at Secure, Reliable, and Intelligent Systems Lab (SRI)
◆ Improving multilingual training, alignment and safety for EEU languages
ETH Zurich
Research Assistant
Nov 2023 - May 2024
◆ Project Intern at Secure, Reliable, and Intelligent Systems Lab (SRI)
◆ Documenting capabilities and vulnerabilities of the state-of-the-art large language models
◆ Contributing to LVE (Language Model Vulnerabilities and Exposures) project
◆ SynthPAI: A Synthetic Dataset for Personal Attribute Inference: Semester project under supervision of Prof. Martin Vechev (co-supervised by Robin Staab and Mark Vero)
Fractal Analytics
Junior Data Scientist
Sep 2021 - Jul 2022
◆ Went through a 3-month internship with intensive training for statistics, machine learning techniques, data engineering and cloud (Azure)
◆ Executed an end-to-end Market Mix Modelling project for a particular segment of one of the world's biggest CPG companies, including methods research, EDA, developing a statistical model and fine-tuning it
Tech stack
Education
ETH Zurich
Statistics MSc
2022-current
⭐ Main courses: Natural Language Processing, Large Language Models, Interactive Machine Learning: Visualization & Explainability, Probabilistic AI, Big Data for Engineers, AI4Good.
⭐ Master thesis "Enhancing Mid-Resource Language Performance in Large Language Models": end-to-end pipeline recipe for efficient bilingual LLM training and alignment (under supervision of prof. Vechev).
⭐ Extracurricular activities:
◆ Statistics representative at Seminar fur Statistik (SfS): organizing and leading events for students of Statistics MSc program
◆ Statistics MSc mentor: mentoring incoming first-year students of the program
◆ Member of VMP (student organization of D-MATH ETH department)
National Technical University of Ukraine
Economic Cybernetics MSc
2021-2022
⭐ Master thesis: 'Modeling the investment portfolio of E-commerce companies':
◆ Twitter sentiment analysis of E-commerce stock tickers
◆ Stock prediction using Generative Adversarial Networks (GAN)
◆ Investment portfolio modeling (option hedge fund, stock portfolio prediction)
National Technical University of Ukraine
Economic Cybernetics BSc
2017-2021
⭐ Grade: 94/100
◆ Graduated with honors
◆ Bachelor thesis: 'Modeling an equity investment fund using financial derivative management and hedging strategies'
Projects and publications
Jupyter Agent
Hugging Face 🤗
Multi-step pipeline to generate synthetic Jupyter notebooks with custom scaffolding to finetune efficient data science coding agents.

MamayLM v0.1
Ukrainian/English
An efficient bilingual LLM with cutting-edge performance in Ukrainian and English.

SynthPAI: A Synthetic Dataset for Private Attribute Inference
NeurIPS D&B 2024
LLM generated collection of synthetic texts to ensure privacy-preserving research in area of private attribute inference benchmarking of Large Language Models.

LVE Project
Python
An open-source repository of Language Model Vulnerabilities and Exposures (LVEs).

Urban Planning project
Python, Typescript, HTML, CSS
A project for ETH Zurich course 'Interactive Machine Learning: Visualization & Explainability', spring semester 2023.
