👋

Hi, I'm Hanna

Get to know me

@ayukh

📍Zurich, Switzerland

agents @ 🤗
statistics msc @ eth zurich

Work experience

Hugging Face🤗

Machine Learning Research Engineer Intern

July 2025 - current

Improving data sourcing and training methods for coding agents

ETH Zurich

Research Assistant

May 2025 - July 2025

◆ Research Assistant at Secure, Reliable, and Intelligent Systems Lab (SRI)
◆ Improving multilingual training, alignment and safety for EEU languages

ETH Zurich

Research Assistant

Nov 2023 - May 2024

◆ Project Intern at Secure, Reliable, and Intelligent Systems Lab (SRI)
◆ Documenting capabilities and vulnerabilities of state-of-the-art large language models
◆ Contributing to the LVE (Language Model Vulnerabilities and Exposures) project
◆ SynthPAI: A Synthetic Dataset for Personal Attribute Inference: semester project under the supervision of Prof. Martin Vechev (co-supervised by Robin Staab and Mark Vero)

Fractal Analytics

Junior Data Scientist

Sep 2021 - Jul 2022

◆ Completed a 3-month internship with intensive training in statistics, machine learning techniques, data engineering and cloud (Azure)
◆ Executed an end-to-end Market Mix Modelling project for a particular segment of one of the world's biggest CPG companies, covering methods research, EDA, and developing and fine-tuning a statistical model

Tech stack

Python

R

C++

HTML

TypeScript

React

SQL

PyTorch/XLA

TensorFlow

Keras

AWS

GCP

Education

ETH Zurich

Statistics MSc

2022-current

⭐ Main courses: Natural Language Processing, Large Language Models, Interactive Machine Learning: Visualization & Explainability, Probabilistic AI, Big Data for Engineers, AI4Good.
⭐ Master's thesis "Enhancing Mid-Resource Language Performance in Large Language Models": an end-to-end pipeline recipe for efficient bilingual LLM training and alignment (under the supervision of Prof. Vechev).
⭐ Extracurricular activities:
◆ Statistics representative at Seminar für Statistik (SfS): organizing and leading events for students of the Statistics MSc program
◆ Statistics MSc mentor: mentoring incoming first-year students of the program
◆ Member of VMP (student organization of the D-MATH department at ETH)

National Technical University of Ukraine

Economic Cybernetics MSc

2021-2022

⭐ Master's thesis: 'Modeling the investment portfolio of E-commerce companies':
◆ Twitter sentiment analysis of E-commerce stock tickers
◆ Stock prediction using Generative Adversarial Networks (GAN)
◆ Investment portfolio modeling (option hedge fund, stock portfolio prediction)

National Technical University of Ukraine

Economic Cybernetics BSc

2017-2021

⭐ Grade: 94/100
◆ Graduated with honors
◆ Bachelor's thesis: 'Modeling an equity investment fund using financial derivative management and hedging strategies'

Projects and publications

Jupyter Agent

Hugging Face 🤗

A multi-step pipeline that generates synthetic Jupyter notebooks with custom scaffolding to fine-tune efficient data science coding agents.

MamayLM v0.1

Ukrainian/English

An efficient bilingual LLM with cutting-edge performance in Ukrainian and English.

SynthPAI: A Synthetic Dataset for Personal Attribute Inference

NeurIPS D&B 2024

An LLM-generated collection of synthetic texts that enables privacy-preserving research on benchmarking personal attribute inference in large language models.

LVE Project

Python

An open-source repository of Language Model Vulnerabilities and Exposures (LVEs).

Urban Planning project

Python, TypeScript, HTML, CSS

A project for ETH Zurich course 'Interactive Machine Learning: Visualization & Explainability', spring semester 2023.

© 2025 Hanna Yukhymenko.