<hello/> _
Agnieszka
Mikołajczyk-Bareła
Senior AI Engineer working on Reedy at Chaptr.AI — LLMs, RAG, and multimodal pipelines. Previously part of the NLP team at VoiceLab.AI that shipped TRURL, Poland's first large-scale generative model. Researcher, speaker, and co-organizer of AI-for-Good initiatives.
3 200+ citations across 30+ papers and 10+ open-source projects. Named among Top Women in AI Poland.
Skills
Projects
Research, open-source, and industry work
Schema-First Prompting [Claude Skill]
Modern prompt engineering, designed the way a human would — clean, minimal, elegant. An open-source skill for Claude and Cursor that encodes best practices for structured outputs with Pydantic models.
Model training
Trained models across computer vision (classification, detection, segmentation), NLP and generative text, and audio; with the VoiceLab team shipped TRURL 7B/13B (incl. 8-bit and academic), vlt5-base-keywords, herbert-base-cased-sentiment, and datasets on Hugging Face.
Data Augmentation
Curated review of augmentation techniques, libraries, and papers (1.6k+ stars), plus research code for Targeted Data Augmentation (TDA) and Counterfactual Bias Insertion (CBI) on dermoscopy and face data.
Biomedical imaging
Dermoscopy / melanoma bibliography (classical features through NAS, self-supervision, augmentation, bias) plus deep learning for other biomedical imaging—microbleeds, blood smear, erythrocytes—with links added over time.
Bias in Machine Learning
PhD thesis on arXiv (bias, XAI, skin-lesion case studies, style transfer / targeted augmentation / attribution feedback); IEEE Access survey with M. Grochowski; GEBI; MICCAI 2022 GAN debiasing in dermoscopy.
Waste detection
WiMLDS Trójmiasto AI4Good: PyTorch detection and segmentation (EfficientDet, DETR, Mask R-CNN, Faster R-CNN) on merged litter benchmarks, Waste Management paper, curated waste image datasets list (338+ stars), and co-organized Hack4Environment (waste & environmental literacy with DIH4.AI).
HearAI — Sign Language Recognition
Making the world more accessible for the Deaf community through deep learning-based sign language recognition.
Punctuation Restoration [PolEval]
Created the WikiPunct dataset and organized the first Polish punctuation restoration shared task at PolEval 2021.
Machine Learning Acronyms
Community-maintained reference of ML and AI acronyms and abbreviations.
Bird Song Classification
WiMLDS Trójmiasto project — sound-based bird species classification using deep learning on audio spectrograms.
Get in touch
Want to collaborate or just say hello?
When not training models, you'll find me with a book, in the kitchen, or being supervised by two cats.
LinkedIn →