2021

Waste detection

Detect & classify litter in the wild — merged benchmarks, open models, a living dataset index, and community hackathons in Pomerania.

Computer Vision AI4Good Deep Learning Datasets Hackathon Open Source

Prior waste datasets were often small, class‑imbalanced, or tied to one domain (e.g. aerial only). The detect‑waste benchmark unifies detection boxes into one “litter” taxonomy across tens of thousands of images—indoor, outdoor urban/natural, and underwater scenes, varied lighting and camera gear—so models generalize beyond a single collection.

Classify‑waste

A parallel classification benchmark aggregates public datasets into eight labels aligned with Gdańsk recycling rules—bio, glass, metals & plastics, paper, non‑recyclable, other (construction, e‑waste, hazardous), highly degraded “unknown” waste, and a true background class—reflecting messy real‑world streams rather than tidy lab subsets.

Community programme

A months‑long WiMLDS educational track brought Tri‑City participants together with mentors from industry and research to train modern detectors on shared baselines, publish code, and release the extended benchmarks that later supported the Waste Management journal article.

Waste datasets review

A separate, community‑indexed GitHub list catalogs image datasets for litter, garbage, and waste detection and classification—useful when you are comparing domains, licenses, and label granularity before training or benchmarking.

Hack4Environment

Co‑organized with WiMLDS Trójmiasto and the Digital Innovation Hub DIH4.AI: a hackathon on waste, environment monitoring, and AI literacy—hands‑on activities across age groups, from environmental storytelling to monitoring ideas, in the same AI4Good thread as Detect Waste.

Plastic waste and population pressure make civic awareness as important as model accuracy; the event aimed to shape attitudes toward land care alongside technical prototypes.

Original write-ups

Project posts from the original Detect Waste site where Agnieszka Mikołajczyk is listed as author or co-author. Links go to the archived source in GitHub, with the original public URL kept when available.

2020-11-20

Introduction

By Maria Ferlin, Agnieszka Mikołajczyk

How the Detect Waste story began: motivation, recycling rules, waste categories, and the TACO dataset.

2020-12-08

Data analysis

By Maria Ferlin, Agnieszka Mikołajczyk

Exploratory analysis of extended TACO annotations, category mapping, bounding boxes, and dataset imbalance.

2021-02-09

Comparing waste detection approaches on extended TACO dataset

By Sylwia Majchrowska, Agnieszka Mikołajczyk

EfficientDet results on 7-class and one-class waste detection, plus the motivation for separating detection and classification.

2021-03-05

Pseudo-labeling as a boost in waste classification task

By Sylwia Majchrowska, Agnieszka Mikołajczyk

Using pseudo-labeling and OpenLitterMap data to expand a waste classifier beyond the labeled dataset.

2021-05-01

Summary of the project

By Agnieszka Mikołajczyk

A wrap-up of the 5-month non-profit project, roles, outputs, arXiv paper, repository, and blog series.

← All projects