ADEPT: advanced dense prediction

Semantic analysis of natural images at the pixel level

Project ADEPT studies methods for dense semantic analysis in large images of natural scenes. Our goal is to reduce obstacles towards exciting real-world applications such as autonomous cars, road safety inspection or automated warehouses. We are especially interested in the following problems:

Research results

Panoptic domain adaptation Ensembled detection of anomalous regions Defenses from data-poisoning attacks
martinovic24eccv delic24bmvc sabolic24bmvc
[martinovic24eccv] [delic24bmvc] [sabolic24bmvc]


Synthetic negative data Recognition of road-safety attributes Efficient panoptic segmentation
grcic24tpami kacan24tits saric23rs
[grcic24tpami] [kacan24tits] [saric23rs]


Detection of anomalous regions Construction of universal taxonomies Dense hybrid anomaly detection
grcic23cvprw bevandic22bmvc grcic22eccv
[grcic23cvprw] [bevandic22bmvc] [grcic22eccv]


Training with overlapping labels Dense semantic forecasting with F2MF Densely connected normalizing flows
bevandic24ijcv saric23tnnls grcic21neurips
[bevandic24ijcv] [saric23tnnls] [grcic21neurips]

Starting point

We start from convolutional models for dense prediction based on checkpointed DenseNet backbones, lightweight ladder-style upsampling [1] and pyramidal fusion [2]. We predict future semantic content in video based on feature-to-motion-and-feature forecasting [3]. We apply multi-domain loss expressed as negative log likelihood of aggregated probability [4]. We address open-set recognition through learning with noisy and artificial negative samples [5].

[1] ladder-DenseNets, T-ITS 2020.
[2] pyramidal fusion, CVPR 2019, PR 2021.
[3] F2MF forecasting, CVPR 2020.
[4] multi-domain training, NLL+ loss, Arxiv 2020.
[5] open-set recognition with noisy negatives, GCPR 2019.

Time frame

Start date: 1st February 2021.

Duration: 48 months

The project has been funded by the Croatian science foundation