Iterative versus amortized inference solutions to the constellation problem

Hasanov, Farid

Files

Hasanov_MSc_computer_science_2022.pdf (4.54 MB)

Date

2022

Authors

Hasanov, Farid

Publisher

Tartu Ülikool

Abstract

Making sense of visual input is an essential part of human intelligence. While processing in the human visual cortex has been observed to be recurrent, machine vision systems that map input to prediction in a single feedforward pass have dominated computer vision benchmarks. This discrepancy may be explained by a lack of challenging datasets in which gradual refinement of a solution is necessary to reach the correct answer. Such a dataset, in which local information about the encoded objects has been erased, was recently proposed. This thesis is the first attempt to solve the problem posed by this dataset. For iterative inference, we propose using the generative models DCGAN and VAE together with the optimization algorithm CMA-ME to refine solutions; for feedforward, or amortized, inference, we use the generative models Pix2pix and CycleGAN. By solving the problem posed in this novel computer vision dataset, we show the advantage of iterative refinement of hypotheses over the single-prediction paradigm, encouraging further research in the field of iterative inference.

Keywords

Deep Learning, Computer Vision, Convolutional Neural Networks, Generative Modeling, Image processing

URI

https://hdl.handle.net/10062/91785

Collections

MTAT magistritööd – Master's theses
