Műegyetemi Digitális Archívum

Uncertainty approximation in neural networks using parameter-space proximity regularization

Date

Type

könyvfejezet

Language

en

Reading access rights:

Open access

Conference Date

February 1-2, 2021

Conference Place

Online

Conference Title

28th PhD Minisymposium of the Department of Measurement and Information Systems

ISBN, e-ISBN

978-963-421-845-6

Container Title

Proceedings of the 28th PhD Minisymposium of the Department of Measurement and Information Systems

Department

Department of Measurement and Information Systems

Version

Kiadói változat

Faculty

Faculty of Electrical Engineering and Informatics

First Page

8

Subject (OSZKAR)

uncertainty
Bayesian neural network
Bayesian approach
approximation methods

Gender

Konferenciacikk

University

Budapest University of Technology and Economics

OOC works

Abstract

A common question regarding the application of neural networks is whether the predictions of the model are reliable, in other words, what is the degree of uncertainty of our model. Generalizing a neural network into a Bayesian neural network is a frequent choice to quantify uncertainty. This means the extension of scalar weights and biases of the network to random variables. In terms of implementation, there are two main approaches: (1) the assumption of an unknown distribution for the random variables (i.e. weights) which are sampled and then utilized to infer the distribution of the output; (2) the assumption of a prior distribution for the random variables, and search for the output in the form of a similar posterior distribution. A common drawback of both approaches is that their scalability is limited, and generally require more computational resources than a simple neural network. In this paper, we introduce a new parameter sampling method, which maximizes the pairwise distance in parameter space between the models in the ensemble, therefore ensuring model diversity. Results indicate that this method generally needs less samples from the parameter space to be effective, thus it surpasses the other investigated approximation methods in terms of scalability. Finally, we apply our method to a real-life problem related to semantic segmentation of objects relevant in urban driving environments.

Description

Keywords