Publication Date

11-1-2021

Comments

Technical Report: UTEP-CS-21-101

Abstract

At present, the most efficient machine learning techniques are deep neural networks. In these networks, a signal repeatedly undergoes two types of transformations: a linear combination of inputs, followed by a non-linear transformation v -> s(v) applied to each value. Empirically, the function s(v) = max(v, 0) -- known as the rectified linear function -- works best. There are some partial explanations for this empirical success; however, none of them is fully convincing. In this paper, we analyze this why-question from the viewpoint of uncertainty propagation. We show that reasonable uncertainty-related arguments lead to another possible explanation of why rectified linear functions are so efficient.
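
The following is a minimal sketch, not from the report itself, of the two alternating transformations the abstract describes: a linear combination of inputs followed by the element-wise rectified linear function s(v) = max(v, 0). All names (relu, layer) and the weights are illustrative assumptions.

import numpy as np

def relu(v):
    # Rectified linear function s(v) = max(v, 0), applied element-wise.
    return np.maximum(v, 0.0)

def layer(x, W, b):
    # One deep-network layer: linear combination W @ x + b,
    # followed by the non-linear transformation v -> s(v).
    return relu(W @ x + b)

# Example: propagate a 3-dimensional signal through two hypothetical layers.
rng = np.random.default_rng(0)
x = rng.normal(size=3)
W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)
W2, b2 = rng.normal(size=(2, 4)), rng.normal(size=2)
print(layer(layer(x, W1, b1), W2, b2))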
