A Novel Mathematical Framework for the Analysis of Neural Networks

Author : Anthony L. Caterini
Total Pages : 89
Release : 2017
OCLC : 1007217942

Book Synopsis: A Novel Mathematical Framework for the Analysis of Neural Networks by Anthony L. Caterini

A Novel Mathematical Framework for the Analysis of Neural Networks was written by Anthony L. Caterini and released in 2017, with 89 pages in total. Book excerpt:

Over the past decade, Deep Neural Networks (DNNs) have become very popular models for processing large amounts of data because of their successful application in a wide variety of fields. These models are layered, often containing parametrized linear and non-linear transformations at each layer in the network. At this point, however, we do not rigorously understand why DNNs are so effective. In this thesis, we explore one way to approach this problem: we develop a generic mathematical framework for representing neural networks, and demonstrate how this framework can be used to represent specific neural network architectures.

In chapter 1, we start by exploring mathematical contributions to neural networks. We can rigorously explain some properties of DNNs, but these results fail to fully describe the mechanics of a generic neural network. We also note that most approaches to describing neural networks rely upon breaking down the parameters and inputs into scalars, as opposed to referencing their underlying vector spaces, which makes their analysis awkward. Our framework operates strictly over these spaces, affording a more natural description of DNNs once the mathematical objects that we use are well-defined and understood.

We then develop the generic framework in chapter 3. We are able to describe an algorithm for calculating one step of gradient descent directly over the inner product space in which the parameters are defined. We can also represent the error backpropagation step in a concise and compact form. Besides a standard squared loss or cross-entropy loss, we demonstrate that our framework, including gradient calculation, extends to a more complex loss function involving the first derivative of the network.

After developing the generic framework, we apply it to three specific network examples in chapter 4. We start with the Multilayer Perceptron, the simplest type of DNN, and show how to generate a gradient descent step for it. We then represent the Convolutional Neural Network (CNN), which contains more complicated input spaces, parameter spaces, and transformations at each layer. The CNN, however, still fits into the generic framework. The last structure that we consider is the Deep Auto-Encoder, which has parameters that are not completely independent at each layer. We are able to extend the generic framework to handle this case as well.

In chapter 5, we use some of the results from the previous chapters to develop a framework for Recurrent Neural Networks (RNNs), the sequence-parsing DNN architecture. The parameters are shared across all layers of the network, and thus we require some additional machinery to describe RNNs. We describe a generic RNN first, and then the specific case of the vanilla RNN. We again compute gradients directly over inner product spaces.
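
The excerpt's idea of taking a gradient descent step over the parameter spaces themselves, rather than scalar by scalar, can be illustrated with a minimal NumPy sketch for a Multilayer Perceptron. This is not the thesis's notation or algorithm: the function names (`mlp_forward`, `squared_loss`, `gradient_step`), the tanh activation on every layer, and the learning rate are all assumptions made for illustration. Each gradient is built as a whole matrix or vector with the same shape as its parameter, and the error is sent backwards through the transpose (adjoint) of each weight matrix.

```python
import numpy as np

def mlp_forward(params, x):
    """Forward pass; returns the activations of every layer, input included."""
    activations = [x]
    for W, b in params:
        x = np.tanh(W @ x + b)  # assumed activation: tanh on every layer
        activations.append(x)
    return activations

def squared_loss(params, x, y):
    """Squared loss 0.5 * ||f(x) - y||^2 for a single input/target pair."""
    return 0.5 * np.sum((mlp_forward(params, x)[-1] - y) ** 2)

def gradient_step(params, x, y, lr=0.1):
    """One gradient descent step; each gradient has the shape of its parameter."""
    acts = mlp_forward(params, x)
    err = acts[-1] - y  # derivative of the squared loss w.r.t. the network output
    new_params = [None] * len(params)
    for i in reversed(range(len(params))):
        W, b = params[i]
        delta = err * (1.0 - acts[i + 1] ** 2)  # tanh'(z) = 1 - tanh(z)^2
        grad_W = np.outer(delta, acts[i])       # gradient as a matrix, not per scalar
        grad_b = delta
        err = W.T @ delta  # the adjoint of the linear map W carries the error back
        new_params[i] = (W - lr * grad_W, b - lr * grad_b)
    return new_params
```

Calling `gradient_step(params, x, y)` with `params` given as a list of `(W, b)` pairs returns an updated list of the same structure, so repeated calls perform repeated descent steps on a single training pair.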


Related Books

A Novel Mathematical Framework for the Analysis of Neural Networks
Language: en
Pages: 89
Authors: Anthony L. Caterini
Categories: Convolutions (Mathematics)
Type: BOOK - Published: 2017

Over the past decade, Deep Neural Networks (DNNs) have become very popular models for processing large amounts of data because of their successful application ...

Deep Neural Networks in a Mathematical Framework
Language: en
Pages: 95
Authors: Anthony L. Caterini
Categories: Computers
Type: BOOK - Published: 2018-03-22 - Publisher: Springer

This SpringerBrief describes how to build a rigorous end-to-end mathematical framework for deep neural networks. The authors provide tools to represent and ...

Mathematical Methods for Neural Network Analysis and Design
Language: en
Pages: 452
Authors: Richard M. Golden
Categories: Computers
Type: BOOK - Published: 1996 - Publisher: MIT Press

For convenience, many of the proofs of the key theorems have been rewritten so that the entire book uses a relatively uniform notation.

Mathematical Approaches to Neural Networks
Language: en
Pages: 391
Authors: J.G. Taylor
Categories: Computers
Type: BOOK - Published: 1993-10-27 - Publisher: Elsevier

The subject of Neural Networks is being seen to be coming of age, after its initial inception 50 years ago in the seminal work of McCulloch and Pitts. It is ...

Neural Networks and Statistical Learning
Language: en
Pages: 988
Authors: Ke-Lin Du
Categories: Mathematics
Type: BOOK - Published: 2019-09-12 - Publisher: Springer Nature

This book provides a broad yet detailed introduction to neural networks and machine learning in a statistical framework. A single, comprehensive resource for ...