




vector activation vs scalar activation
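The distinction can be sketched in NumPy: a scalar activation such as sigmoid acts elementwise (each output depends only on its own input), while a vector activation such as softmax couples all outputs through its normalizing sum. A minimal sketch; the function names are just for illustration:

```python
import numpy as np

def sigmoid(z):
    # scalar activation: applied elementwise, output i
    # depends only on input i
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    # vector activation: every output depends on the whole
    # input vector through the normalizing sum
    e = np.exp(z - np.max(z))   # shift for numerical stability
    return e / e.sum()

z = np.array([1.0, 2.0, 3.0])
s = softmax(z)   # components sum to 1
g = sigmoid(z)   # each component computed independently
```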







sigmoid output -> probability of the positive class
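A tiny illustration of reading the sigmoid output as a class probability; the example logit and the 0.5 decision threshold are assumptions, not from the notes:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# raw network output (logit) for one example -- hypothetical value
logit = 0.8
p = sigmoid(logit)        # read as P(class = 1 | input)
pred = int(p >= 0.5)      # hard decision: threshold at 0.5
```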



how to define the error???

first choice: squared Euclidean distance
L2 divergence -> differentiation is simple: dDiv/dy_i = y_i - d_i
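The differentiation step can be made concrete. Assuming Div(y, d) = 1/2 * ||y - d||^2, the gradient with respect to each y_i is just the residual y_i - d_i:

```python
import numpy as np

def l2_divergence(y, d):
    # Div(y, d) = 1/2 * ||y - d||^2
    return 0.5 * np.sum((y - d) ** 2)

def l2_divergence_grad(y, d):
    # d Div / d y_i = y_i - d_i: just the residual
    return y - d

y = np.array([0.7, 0.2, 0.1])   # network output
d = np.array([1.0, 0.0, 0.0])   # one-hot target
g = l2_divergence_grad(y, d)
# g[0] < 0, so y[0] should increase to reduce the divergence
```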


gradient<0 => y_i should increase to reduce the div

the smoothed targets are arithmetically "wrong", but label smoothing helps gradient descent!
avoids overshooting toward the hard 0/1 targets
https://leimao.github.io/blog/Label-Smoothing/
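A minimal sketch of the smoothing rule discussed in the linked post, assuming the standard form y_ls = (1 - eps) * y + eps / K for K classes:

```python
import numpy as np

def smooth_labels(onehot, eps=0.1):
    # soften the hard 0/1 targets:
    # 1 -> 1 - eps + eps/K,   0 -> eps/K   (K = number of classes)
    K = onehot.shape[-1]
    return onehot * (1.0 - eps) + eps / K

d = np.array([0.0, 0.0, 1.0])
d_smooth = smooth_labels(d)   # still sums to 1, but no exact 0s or 1s
```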

it's a heuristic









forward NN
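The forward pass over fully connected layers might look like this; a sketch only, with sigmoid as the assumed activation and the cached activations kept for the backward pass:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    # push the input through each affine layer + activation,
    # caching every layer's output for the backward pass
    ys = [x]
    for W, b in zip(weights, biases):
        z = W @ ys[-1] + b        # affine combination
        ys.append(sigmoid(z))     # elementwise (scalar) activation
    return ys
```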


backward NN
(1) trivial: grad of output

(2) grad of the final activation layer

(3) grad of the last group of weights


(4) grad of the second last group of y

(5) in summary: pseudocode & backward/forward comparison
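Steps (1)-(5) can be collected into one backward sketch, assuming sigmoid activations and the L2 divergence from earlier; the forward pass is repeated so the block stands alone:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    ys = [x]
    for W, b in zip(weights, biases):
        ys.append(sigmoid(W @ ys[-1] + b))
    return ys

def backward(ys, d, weights):
    # (1) trivial: grad of the output, dDiv/dy = y - d for L2
    dy = ys[-1] - d
    dWs, dbs = [], []
    for i in reversed(range(len(weights))):
        # (2) through the activation: sigmoid'(z) = y * (1 - y)
        dz = dy * ys[i + 1] * (1.0 - ys[i + 1])
        # (3) grad of this group of weights: outer product with the
        #     layer input
        dWs.insert(0, np.outer(dz, ys[i]))
        dbs.insert(0, dz)
        # (4) grad of the previous layer's y
        dy = weights[i].T @ dz
    # (5) gradients laid out like weights/biases, ready for an update
    return dWs, dbs
```

Note the mirror symmetry with the forward pass: forward walks the layers left to right accumulating activations, backward walks them right to left accumulating gradients, reusing the cached `ys`.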













