Comments (6)
I think the derivative is 2 * ((0.5 * weight) - 0.8) * 0.5, that is, 2 * 0.5 * ((0.5 * weight) - 0.8), so the result is 0.5 * weight - 0.8.
from grokking-deep-learning.
The general formula for this is 2 * ((input * weight) - goal_pred) * input.
In neural networks, people often don't care about the exact coefficient of the derivative, since a constant factor can be absorbed into the learning rate. So they just omit the 2 and keep the key part of the derivative.
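A quick way to check the formula above is to compare it against a finite-difference estimate. This is only a minimal sketch: the values input = 0.5, goal_pred = 0.8 and weight = 0.5 are taken from the numbers used in this thread, and the function names and the `input_value` parameter name are made up for illustration.

```python
def error(weight, input_value=0.5, goal_pred=0.8):
    """Squared error of a one-weight network."""
    return (input_value * weight - goal_pred) ** 2


def analytic_derivative(weight, input_value=0.5, goal_pred=0.8):
    """d(error)/d(weight) with the factor of 2 kept."""
    return 2 * (input_value * weight - goal_pred) * input_value


def numeric_derivative(weight, eps=1e-6):
    """Central finite difference, used only as a sanity check."""
    return (error(weight + eps) - error(weight - eps)) / (2 * eps)


weight = 0.5
full = analytic_derivative(weight)
numeric = numeric_derivative(weight)
print(full, numeric)  # the two values should agree closely
print(full / 2)       # the form with the 2 omitted: (pred - goal_pred) * input
```

The two forms differ only by the constant 2, so they always point in the same direction.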
from grokking-deep-learning.
Hmm, if error = (input * weight - goal_pred) ** 2,
shouldn't the derivative be 2 * (input * weight - goal_pred)?
Since in this example input = 2, it's the same, but I'm also confused...
from grokking-deep-learning.
The derivative with respect to weight should be 2 * input * (input * weight - goal_pred). That's the chain rule: you also have to differentiate the inner term input * weight with respect to weight, which gives input.
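Written out step by step, differentiating with respect to weight (here e, x, w and g are just shorthand for error, input, weight and goal_pred, introduced for this note):

```latex
\begin{aligned}
e(w) &= (x w - g)^2 \\
\frac{\partial e}{\partial w}
  &= 2\,(x w - g)\cdot\frac{\partial (x w - g)}{\partial w} \\
  &= 2\,x\,(x w - g)
\end{aligned}
```

The inner derivative is x (the input) because w is the variable being differentiated; a factor of w would appear only when differentiating with respect to the input instead.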
from grokking-deep-learning.
As far as I can tell, weights_delta in Chapter 4 is calculated via the delta rule.
Just to clarify:
The delta rule is an update rule for single-layer NNs. It makes use of gradient descent.
Backpropagation is an update rule for multi-layer NNs based on gradient descent.
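To make the distinction concrete, here is a rough sketch of one delta-rule update for a single-layer, single-output linear network. The function name `delta_rule_step`, the sample numbers and the value of alpha are assumptions chosen for illustration, not code from the book.

```python
def delta_rule_step(weights, inputs, goal_pred, alpha=0.1):
    """One delta-rule update for a single-layer, single-output linear network."""
    pred = sum(w * x for w, x in zip(weights, inputs))
    delta = pred - goal_pred                     # raw error of the output node
    weight_deltas = [delta * x for x in inputs]  # scale the error by each input
    return [w - alpha * wd for w, wd in zip(weights, weight_deltas)]


# Illustrative values only (assumed, not taken from the thread).
weights = [0.1, 0.2, -0.1]
inputs = [8.5, 0.65, 1.2]
goal_pred = 1.0

initial_pred = sum(w * x for w, x in zip(weights, inputs))
for _ in range(3):
    weights = delta_rule_step(weights, inputs, goal_pred, alpha=0.01)
pred = sum(w * x for w, x in zip(weights, inputs))
print(initial_pred, pred)  # pred should move toward goal_pred
```

With only one layer there is no hidden-layer error to propagate, which is why no backpropagation step appears here.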
from grokking-deep-learning.
But if we use direction_and_amount = (pred - goal_pred) * input * 2 (i.e., not omitting the 2), doesn't the model converge much faster?
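One way to look at this: keeping the 2 should behave the same as omitting it and doubling the learning rate. The sketch below counts steps under both settings. It assumes an explicit learning rate `alpha`, a stopping tolerance `tol`, and the thread's values input = 0.5, goal_pred = 0.8, weight = 0.5; the helper `steps_to_converge` is hypothetical.

```python
def steps_to_converge(alpha, keep_two, input_value=0.5, goal_pred=0.8,
                      weight=0.5, tol=1e-6, max_steps=10_000):
    """Count gradient-descent steps until the squared error drops below tol."""
    for step in range(max_steps):
        pred = input_value * weight
        if (pred - goal_pred) ** 2 < tol:
            return step
        direction_and_amount = (pred - goal_pred) * input_value
        if keep_two:
            direction_and_amount *= 2  # keep the factor from the derivative
        weight -= alpha * direction_and_amount
    return max_steps


with_two = steps_to_converge(alpha=1.0, keep_two=True)
without_two = steps_to_converge(alpha=1.0, keep_two=False)
without_two_doubled_alpha = steps_to_converge(alpha=2.0, keep_two=False)
print(with_two, without_two, without_two_doubled_alpha)
```

In this setup, keeping the 2 does take fewer steps at the same alpha, but the step count matches the run that omits the 2 and doubles alpha. So the speed-up comes from a larger effective step size, which can also overshoot if alpha is already large.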
from grokking-deep-learning.