Question

In: Computer Science

1. What is the fundamental equation that guides changes to a weight wij in a BP...

1. What is the fundamental equation that guides changes to a weight wij in a BP network.   Describe its components.

2. Multi-layer BP neural networks have no proof of converging to an optimal solution. Is this true? If it is, then why do we bother to use them?

Solutions

Expert Solution

Backpropagation, short for “backward propagation of errors”, is a mechanism used to update the weights using gradient descent. It calculates the gradient of the error function with respect to the neural network’s weights. The calculation proceeds backwards through the network.

Gradient descent is an iterative optimization algorithm for finding the minimum of a function; in our case we want to minimize th error function. To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient of the function at the current point.

For example, to update w6, we take the current w6 and subtract the partial derivative of error function with respect to w6. Optionally, we multiply the derivative of the error function by a selected number to make sure that the new updated weight is minimizing the error function; this number is called learning rate.

The derivation of the error function is evaluated by applying the chain rule as following

So to update w6 we can apply the following formula

Similarly, we can derive the update formula for w5 and any other weights existing between the output and the hidden layer.

However, when moving backward to update w1, w2, w3 and w4 existing between input and hidden layer, the partial derivative for the error function with respect to w1, for example, will be as following.

We can find the update formula for the remaining weights w2, w3 and w4 in the same way.

In summary, the update formulas for all weights will be as following:

We can rewrite the update formulas in matrices as following

Backward Pass

Using derived formulas we can find the new weights.

Learning rate: is a hyperparameter which means that we need to manually guess its value.

Now, using the new weights we will repeat the forward passed

We can notice that the prediction 0.26 is a little bit closer to actual output than the previously predicted one 0.191. We can repeat the same process of backward and forward pass until error is close or equal to zero.

Backpropagation Visualization

You can see visualization of the forward pass and backpropagation here.

You can build your neural network using netflow.js


Related Solutions

What is a debit? What is a credit? What is the fundamental accounting equation?
  What is a debit? What is a credit? What is the fundamental accounting equation?
Race Weight Systolic BP 1. Construct confidence intervals for the following situations. You will be prompted...
Race Weight Systolic BP 1. Construct confidence intervals for the following situations. You will be prompted to indicate the: critical values, margin of error, confidence limits, and an interpretation.    (18 points) Hispanic 117 137 Caucasian 107 127 a. Determine the proportion (percentage) of Asians for your point estimate. Then construct a 90% confidence interval for the true proportion of all Asian patients. Caucasian 101 119 Hispanic 123 143 Sample Proportion Hispanic 121 141 Critical Value Caucasian 157 174 Margin of...
Race Weight Systolic BP 1. Construct confidence intervals for the following situations. You will be prompted...
Race Weight Systolic BP 1. Construct confidence intervals for the following situations. You will be prompted to indicate the: critical values, margin of error, confidence limits, and an interpretation.    (18 points) Hispanic 117 137 Caucasian 107 127 a. Determine the proportion (percentage) of Asians for your point estimate. Then construct a 90% confidence interval for the true proportion of all Asian patients. Caucasian 101 119 Hispanic 123 143 Sample Proportion Hispanic 121 141 Critical Value Caucasian 157 174 Margin of...
3. IAS8 Accounting Policies, Changes in Accounting Estimates and Errors guides the process of selecting and...
3. IAS8 Accounting Policies, Changes in Accounting Estimates and Errors guides the process of selecting and applying accounting policies, accounting for changes in estimates and reflecting corrections of prior period errors. a) Distinguish between the terms “accounting policy” and “accounting standard” and give an example of how some of the firms accounting policies can be dictated by an international accounting standard. b) Discuss the disclosure requirements regarding a change in an accounting policy.
A gas which obeys the equation of state, PV/RT=1 +BP , where B is a known...
A gas which obeys the equation of state, PV/RT=1 +BP , where B is a known temperature-dependent parameter, is fed at a steady rate to an insulated valve where its pressure is reduced from P1 to P2. Given that the upstream temperature of the gas is T1 and that the constant-pressure molar heat capacity of the gas in the ideal gas state is given by Cp* = a + bT, where a and b are known constants, derive an expression...
‏1. Use the demand-supply fundamental model to illustrate the changes in the value of the Egyptian...
‏1. Use the demand-supply fundamental model to illustrate the changes in the value of the Egyptian pound (EGP) against the U.S. dollar (USD) in the past few years. In your answer, try as much as possible to distinguish between the demand-side factors and the supply-side factors as discussed in class. Finally, show how the intervention by the Central Bank of Egypt (CBE) at some point has led to the creation and deepening of the “Black Market”. Although drawing graphs is...
The automotive industry has witnessed substantial changes in channels of distribution. What are the fundamental reasons...
The automotive industry has witnessed substantial changes in channels of distribution. What are the fundamental reasons such as consumer behavior, competition, or technology (or others) for this change? Of the fundamental reasons stated for the changes to channels of distribution, what do you see as being the most significant and why?
The weight and systolic blood pressure (BP) of 26 randomly selected males in the age group...
The weight and systolic blood pressure (BP) of 26 randomly selected males in the age group 25-30 are shown in the SAS code. Assume that for any fixed value of weight, BP is normally distributed. Number of Observations Read 27 Number of Observations Used 26 Number of Observations with Missing Values 1 Analysis of Variance Source DF Sum of Squares Mean Square F Value Pr > F Model 1 2693.58122 2693.58122 35.74 <.0001 Error 24 1808.57262 75.35719 Corrected Total 25...
Explain why changes in Ca2+ levels are fundamental in short-term synaptic changes such as facilitation, synaptic...
Explain why changes in Ca2+ levels are fundamental in short-term synaptic changes such as facilitation, synaptic depression, augmentation, and posttetanic potentiation. What can these short-term changes contribute to the study of learning and memory?
What is the error of the predicted Systolic BP when Age = 39? Age Systolic BP...
What is the error of the predicted Systolic BP when Age = 39? Age Systolic BP Year Stories Height Year Germany GDP 47 145 1990 54 770 1950 5.725433 65 162 1980 47 677 1951 6.256754 46 142 1990 28 428 1952 6.70308 67 170 1989 38 410 1953 7.256435 42 124 1966 29 371 1954 7.72644 67 158 1976 38 504 1955 8.570349 56 154 1974 80 1136 1956 9.076571 64 162 1991 52 695 1957 9.45931 56 150...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT