Question

In: Computer Science

1. Multi-layer BP neural networks have no proof of converging to an optimal solution. Is this...

1. Multi-layer BP neural networks have no proof of converging to an optimal solution. Is this true? If it is, then why do we bother to use them?

1. What is the fundamental equation that guides changes to a weight wij in a BP network. Describe its components.

Solutions

Expert Solution

1). ANSWER :

GIVENTHAT :

Backpropagation, short for “backward propagation of errors”, is a mechanism used to update the weights using gradient descent. It calculates the gradient of the error function with respect to the neural network’s weights. The calculation proceeds backwards through the network.

Gradient descent is an iterative optimization algorithm for finding the minimum of a function; in our case we want to minimize th error function. To find a local minimum of a function using gradient descent, one takes steps proportional to the negative of the gradient of the function at the current point.

For example, to update w6, we take the current w6 and subtract the partial derivative of error function with respect to w6. Optionally, we multiply the derivative of the error function by a selected number to make sure that the new updated weight is minimizing the error function; this number is called learning rate.

The derivation of the error function is evaluated by applying the chain rule as following

So to update w6 we can apply the following formula

Similarly, we can derive the update formula for w5 and any other weights existing between the output and the hidden layer.

However, when moving backward to update w1, w2, w3 and w4 existing between input and hidden layer, the partial derivative for the error function with respect to w1, for example, will be as following.

We can find the update formula for the remaining weights w2, w3 and w4 in the same way.

In summary, the update formulas for all weights will be as following:

We can rewrite the update formulas in matrices as following

Backward Pass

Using derived formulas we can find the new weights.

Learning rate: is a hyperparameter which means that we need to manually guess its value.

Now, using the new weights we will repeat the forward passed

We can notice that the prediction 0.26 is a little bit closer to actual output than the previously predicted one 0.191. We can repeat the same process of backward and forward pass until error is close or equal to zero.

Backpropagation Visualization

You can see visualization of the forward pass and backpropagation here.

You can build your neural network using netflow.js


Related Solutions

1. The optimal solution of the linear programming problem is at the intersection of constraints 1...
1. The optimal solution of the linear programming problem is at the intersection of constraints 1 and 2. Please answer the following questions by using graphical sensitivity analysis. Max 2x1 + x2 s.t. 4x1 +1x2 ≤8 4x1 +3x2 ≤12   1x1 +2x2 ≤6 x1 , x2 ≥ 0 A. Over what range can the coefficient of x1 vary before the current solution is no longer optimal? B. Over what range can the coefficient of x2 vary before the current solution is...
The optimal solution of the linear programming problem is at the intersection of constraints 1 and...
The optimal solution of the linear programming problem is at the intersection of constraints 1 and 2. Please answer the following questions by using graphical sensitivity analysis. Max s.t. Max 2x1 + x2 s.t. 4x1 +1x2 ≤8 4x1 +3x2 ≤12 1x1 +2x2 ≤6 x1 , x2 ≥ 0 Over what range can the coefficient of x1 vary before the current solution is no longer optimal? Over what range can the coefficient of x2 vary before the current solution is no...
C. Anti-reflection coatings So far you have explored constructive interference from multi-layer thin films. It is...
C. Anti-reflection coatings So far you have explored constructive interference from multi-layer thin films. It is also possible for interference to be destructive, a phenomenon exploited in making antireflection coatings for optical elements such as eyeglasses. In order to allow the lenses to be thinner (and thus lighter weight), eyeglass lenses can be made of a plastic that has a high index of refraction (n p = 1.70). The high index causes the plastic to reflect light more effectively than...
1. Two converging lenses have the same focal length of 5.00 cm. They have a common...
1. Two converging lenses have the same focal length of 5.00 cm. They have a common principal axis and are separated by 28.5 cm. An object is located 10.0 cm to the left of the left-hand lens. What is the image distance (relative to the lens on the right) of the final image produced by this two-lens system? 2. A plate glass window (n = 1.5) has a thickness of 5.2 mm. How long does it take light to pass...
To find the optimal solution to a linear optimization problem, do you have to examine all...
To find the optimal solution to a linear optimization problem, do you have to examine all the points in the feasible region? Explain. Can a linear programming problem have no solution? More than one solution? Explain. ---------------------------------------------------------------------------------------------------------------- A beverage can manufacturer makes three sizes of soft drink cans—Small, Medium and Large. Production is limited by machine availability, with a combined maximum of 90 production hours per day, and the daily supply of metal, no more than 120 kg per day....
1. Major television networks have never seemed to have issues showing commercials for beer and other...
1. Major television networks have never seemed to have issues showing commercials for beer and other alcoholic beverages. Even though adult viewers tend to enjoy the commercials, most adults seem to think that the commercials target teenagers and young adults (those under 21 years old). To study this belief, the networks conducted a joint poll of viewers and asked them if they felt that beer and other alcoholic beverage commercials targeted teenagers and young adults. The results of the survey...
Assignment problems can never have more than one optimal solution. True False The assignment algorithm can...
Assignment problems can never have more than one optimal solution. True False The assignment algorithm can be used to solve both minimization problems and maximization problems. True False In an assignment problem, a dummy source is given a very high cost for minimization problems and a low value for maximization problems so as to avoid going to the dummy first. True False The objective of an assignment problem solution most often is to minimize the total costs or time of...
You have 1 liter of 80 proof vodka (40% by weight ethanol in water) and someone...
You have 1 liter of 80 proof vodka (40% by weight ethanol in water) and someone removes 3 shots (100ml). What is the resulting concentration (proof)? Next someone fills the bottle back up with water, what is the resulting proof?
1. The adsorbed ions comprising the Stern layer around a charged particle have the same charge...
1. The adsorbed ions comprising the Stern layer around a charged particle have the same charge (positive or negative) as the particle itself. True. False, they have the opposite charge. False, there are an equal number of positively and negatively charged ions in the Stern layer. 2. Both ionic and nonionic surfactants can stabilize colloidal dispersions by adsorbing onto particle surfaces. True. False, nonionic surfactants do not adsorb on particle surfaces. False, surfactant adsorption always destabilizes colloidal dispersions by reducing...
1) Your smartphone is a multi function tool you have on your hand. What equipment in...
1) Your smartphone is a multi function tool you have on your hand. What equipment in the previous time have been replaced because of its existence?   2) Enumerate and describe the function of at least FIVE Internet of Medical Things wearable devices existing in the market. 3) Think of at least FIVE possible Internet of Medical Things wearable devices not existing in the market. Describe how each of these will be useful in your chosen medical field 4) Can you...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT