Question

Can I use a linear gain (activation) function in any of the layers of a multilayer perceptron? Explain.

Solutions

Expert Solution

Yes, you can, but a hidden layer with a linear activation is as good as having no hidden layer: the linear activation reduces the network's equations to the form of linear regression, only with redundant parameters. An example makes the point clearer.

Consider a linear function f(x) = ax + b and a second linear function g(z) = cz + d. Applying g to the output of f, which is what happens when one layer with a linear activation feeds the next, gives g(f(x)) = c(ax + b) + d = acx + cb + d = (ac)x + (cb + d). This is itself a linear function, just with redundant parameters: writing A = ac and B = cb + d, it is simply Ax + B.
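The collapse can be checked numerically. The following is a minimal sketch in plain Python; the helper `make_linear` and the coefficient values are hypothetical, chosen only to illustrate the algebra above.

```python
# Scalar "layers" with a linear activation.
# make_linear and the coefficient values are illustrative assumptions.
def make_linear(weight, bias):
    """Return the function x -> weight * x + bias."""
    def layer(x):
        return weight * x + bias
    return layer

a, b = 2.0, 3.0    # first layer:  f(x) = a x + b
c, d = -1.5, 0.5   # second layer: g(z) = c z + d

f = make_linear(a, b)
g = make_linear(c, d)

# The single layer the two-layer stack collapses to: A = ac, B = cb + d.
collapsed = make_linear(a * c, c * b + d)

for x in [-2.0, 0.0, 1.0, 4.0]:
    stacked = g(f(x))
    # Both computations agree at every test point.
    assert abs(stacked - collapsed(x)) < 1e-12
    print(x, stacked, collapsed(x))
```

The same argument carries over to vector-valued layers, where the products of coefficients become products of weight matrices.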

Therefore, stacking layers with linear activations adds no representational power to the MLP. In fact, the main purpose of non-linear activation functions such as sigmoid, ReLU, or tanh is to let the MLP learn non-linear, more complex higher-order features and so make more accurate predictions.
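A small illustration of what the non-linearity buys is the XOR function. No linear model can reproduce it, because any linear function of two inputs satisfies f(0,0) + f(1,1) = f(0,1) + f(1,0), whereas XOR would need 0 = 2. The sketch below uses a hypothetical two-unit hidden layer with hand-picked (not trained) weights. It shows the same weights computing XOR with ReLU and failing with a linear activation.

```python
def relu(z):
    """Rectified linear unit: max(0, z)."""
    return max(0.0, z)

def identity(z):
    """Linear activation: passes the input through unchanged."""
    return z

def tiny_mlp(x1, x2, activation):
    # Two hidden units and one output unit.
    # The weights are hand-picked for illustration, not learned.
    h1 = activation(x1 + x2)
    h2 = activation(x1 + x2 - 1.0)
    return h1 - 2.0 * h2

inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
xor_targets = [0.0, 1.0, 1.0, 0.0]

relu_outputs = [tiny_mlp(x1, x2, relu) for x1, x2 in inputs]
linear_outputs = [tiny_mlp(x1, x2, identity) for x1, x2 in inputs]

print("targets:", xor_targets)
print("ReLU:   ", relu_outputs)    # matches the XOR targets
print("linear: ", linear_outputs)  # collapses to 2 - (x1 + x2)
```

With the linear activation the network reduces to the single linear function 2 - (x1 + x2), so it cannot match the targets however the weights are chosen.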
