Question

In a couple sentences, explain the effect of each of the following concepts on overfitting and underfitting.
1. Hypothesis class complexity
2. Regularization
3. C in SVMs

Solutions

Expert Solution

1.

A statistical hypothesis is an explanation of the relationship between data populations that is interpreted probabilistically. A machine learning hypothesis is a candidate model that approximates a target function for mapping inputs to outputs.

The choice of algorithm (e.g. neural network) and the configuration of the algorithm (e.g. network topology and hyperparameters) define the space of possible hypotheses that the model may represent.

Learning, for a machine learning algorithm, involves navigating the chosen space of hypotheses toward the best, or a good enough, hypothesis for approximating the target function.
A common notation is used where lowercase-h (h) represents a given specific hypothesis and uppercase-h (H) represents the hypothesis space that is being searched.
h (hypothesis): A single hypothesis, e.g. an instance or specific candidate model that maps inputs to outputs and can be evaluated and used to make predictions.
H (hypothesis set): A space of possible hypotheses for mapping inputs to outputs that can be searched, often constrained by the choice of the framing of the problem, the choice of model and the choice of model configuration.
The choice of algorithm and algorithm configuration involves choosing a hypothesis space that is believed to contain a hypothesis that is a good or best approximation for the target function. This is very challenging, and it is often more efficient to spot-check a range of different hypothesis spaces.
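
To make the h versus H notation concrete, here is a minimal sketch. The toy data, the family of threshold classifiers, and the names `make_hypothesis`, `thresholds`, and `training_error` are all assumptions made for illustration, not part of the original answer. H is a small grid of thresholds, and "learning" is simply a search over H for the h with the lowest training error.

```python
# Toy 1-D data (made up): label is 1 for larger x, 0 otherwise.
data = [(0.5, 0), (1.0, 0), (1.5, 0), (2.5, 1), (3.0, 1), (3.5, 1)]


def make_hypothesis(threshold):
    """Return one hypothesis h: predict 1 when x exceeds the threshold."""
    return lambda x: 1 if x > threshold else 0


# H: the hypothesis space, here a finite grid of thresholds (an assumption).
thresholds = [0.0, 1.0, 2.0, 3.0, 4.0]


def training_error(h):
    """Fraction of training points that h labels incorrectly."""
    return sum(1 for x, y in data if h(x) != y) / len(data)


# "Learning" = searching H for the hypothesis with the lowest training error.
best_threshold = min(
    thresholds, key=lambda t: training_error(make_hypothesis(t))
)
h = make_hypothesis(best_threshold)

print("chosen threshold:", best_threshold)
print("training error of chosen h:", training_error(h))
```

A different choice of H (for example, a finer grid or a different model family) would change which hypotheses are even available to the search.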

Shrinking the hypothesis space typically reduces overfitting, but not always. Without resorting to data-side tricks, the way to reduce overfitting is to reduce the capacity of the hypothesis space to fit noise. If shrinking the hypothesis space also lowers that capacity, overfitting decreases; if not, nothing has changed even though the hypothesis space is smaller. Conversely, a hypothesis class that is too simple to contain a good approximation of the target function underfits.
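
The effect of hypothesis class complexity can be sketched with three nested classes on a tiny regression problem. Everything here is an illustrative assumption: the data are made up (y is roughly x plus noise), and the three classes (a constant, a least-squares line, and a polynomial that interpolates every training point) are chosen only because they can be fit in a few lines of plain Python.

```python
# Toy data (made up for illustration): y is roughly x plus small noise.
train_x = [0.0, 1.0, 2.0, 3.0, 4.0]
train_y = [0.1, 0.8, 2.3, 2.7, 4.2]
test_x = [0.5, 1.5, 2.5, 3.5]
test_y = [0.5, 1.5, 2.5, 3.5]


def fit_constant(xs, ys):
    """Simplest class: always predict the mean of the training targets."""
    mean_y = sum(ys) / len(ys)
    return lambda x: mean_y


def fit_line(xs, ys):
    """Least-squares straight line (closed form for one input feature)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return lambda x: intercept + slope * x


def fit_interpolant(xs, ys):
    """Most complex class: the polynomial through every training point
    (Lagrange form), which reproduces the training targets exactly."""
    def predict(x):
        total = 0.0
        for i, (xi, yi) in enumerate(zip(xs, ys)):
            basis = 1.0
            for j, xj in enumerate(xs):
                if j != i:
                    basis *= (x - xj) / (xi - xj)
            total += yi * basis
        return total
    return predict


def mse(h, xs, ys):
    """Mean squared error of hypothesis h on the given points."""
    return sum((h(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


results = {}
for name, fit in [("constant", fit_constant),
                  ("line", fit_line),
                  ("interpolant", fit_interpolant)]:
    h = fit(train_x, train_y)
    results[name] = (mse(h, train_x, train_y), mse(h, test_x, test_y))
    print(name, "train MSE:", results[name][0], "held-out MSE:", results[name][1])
```

On this toy data the training error falls as the class gets richer, while the held-out error is lowest for the line: the constant underfits and the interpolant chases the noise. Other data could of course favor a different class.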

2. A main cause of overfitting is a model that is more complex than necessary, so reducing complexity reduces overfitting. Regularization penalizes complex models: it adds a penalty on the model's parameters (for example, on large coefficients of higher-order terms) and thereby controls model complexity. When a regularization term is added, training minimizes both the loss and the complexity of the model.

Without regularization: Minimize(Loss(model))
With regularization: Minimize(Loss(model) + Complexity(model))

With a suitably chosen strength, regularization reduces variance without a large increase in bias; if the penalty is too strong, the model becomes too simple and underfits. Two common methods are L1 and L2 regularization. Underfitting occurs when the model shows low variance but high bias, and it is often the result of an excessively simple model. Both overfitting and underfitting lead to poor predictions on new data.
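
As a rough illustration of the "Loss + Complexity" objective, the sketch below uses an L2 penalty on a one-weight linear model. The data, the strength parameter `lam`, and the function names are assumptions for this example only; the complexity term is taken to be `lam * w**2`, which gives a closed-form minimizer.

```python
# Toy 1-D data (made up for illustration): y is roughly equal to x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [1.2, 1.9, 3.2, 3.8]


def fit_ridge(lam):
    """Minimize sum((y - w*x)**2) + lam * w**2 over a single weight w.

    Setting the derivative to zero gives w = sum(x*y) / (sum(x*x) + lam).
    """
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)


def training_loss(w):
    """Unpenalized squared-error loss on the training data."""
    return sum((y - w * x) ** 2 for x, y in zip(xs, ys))


# lam = 0 means no regularization; larger lam means a stronger penalty.
lambdas = [0.0, 10.0, 1000.0]
weights = [fit_ridge(lam) for lam in lambdas]
losses = [training_loss(w) for w in weights]

for lam, w, loss in zip(lambdas, weights, losses):
    print("lam:", lam, "weight:", w, "training loss:", loss)
```

The weight shrinks toward zero as `lam` grows, and the training loss rises with it. A moderate penalty trades a little training fit for a simpler model; a very large one flattens the model so much that it underfits.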

3.

SVMs, or Support Vector Machines, are supervised learning models used for classification and regression. When using an SVM, the primary objective is to find a hyperplane that helps us:

• Maximize the margin

• Classify the training points accurately

One of the key regularization parameters that controls the outcome of the model is C, also known as the penalty parameter or cost parameter for misclassification.

The effects of different values of C during training are as follows.

a. A low value of C means a lower penalty for misclassification and gives the model some freedom. As a result, the trained model tolerates some misclassifications and produces a larger-margin hyperplane (a soft margin). Values of C that are too low lead to underfitting, with high bias and low variance.

b. A high value of C means a higher penalty, so training focuses on avoiding misclassified training points. It opts for a hyperplane with better training accuracy and a smaller margin, approaching a hard margin. Values of C that are too high lead to overfitting, with low bias and high variance.

c. As C approaches positive infinity, there is no room for error because the penalty for misclassification is enormous. The model pushes for the best possible training accuracy, which can result in heavy overfitting.

d. As C approaches zero, misclassifications cost almost nothing, so the model underfits badly: training points may end up inside the margin or on the wrong side of the boundary, and the resulting hyperplane may not separate the classes in any useful way.
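
The trade-off in points a to d can be sketched without training anything. The example below evaluates the standard soft-margin objective, 0.5 * w^2 + C * (sum of hinge losses), on two hand-picked 1-D hyperplanes. The data and both candidate hyperplanes are assumptions chosen for illustration; this is not a fitted SVM, only a comparison of which candidate the objective prefers at a given C.

```python
# 1-D toy data (made up): label -1 on the left, +1 on the right,
# plus one +1 point sitting close to the negative cluster.
points = [(-3.0, -1), (-2.0, -1), (-1.0, 1), (2.0, 1), (3.0, 1)]

# Two hand-picked candidate hyperplanes f(x) = w*x + b (assumed, not fitted).
# The margin width is 2/|w|, so the first is wide (4.0), the second narrow (1.0).
candidates = {
    "wide margin": (0.5, 0.0),
    "narrow margin": (2.0, 3.0),
}


def hinge_total(w, b):
    """Sum of hinge losses max(0, 1 - y*f(x)) over the training points."""
    return sum(max(0.0, 1.0 - y * (w * x + b)) for x, y in points)


def objective(w, b, C):
    """Soft-margin objective: margin term plus C times the hinge penalty."""
    return 0.5 * w * w + C * hinge_total(w, b)


def training_errors(w, b):
    """Number of training points on the wrong side of the hyperplane."""
    return sum(1 for x, y in points if y * (w * x + b) <= 0)


def preferred(C):
    """Name of the candidate with the lower objective value for this C."""
    return min(candidates, key=lambda name: objective(*candidates[name], C))


for C in (0.1, 10.0):
    name = preferred(C)
    w, b = candidates[name]
    print("C:", C, "prefers:", name,
          "margin width:", 2.0 / abs(w),
          "training errors:", training_errors(w, b))
```

With the small C the objective accepts one misclassified point in exchange for the wide margin; with the large C it picks the narrow margin that classifies every training point correctly, even though that boundary is dictated by a single point near the other class.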

