Question

Machine Learning - multivariate methods

Suppose that, in two dimensions, we have two classes with exactly the same mean. What types of decision boundaries can be defined? Show a picture of the options.

Solutions

Expert Solution

Is least-squares error a suitable choice for classification?

No, it is not. One issue with the least-squares solution is that it lacks robustness to outliers. In the accompanying figure, the left-hand side shows two classes, drawn in red and blue, separated by two decision boundaries. The green boundary is the solution obtained by logistic regression, and the magenta boundary is the least-squares solution. On the right-hand side, after more data points are added to the training set, the magenta boundary shifts and misclassifies a few data points, while the green boundary is essentially unmoved. This shows that the least-squares error penalizes even predictions that are "too correct", that is, points that lie far from the decision boundary on the correct side.
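
The effect described above can be reproduced on a small one-dimensional toy problem. The sketch below is only an illustration under stated assumptions: the data points, the function names, the learning rate and the number of gradient steps are all made up for this example and do not come from the original figure. It fits a least-squares line to targets of -1/+1 and a logistic regression model to labels of 0/1, then adds a few class-1 points far on the correct side and compares how much each decision boundary moves.

```python
import math

def least_squares_boundary(xs, ts):
    """Fit t ~ w*x + b by ordinary least squares and return x where w*x + b = 0."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_t = sum(ts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxt = sum((x - mean_x) * (t - mean_t) for x, t in zip(xs, ts))
    w = sxt / sxx
    b = mean_t - w * mean_x
    return -b / w

def logistic_boundary(xs, ys, lr=0.01, steps=20000):
    """Fit p = sigmoid(w*x + b) by plain gradient descent on the cross-entropy
    error and return x where p = 0.5. lr and steps are illustrative choices."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        grad_w, grad_b = 0.0, 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))
            grad_w += (p - y) * x
            grad_b += p - y
        w -= lr * grad_w
        b -= lr * grad_b
    return -b / w

# Hypothetical toy data: two slightly overlapping classes, symmetric about x = 0.
class0 = [-2.0, -1.5, -1.0, -0.5, 0.25]
class1 = [-0.25, 0.5, 1.0, 1.5, 2.0]
# Extra class-1 points that lie far on the correct side of the boundary.
far_class1 = [8.0, 8.5, 9.0, 9.5]

def boundaries(c0, c1):
    xs = c0 + c1
    ts = [-1.0] * len(c0) + [1.0] * len(c1)   # targets for least squares
    ys = [0.0] * len(c0) + [1.0] * len(c1)    # labels for logistic regression
    return least_squares_boundary(xs, ts), logistic_boundary(xs, ys)

ls_before, lr_before = boundaries(class0, class1)
ls_after, lr_after = boundaries(class0, class1 + far_class1)

print("least squares: boundary before =", ls_before, "after =", ls_after)
print("logistic:      boundary before =", lr_before, "after =", lr_after)
```

On this toy data the least-squares boundary moves far enough to put the class-1 point at x = 0.5 on the wrong side, while the logistic regression boundary stays close to where it was. The extra points are already classified correctly by a wide margin, so the shift comes purely from the squared error penalizing them for being far from the target value.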

The least-squares solution fails not only on data containing outliers but also in several other cases. This happens because the least-squares solution is equivalent to the maximum likelihood solution under the assumption of a Gaussian conditional distribution for the targets (as we will see in later sections). Therefore, for datasets whose targets do not follow such a distribution, as is the case with discrete class labels, least squares will not be a good choice.
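
The equivalence with maximum likelihood can be sketched in a few lines. The notation here is an assumption made for illustration and is not taken from the original: t_n is the target for input x_n, w is the weight vector, and sigma^2 is a fixed noise variance.

```latex
% Assumed model: each target is the linear prediction plus Gaussian noise.
p(t_n \mid \mathbf{x}_n, \mathbf{w})
  = \mathcal{N}\!\left(t_n \mid \mathbf{w}^{\top}\mathbf{x}_n,\ \sigma^2\right)

% Negative log-likelihood of N independent observations.
-\ln \prod_{n=1}^{N} p(t_n \mid \mathbf{x}_n, \mathbf{w})
  = \frac{1}{2\sigma^2} \sum_{n=1}^{N} \left(t_n - \mathbf{w}^{\top}\mathbf{x}_n\right)^2
  + \frac{N}{2} \ln\!\left(2\pi\sigma^2\right)
```

The second term does not depend on w, so maximizing the likelihood over w is the same as minimizing the sum of squared errors. Class labels take only a few discrete values, which is about as far from Gaussian as targets can be.
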
There are several alternative error functions for classification.
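
The text does not say which alternatives it has in mind. One commonly used choice for two classes is the cross-entropy error of logistic regression, written here in assumed notation with labels t_n in {0, 1} and y_n the predicted probability of class 1:

```latex
% Cross-entropy error for binary labels t_n \in \{0, 1\}.
E(\mathbf{w}) = -\sum_{n=1}^{N}
  \left[\, t_n \ln y_n + (1 - t_n) \ln\!\left(1 - y_n\right) \right],
\qquad
y_n = \frac{1}{1 + \exp\!\left(-\mathbf{w}^{\top}\mathbf{x}_n\right)}
```

For a point far on the correct side of the boundary, y_n approaches t_n and its term approaches zero, so such points stop influencing the fit instead of being penalized as they are under squared error.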

