In: Statistics and Probability
Please answer the below, all parts! Thank you in advance.
Option 1:
Think of a problem dealing with two possibly related variables (Y and X) that you may be interested in. Share your problem and discuss why a regression analysis could be appropriate for this problem.
Specifically, what statistical questions are you asking? Why would you want to predict the value of Y? What if you wanted to predict a value of Y that’s beyond the highest value of X (for example if X is time and you want to forecast Y in the future)?
You should describe the data collection process that you are proposing but you do not need to collect any data.
(a)
Problem dealing with two possibly related variables (Y and X) that I may be interested in
X = Number of hours a student is devoting per day on studies
Y = Score in the examination
(b)
A regression analysis could be appropriate for this problem. because we know by intuition that there is a strong positive relation existing between
X = Number of hours a student is devoting per day on studies and Y = Score in the examination. As the student studies more, his score in the examination will definitely increase. As the student studies less, his score in the examination will also decrease.
(c)
Description of the data collection process that I am proposing:
The exact details of X = Number of hours a student is devoting per day on studies and Y = Score in the examination for every student will be obtained and statistically analyzed.
While collecting this data, to avoid bias, all the students will be informed that their identity will be kept confidential so that they need not worry about giving their accurate frank details.
(d)
The prediction is invalid beyond the highest value of x because we know by intuition, if a student student spends time on studies much more than his capability, he will becomes bery weak mentally and this will badly affect his scores.in the examination.