In: Statistics and Probability
Think of a problem dealing with two possibly related variables (Y and X) that you may be interested in. Share your problem and discuss why a regression analysis could be appropriate for this problem.
Specifically, what statistical questions are you asking? Why would you want to predict the value of Y? What if you wanted to predict a value of Y that’s beyond the highest value of X (for example if X is time and you want to forecast Y in the future)?
You should describe the data collection process that you are proposing but you do not need to collect any data.
PLEASE DON'T COPY ANSWERS THAT ARE ALREADY
POSTED.
Problem:
Suppose we want to predict whether sugar intake by infants results in diabetes(Y) or not as age(X) increases because this will help us to identify whether sugar intake results in diabetes or not.
The birth data of infants and the contact details of parents will be collected from hospitals and then parents will be asked to record the sugar intake once the child starts taking sugar for five years. These infants will be regularly followed year by year to check whether they suffer from diabetes or not. After discussing the upper limit of age having diabetes with doctors having expertise in diabetes, the highest value will be finalized. If we want to predict the value of Y based for the value of age that is beyond the highest value then we will try to not use values that are very far from the highest value because that is more likely to result in the incorrect prediction of Y.
If you have more questions you can send them to us and we will try our best to help you out.
Good luck with your studies!!!