Question

In: Statistics and Probability

hey, how to solved the question such as "State which features are categorical". and "Which are...

hey, how to solved the question such as "State which features are categorical". and "Which are the two most strongly correlated features? What is the numerical and/or statistical relationship between them?" for a dataset

Solutions

Expert Solution

Answer:

"State which features are categorical"

Categorical features can only take on a limited, and usually fixed, number of possible values. For example, if a dataset is about information related to users, then you will typically find features like country, gender, age group, etc. Alternatively, if the data you're working with is related to products, you will find features like product type, manufacturer, seller and so on.

Hence the variable which contains different categories as observation then this variable is categorical feature.

"Which are the two most strongly correlated features?

Main thing to find correlation is, variable must be quantitative.

For this we have to find correlation between different variables. For this make all possible pairs of all varibles and then find correlation between each pair of variables. Large value of correlation of any pair is most strongly correlated two features.

You can also find the correlation matrix in Rstudio. And the large value detects that these two features are most strongly correlated.

What is the numerical and/or statistical relationship between them?"

To know the statistical relationship between these two variables, Simply find the regression equation of this two variables. This regression will give us the statistical relationship between them.


Related Solutions

classify each variable as quantitative or categorical. for categorical- state whether its ordinal or nominal for...
classify each variable as quantitative or categorical. for categorical- state whether its ordinal or nominal for quantitative- state whether its continuous or discrete and whether the level of measurement is ratio or interval VARIABLES: Marital Status Happiness Cholestoral Change Blood Pressure Change Vision Change Age Male
Which of the following is not a categorical variable?
Which of the following is not a categorical variable? Gender Diabetes Type Il status (Y/N) Height in cm Age divided into quartiles (<25th Percentile. 25-50th Percentile. 50-75th percentile, >75th percentile)
Question: a)Discuss three stylized features of Financial data b) Explain how the features in (a) can...
Question: a)Discuss three stylized features of Financial data b) Explain how the features in (a) can be modeled using linear time series models c) i) Ecplain the moments of a random variable ii) How can you estimate these in emphirical applications d) i) Explain the Jaque-Bera Test (JB) , stating clearly , the null alternative hypothesis. ii) In the case of financial data , do you agree JB test to accept or reject the null hypothesis?Explain
And please show me the step by step process of how you solved the question. I...
And please show me the step by step process of how you solved the question. I have a learning disability and I need to know the steps to solve the question If it takes 17.5 mL of 0.085 M NaOH to titrate a 15 mL sample of sauerkraut juice, what is the acidity of that juice, expressed as % lactic acid (wt/vol)?
State the two features of a centrally planned economy. Discuss how economic decisions are made in...
State the two features of a centrally planned economy. Discuss how economic decisions are made in this setting. In the course of your answer, develop an input-output table to complement your discussion.
Identify whether the following variables are numerical or categorical. If numerical, state whether the variable is...
Identify whether the following variables are numerical or categorical. If numerical, state whether the variable is discrete or continuous. If categorical, state whether the categories have a natural order (ordinal) or not (nominal). a. Fraction (or percentage) of birds in a large sample infected with avian flu virus b. Number of crimes committed by a randomly sampled individual. c. gender d. Logarithm of body mass e. Stage of fruit ripeness (eg., underripe, ripe, or overripe) f. Tree species g. Petal...
How can categorical population parameters be sampled?
How can categorical population parameters be sampled?
What is the puzzle of the Sea Battle, and how is it to be solved?
What is the puzzle of the Sea Battle, and how is it to be solved?
1. Name the six key features of good experimental studies and state how each is achieved....
1. Name the six key features of good experimental studies and state how each is achieved. 2. What is the aim of a randomized experimental study that tests a health care intervention?
State EIGHT (8) features of open source software.
State EIGHT (8) features of open source software.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT