Question

In: Computer Science

Scenario. Centres for Disease Control The Centres for Disease Control and Prevention (CDC) is the national...

Scenario. Centres for Disease Control The Centres for Disease Control and Prevention (CDC) is the national public health institute of the United States. Its main aim is to protect people health and safety through the control and prevention of diseases. CDC had to rely on doctor reports of influenza outbreaks. CDC was weeks behind in providing vaccines to the affected patients. Using historical data from the CDC, Google compared search term queries against geographical areas that were known to have had flu outbreaks. Google then found forty five terms correlated with the outbreak of flu. With this data, CDC can act immediately. Questions: 1. To effectively identify the terms related to influenza outbreaks, Google may need to apply clustering techniques, which type or types of clusters would best fit this application? Please justify your answer. 2. To better the clustering performance, what configuration of the clustering would you suggest Google to do according to the characteristics of the input data? Please justify your answer. 3. Another option for Google is to use the classification approach. Please compare the classification and clustering approaches in the context of this scenario, by listing at least three differences between them and the impacts to the sorting process.

Solutions

Expert Solution

Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group than those in other groups. In simple words, the aim is to segregate groups with similar traits and assign them into clusters.

We have clustering algorithms in machine learning

Connectivity model

Centroid model Distribution model

Density model

Semantic, Hierarchical, Online Clustering algorithm which comes under the Connectivity model is that uses suffix arrays to extract frequent phrases and singular value decomposition techniques to discover the cluster content. Hence it would be beneficial to extract out the terms related to any medical condition, as question specific, influenza outbreaks.

2. Clustering results mainly depend upon the selected objective function than on the selected algorithm. Clusters of variable sizes would also cause large clusters to be split, and smaller ones to be merged. Use of genetic algorithm,random swap, particle swarm optimization, spectral and density clustering helps understanding the complex algorithms. Better clustering result could be achieved by using an objective function based on Mahalanobis distance or Gaussian mixture model instead of SSE, if a natural cluster is the expected outcome.

3. Although both techniques have certain similarities, the difference lies in the fact that classification uses predefined classes in which objects are assigned, while clustering identifies similarities between objects, which it groups according to those characteristics in common and which differentiate them from other groups of objects. These groups are known as "clusters".


Related Solutions

Scenario. Centres for Disease Control The Centres for Disease Control and Prevention (CDC) is the national...
Scenario. Centres for Disease Control The Centres for Disease Control and Prevention (CDC) is the national public health institute of the United States. Its main aim is to protect people health and safety through the control and prevention of diseases. CDC had to rely on doctor reports of influenza outbreaks. CDC was weeks behind in providing vaccines to the affected patients. Using historical data from the CDC, Google compared search term queries against geographical areas that were known to have...
You are an administrator at the Centers for Disease Control and Prevention (CDC) in Atlanta, Georgia....
You are an administrator at the Centers for Disease Control and Prevention (CDC) in Atlanta, Georgia. You are informed that a new strain of the flu virus is quickly spreading across the country. It is potentially life-threatening for young children and the elderly. The CDC quickly begins producing a vaccine for the new strain but there is not going to be enough vaccine produced to distribute to everyone in need. You are asked to identify how to distribute the vaccine...
According to the Centers for Disease Control and Prevention (CDC), the mean weight of U.S. men...
According to the Centers for Disease Control and Prevention (CDC), the mean weight of U.S. men of age 30–39 years old is191.802 pounds. Using the provided sample data file, conduct a one-sample ?-z-test of a mean to test whether the mean weight of men of age 30–39 years old who smoke daily is lower than191.802. Conduct the z-test at a significance level of α=0.05 to test the null hypothesis, H0:μ=191.802, against the alternative hypothesis, H1:μ<191.802, where ?μ represents the mean...
Through a published schedule and set of guidelines, the Centers for Disease Control and Prevention (CDC)...
Through a published schedule and set of guidelines, the Centers for Disease Control and Prevention (CDC) and public health officials recommend that every child receive certain vaccinations by age 6. What are the benefits of this recommendation to public health officials, to the community and to other children? Some parents and health care professionals question the CDC’s recommendations and decide not to vaccinate their children, while others, like Jennifer Margulis, choose to vaccinate their children along an alternative schedule. How...
According to the Centers for Disease Control and Prevention (CDC), obesity in the U.S. population increased...
According to the Centers for Disease Control and Prevention (CDC), obesity in the U.S. population increased from about 12% in 1991 to about 34% in 2006. The highest increase occurred in 18- to 29-year-olds. Although part of the reason is our increasingly sedentary lifestyle, the principal cause is our overconsumption of abundant processed food products, many of which are high in fat and sugar content. Why do you think fat tastes good to so many people, and is difficult to...
If you were the director of the Centers for Disease Control and Prevention (CDC), what measures...
If you were the director of the Centers for Disease Control and Prevention (CDC), what measures would you implement to reduce the US vulnerability to pandemics in the future? Explain your answer; use evidence to support your answer.
Healthy Lifestyles The Centers for Disease Control and Prevention (CDC) in Atlanta, Georgia, is the government...
Healthy Lifestyles The Centers for Disease Control and Prevention (CDC) in Atlanta, Georgia, is the government agency responsible for disease-related issues in the United States. The CDC coordinates efforts to counteract outbreaks of diseases and funds a variety of medical and health research studies. The CDC also serves as a central clearinghouse for health-related data. The CDC conducts the annual Behavioral Risk Factor Surveillance Survey. The survey measures a whole series of lifestyle characteristics that relate to health and longevity,...
The Centers for Disease Control and Prevention (CDC) recommended against vaccinnating the whole population against the...
The Centers for Disease Control and Prevention (CDC) recommended against vaccinnating the whole population against the smallpox virus because the vaccination has undesirable, and sometimes fatal, side effects. Suppose that the accompanying table gives the data that are available about the effects of a smallpox vaccination program. Percent of population vaccinated Deaths Due to Small Pox Deaths due to vaccination side effects 0% 200 0 10% 180 4 20% 160 8 30% 140 12 40% 120 16 50% 100 20...
Use guidelines from the Centers for Disease Control and Prevention (CDC) to validate your information on...
Use guidelines from the Centers for Disease Control and Prevention (CDC) to validate your information on legal requirements in regard to reporting STDs. Present your findings on reportable diseases in the state of Florida and legal implications in failure to reporting.
Reducing Health Disparities In 2011, the Centers for Disease Control and Prevention (CDC) published the first...
Reducing Health Disparities In 2011, the Centers for Disease Control and Prevention (CDC) published the first CDC Health Disparities and Inequalities Report (CHDIR). This report examined health disparities in the United States associated with various characteristics, including race/ethnicity, sex, income, education, disability status, and geography. Among other recommendations, the 2011 CHDIR emphasized the need to address health disparities with a dual intervention strategy that focuses on populations at greatest need and improves the health of the general population by making...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT