Question

In: Statistics and Probability

There are two candidate RNAs for COVID-19 diagnosis: RNA1, RNA2. Canadian Disease Control Center carried out...

There are two candidate RNAs for COVID-19 diagnosis: RNA1, RNA2. Canadian Disease Control Center carried out a clinical trial to check the expression levels for these two RNAs in the subjects with the virus infection: one group of 50 randomly recruited subjects has no critical symptoms; and the other group of 50 subjects has symptoms. After normalization, RNA1 expression levels follow a normal distribution N(0,1) for no-symptom subjects while N(1,1) for subjects with symptoms requiring hospitalization. For RNA2, the corresponding expression levels in nonsymptom subjects and subjects with symptoms follow normal distributions N(0,1) and N(-1,1), respectively.

a. For one breast cancer patient with normalized RNA1 expression level at 2, what is the log-likelihood ratio (LLR) of this patient being diagnosed to be hospitalized? (3 pts)


b. Taking naive Bayes classifier, if we know RNA1=2, RNA2 = 1, what will be the naive Bayes score of the patient being hospitalized? (3 pts)



c. What is the basic assumption of naive Bayes classifier? Under what situations, it may be problematic? (4 pts)

Solutions

Expert Solution

a. infection by the virus can be provisionally diagnosed on the basis of symptoms ,though confirmation is ultimately by reverse transcription polymerase chain reaction(rRT-PCR) of infected secretions (71% sensitivity)or CT imaging (98% sensitivity).

A person is considered at risk if they have travelled to an area with ongoing community transmission within the previous 14 days , or have had close contact with an infected person.

common key indicators include fever,coughing, and shortness of breath. other possible indicators include fatigue, myalgia,anorexia,sputum production, and sore throat.

b. it is easy and fast to predict the class of the test data set.it also performs well in multiclass prediction.

when assumption of independence holds ,a naive bayes classifier performs better compare to other models like logistic regression and you need less training data.

it perform well in case of categorical input variables compared to numerical variable(s) .for numerical variable ,normal distribution is assumed (bell curve, which is a strong assumption).

c. Naive bayes classifier assume that the effect of the value of a predictor (x) on a given class (c) is independent of the values of other predictors. this assumption is called class conditional independence.p (c/x) is the posterior probability of class ( target) given predictor ( attribute)

Naive bayes is so called because the independence assumptions we have just made are indeed very naive for a model of natural language. the conditional independence assumption states that features are independent of each other given the class.this is hardly ever true for terms in documents.

A subtle issue ("disadvantage" if you like) with naive bayes is that if you have no occurences of a class label and a certain attribute value together ( e.g. class="nice",shape="sphere") then the frequency based probability estimate will be zero .given naive bayes conditional independence assumption, when all the probabilities are multiplied you will get zero and this will affect the posterior probability estimate.

this problem happens when we are drawing samples from a population and the drawn vectors are not fully representative of the population.lagrange correction and the other schemes have been proposed to avoid this undesirable situation.


Related Solutions

List similarities and differences between kawasaki disease and covid 19. Also, list treatment and diagnosis.
List similarities and differences between kawasaki disease and covid 19. Also, list treatment and diagnosis.
In 250 words, Discuss nCoVPC(noninfectious positive control material) using to develop PCR diagnosis kit for Covid-19.
In 250 words, Discuss nCoVPC(noninfectious positive control material) using to develop PCR diagnosis kit for Covid-19.
The presence of Corona Virus Disease 2019 (COVID-19) in Malaysia, is part of the COVID-19 pandemic,...
The presence of Corona Virus Disease 2019 (COVID-19) in Malaysia, is part of the COVID-19 pandemic, was first reported in January 2020. Due to that, Malaysia Movement Control (MCO) is a cordon sanitaire implemented as a preventive measure by the federal government of Malaysia in response to the COVID-19 pandemic in the country on 18 March 2020. The order was commonly referred to in local and international media as a "lockdown". As a University student in network especially in wireless...
According to the Center for Disease Control and Prevention (2015), cardiovascular disease (CVD) is the leading...
According to the Center for Disease Control and Prevention (2015), cardiovascular disease (CVD) is the leading cause of deaths in the United States, equating to about 1 in every 4 deaths, even though CVD is largely preventable. Recently, various studies have shown promise with stem cell therapy treating heart disease. Research stem cell therapy in the treatment of heart disease and the possible promises it has as a therapy. In your post, address the following questions. Should science and healthcare...
The past two months have led to an abundance of information on the novel disease COVID-19...
The past two months have led to an abundance of information on the novel disease COVID-19 and the virus that causes this disease, SARS CoV-2.      Describe your understanding of the physiology of the disease on any of the systems of the body and how our understanding has changed.      Because the seriousness and enormity of the pandemic, the rush to make recommendations on the treatment of the disease has led to poorly researched recommendations. Give an example of a bad recommendation...
A study was carried out to determine whether the resistance of a control circuit in a...
A study was carried out to determine whether the resistance of a control circuit in a machine is lower when the machine motor is running. To investigate this question, some of the control circuits were tested as follows. Their resistance was measured while the machine motor was not running and then again while the motor was running for a certain period of time. The values found are listed in ‘Dataset’, with kilo-Ohms as the unit of measurement. Answer the following...
Recently the COVID-19 pandemic has huge impact on Canadian economy. Explain how the COVID-19 pandemic affect...
Recently the COVID-19 pandemic has huge impact on Canadian economy. Explain how the COVID-19 pandemic affect the Canadian economy in terms of GDP, inflation rate and unemployment rate. Compare the economic impact of current COVID-19 pandemic with the economic impact of global financial crisis, which happened during 2008-2009.
Corona Virus Disease Pandemic (COVID-19) has a devastated impact on many economies of the world. COVID-19...
Corona Virus Disease Pandemic (COVID-19) has a devastated impact on many economies of the world. COVID-19 is not only a public health and medical issue but also an economic and fiscal matter. Ghana recorded the first case of the virus in March 2020, three months into the implementation of the national budget. Therefore, the cost of fighting the disease is unbudgeted for, which has created a fiscal challenge for the country. The Minister of Finance recently in a statement to...
COVID-19 is a contagious disease caused by a newly discovered coronavirus.
  PHC231 COVID-19 is a contagious disease caused by a newly discovered coronavirus. It is an ongoing health emergency globally, and the risk of hospital-acquired infection is worrying about health workers. In this situation, discuss the epidemiological aspects of disease transmission and the types of prevention and control methods that should be taken to restrict or minimize the spread of infection in a healthcare setting. How much of a current threat do you feel this outbreak has on the population...
Find out the control words for the following Microoperations. Also, specify the functions being carried out...
Find out the control words for the following Microoperations. Also, specify the functions being carried out by each of these Microoperations. (i) R2←clc(R2+R4+1) (ii) -R3+R5 (iii) Output← shr(R7-R2-1) (iv) R1←Input (v) R3←R4’+1 (vi) R6←R2-R1+1 (vii) R5←0 (viii) R3←R2, C←1 (ix) R1←Input-R4-1 (x) R7←R6’+Input .
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT