Question

In: Statistics and Probability

FiscalNote is a startup founded by a Washington, DC entrepreneur and funded by a Singapore sovereign...

FiscalNote is a startup founded by a Washington, DC entrepreneur and funded by a Singapore sovereign wealth fund, the Winklevoss twins of Facebook fame, and others. It uses machine learning and data mining techniques to predict for its clients whether legislation in the US Congress and in US state legislatures will pass or not. The company reports 94% accuracy. (Washington Post, November 21, 2014, “Capital Business”) ConsideringjustbillsintroducedintheUSCongress,do a bit of internet research to learn about the numbers of bills introduced and passage rates. Identifythepossibletypes of misclassifications, and comment on the use of overall accuracy as a metric. Include a discussion of other possible metrics and the potential role of propensities.

Solutions

Expert Solution

There are two types of misclassification:

Null hypothesis: bill not passed

Alternate hypothesis: the bill passed

1) Type 1 error: It is a condition when the bill is not passed actually but our system predicts that the bill is passed

2) Type 2 error: It is a condition when the bill is passed actually but our system predicts that bill is not passed

Using accuracy only as a measure will not capture both these two types of error. However, accuracy is a good measure only for the balanced problem( where the number of positive and negative classes are the same). But here we our data do not guarantee for the balanced problem. so other metrics such as below should be used

a) sensitivity

b) specificity

c) precision

There may be certain cases where one of the misclassifications plays a major role than the other. For example in court punishing an innocent suspect is harsher then giving a free pass to a criminal. In such cases, we need to frame the problems and pick the right metrics out of listed above in a,b,c


Related Solutions

Why was Washington DC built how and when it was? Who really built Washington DC?
Why was Washington DC built how and when it was? Who really built Washington DC?
Prototyping an Event-Driven IT Application DC Medical Supply, located in Washington, DC is in the industry...
Prototyping an Event-Driven IT Application DC Medical Supply, located in Washington, DC is in the industry of medical device sales. Its market share is small compared to the industry leaders, but it hopes to increase its share by becoming more responsive to customer needs. DC Medical Supply would like your help documenting their sales process using an event driven ERP. The following is a description of the sales/collection process of DC Medical Supply. When customers need to order medical devices,...
Let us consider the case of John, an entrepreneur and the CEO of a startup, named...
Let us consider the case of John, an entrepreneur and the CEO of a startup, named “Home Service”. The company started a small scale service where a couple of signed-up workers of the company were giving various services to the dweller of Ballarat, a city in Victoria. The services include plumbing, electric works, gas appliances’ works, and car wash. The people are Ballarat dwellers (roughly 20k in total) needed to download the software from App Store/Google Store and try to...
As an entrepreneur startup you seek to lure expensive talent to work for you as the...
As an entrepreneur startup you seek to lure expensive talent to work for you as the business starts and begins the growth phase. You really cannot afford many of these developers and other C-suite executives. What is the most attractive type of preferred stock along with their salaries to offer these potential highly talented employees and why? (4pts).
You are a federal prosecutor in Washington, DC, assigned to the following case: Robert is a...
You are a federal prosecutor in Washington, DC, assigned to the following case: Robert is a political commentator who actively criticizes the current administration on Twitter. Robert also publicly has discussed that he suffers from epilepsy, a condition that may cause seizures under certain conditions, for example exposure to flashing lights. In late 2017, Robert received a gif file which he opened. The file contained a blinking image of a strobe light, which cause Robert to have an epileptic seizure....
You are currently working as a Senior Economist for the Congressional Budget Office in Washington DC...
You are currently working as a Senior Economist for the Congressional Budget Office in Washington DC making $108,000 per year. Your lifelong ambition, however, has been to open your own cupcake store. You decide to quit your job as an economist to open your dream store near the River Walk in San Antonio. You estimate that you will be able to sell 10,000 cupcakes per month at a price of $3.40 per cupcake. You will have to pay monthly rent...
An agent for a residential real estate company in a suburb located outside of Washington, DC,...
An agent for a residential real estate company in a suburb located outside of Washington, DC, has the business objective of developing more accurate estimates of the monthly rental cost for apartments. Toward that goal, the agent would like to use the size of an apartment, as defined by square footage to predict the monthly rental cost. The agent selects a sample of 48 one-bedroom apartments and collects and stores the data here. What are the values for (1) the...
You are currently working as a Senior Economist for the Congressional Budget Office in Washington DC...
You are currently working as a Senior Economist for the Congressional Budget Office in Washington DC making $108,000 per year. Your lifelong ambition, however, has been to open your own cupcake store. You decide to quit your job as an economist to open your dream store near the River Walk in San Antonio. You estimate that you will be able to sell 10,000 cupcakes per month at a price of $3.40 per cupcake. You will have to pay monthly rent...
A real estate study was conducted in the Washington, DC, area to determine if certain variables...
A real estate study was conducted in the Washington, DC, area to determine if certain variables are related to the sales price of a townhouse. As part of his investigation, he ran the following multiple regression model: Sales Price = β0 + β1(Square Feet) + β2(Distance) + εi where the deviations εi were assumed to be independent and normally distributed with mean 0 and standard deviation σ. The two explanatory variables are the square footage of the townhouse and the...
The National Institutes of Health (NIH) is studying the air quality index of Washington DC. They...
The National Institutes of Health (NIH) is studying the air quality index of Washington DC. They are testing If the NIH wishes to test a two-sided alternative, write out their H1
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT