Question

In: Statistics and Probability

In a given year, we receive approximately 13 million unstructured text submissions and over 307,000 photos...

In a given year, we receive approximately 13 million unstructured text submissions and over 307,000 photos and videos from about 167,000 diverse contributors, all of whom are answering open-ended questions posed by us, as well as generating their own conversations on topics of their choosing.

what type of statistical problems can I use for this summary?

Solutions

Expert Solution

The proliferation of textual data in business is overwhelming. Unstructured textual data is being constantly generated via call center logs, emails, documents on the web, blogs, tweets, vidoes, customer reviews, and so on. While the amount of textual data is increasing rapidly, businesses’ ability to summarize, understand, and make sense of such data for making better business decisions using statisitcal remain challenging.

The basic premise to use text data in predictive models is that the terms contained within the text data can potentially represent the customer’s experiences (bad or good) which are supposedly consistent with the customer’s decision to continue with the business or churn in the nearest future. Hence the potential of mining text data in such applications cannot be undermined. Text data is first transformed into a set of numerical components called Singular Value Decomposition (SVD) units which collectively represent the text documents. These units are then used as additional inputs along with the existing structured input attributes to help improving the predictive power of the existing models.

Sentiment Analysis

An interesting and important goal of analyzing unstructured data such as customer complaints, issues, opinions or comments is to get a grasp on what they perceive about an entity. An entity can be a company’s brand image, product, service, person, group or an organization. Are consumers’ perceptions good, bad or neutral? What attributes (features) of the product or service they feel good or bad about? What do the customers think of the various attributes of a company’s product such as quality, price, durability, safety, ease of use? Typically, if customer feels good towards an entity, it is classified as a positive sentiment. If the perception towards the entity is bad, it can be considered as negative sentiment. A third kind of perception in which customer has neither good nor bad opinion implies a neutral sentiment. Social media sites such as Twitter and Facebook contains enormous volumes of customer opinions and comments on virtually all major organizations, events and products. This creates an unprecedented opportunity to mine text data in real-time to and analyze sentiment trends fluctuations over a period of time.


Related Solutions

In a given year, we receive approximately 13 million unstructured text submissions and over 307,000 photos...
In a given year, we receive approximately 13 million unstructured text submissions and over 307,000 photos and videos from about 167,000 diverse contributors, all of whom are answering open-ended questions posed by us, as well as generating their own conversations on topics of their choosing. what formula in Statistics works for this statement?
Question text Conroe Ltd expects to receive EUR 1 million in 6 months’ time. The following...
Question text Conroe Ltd expects to receive EUR 1 million in 6 months’ time. The following product rates are available: Spot is currently 0.5400 EUR /NZD 6-month forward rates are available at 0.5250/0.5370 EUR/NZD 6-month borrowing/investing rate for the company is 6% p.a. in NZD and 12% p.a. in EUR Assume the spot rate turns out to be 0.5200 EUR/NZD in 6-months If a money market hedge is used, what NZD amount is received in 6-months? Select one: a. 1,905,789...
Rita Gonzales won the $65 million lottery. She is to receive $3 million a year for...
Rita Gonzales won the $65 million lottery. She is to receive $3 million a year for the next 20 years plus an additional lump sum payment of $5 million after 20 years. The discount rate is 6 percent. What is the current value of her winnings? Use Appendix B and Appendix D for an approximate answer, but calculate your final answer using the formula and financial calculator methods.(Do not round intermediate calculations. Round your final answer to 2 decimal places.)
Rita Gonzales won the $38 million lottery. She is to receive $1.1 million a year for...
Rita Gonzales won the $38 million lottery. She is to receive $1.1 million a year for the next 25 years plus an additional lump sum payment of $10.5 million after 25 years. The discount rate is 12 percent. What is the current value of her winnings?
Rita Gonzales won the $62 million lottery. She is to receive $1.9 million a year for...
Rita Gonzales won the $62 million lottery. She is to receive $1.9 million a year for the next 25 years plus an additional lump sum payment of $14.5 million after 25 years. The discount rate is 13 percent. What is the current value of her winnings? Use Appendix B and Appendix D for an approximate answer, but calculate your final answer using the formula and financial calculator methods.(Do not round intermediate calculations. Round your final answer to 2 decimal places.)
Rita Gonzales won the $44 million lottery. She is to receive $1.3 million a year for...
Rita Gonzales won the $44 million lottery. She is to receive $1.3 million a year for the next 25 years plus an additional lump sum payment of $11.5 million after 25 years. The discount rate is 18 percent.
My friend owns a small old house that is worth approximately $1.1 million. Given the improved...
My friend owns a small old house that is worth approximately $1.1 million. Given the improved real estate market, my friend is considering that over the next three years, she would have the option of tearing down this small old house and build a more expensive house. Her research suggests that the current cost of tearing down the old house and building a new more expensive will be approximately $800,000 and that she should assume that the expected cost would...
13 50% of students entering four-year colleges receive a degree within six years. Is this percent...
13 50% of students entering four-year colleges receive a degree within six years. Is this percent different from for students who play intramural sports? 146 of the 256 students who played intramural sports received a degree within six years. What can be concluded at the level of significance of αα = 0.05? For this study, we should use Select an answer z-test for a population proportion t-test for a population mean The null and alternative hypotheses would be: Ho: ?...
In my physics class, we are going over the Doppler Effect. We were given four different...
In my physics class, we are going over the Doppler Effect. We were given four different equations for two different scenarios: - When the observer is stationary but the source is moving - When the observer is moving but the source is stationary I was given a problem where both the observer AND the source are moving at some velocity. The source is emitting a given frequency while traveling at a given velocity. The observer is OBSERVING that frequency at...
In the year 2002, approximately 22.5% of the U.S. population were currently smokers. We wonder if...
In the year 2002, approximately 22.5% of the U.S. population were currently smokers. We wonder if in Tennessee the proportion of current smokers was also 0.225 or if it was higher than that. Assume that in that year, a survey was conducted in Tennessee, and that a simple random sample of 2466 individuals were selected. Out of the 2466, 611 were classified as ’current smokers’. We wish to conduct the test of hypothesis. (a) State the appropriate null and alternative...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT