Question

In: Economics

1-Why would Zillow use a data lake? 2-Explain dirty data and its impact on the business?

1-Why would Zillow use a data lake?
2-Explain dirty data and its impact on the business?

Solutions

Expert Solution

1. To better reduce prices, Zillow leverages OCR technologies in its ingestion method. The framework also enhances the user interface since the data can be accessed quicker.

At Zillow, ensuring data accuracy is a significant subject, public records information arrives in several different formats, and the organisation hires a data scientist whose full-time role is to ensure data consistency. To check for variations in the number of sales purchases, Zillow uses pattern analysis. At the data field level, there are also tests, searching for listings that have, for instance, 30,000 bedrooms. Zillow also flags certain kinds of sales, such since foreclosures, as the Zestimate figures do not use these deals.

The technology framework at Zillow includes Apache Spark. For real-time scoring, the business often uses Redis and Python. For cloud computing, Zillow taps AWS S3 and relies on AWS Redshift and Presto for its warehouse of data. When looking at historical details, Zillow clearly turns to Presto. Beyond the Zestimate, Zillow also provides the viewers with other figures, such as a Turbo Zestimate and a classification for "hot homes" (which estimates how quickly a home can sell). Many of these estimates are based on a measure of Zillow's Zestimate.

Via personalization and quest, Zillow has also invested in anticipating the needs of its customer users. Based about how sparse the signals are for a single user, Zillow uses distinct kinds of user vectors.

2. Dirty data which is unreliable, incomplete or contradictory. Experian estimates that corporations around the world believe that 26 percent of their data is polluted on average. This leads to tremendous damages. It currently costs the average corporation 15 to 25 percent of its income, and the US economy more than $3 trillion a year. Anybody who has had to work with dirty data knows how irritating it can be, but it can be hard to get your mind around the effect untilthe numbers are added up. It is important to consider where it comes from, how it impacts industry and how it can be dealt with, because dirty data costs too much, a sobering understatement.

Dirty data lacks integrity, which ensures that end-users who rely on that information waste extra time checking its authenticity, limiting efficiency and competitiveness further. Growing volumes of dirty documents contribute to further inaccuracies and mounting discrepancies by adding another manual method.

In addition to the lack of sales, filthy data effects corporations more insidiously. Just 16% of company executives trust the consistency that underlies their corporate decisions. When you can't count on your own records, more has to be done to improve records quality and reliability. Garbage in, garbage out.


Related Solutions

Explain dirty data and its impact on a business?
Explain dirty data and its impact on a business?
What would happen to Zillow if it experienced dirty data?
What would happen to Zillow if it experienced dirty data?
Select one business structure and how the use of that business structure would impact the financial...
Select one business structure and how the use of that business structure would impact the financial management of that particular company. Briefly summarize how this financial management approach would differ in the other business structures.
1)The methods used by business can impact the productivity of the factors of production. Explain why...
1)The methods used by business can impact the productivity of the factors of production. Explain why this should be a concern for anybody who earns an income. 2)What are the differences between the equity financing of business and debt financing? 3)Review industrial employment trends in your home state. What are the growing industries in your area? What areas are shrinking?
1. Hands dirty with data. In this problem, you will retrieve and manipulate macroeconomic data and...
1. Hands dirty with data. In this problem, you will retrieve and manipulate macroeconomic data and verify some relationships that inform macroeconomic theory. (a) Go the Federal Reserve Bank of St. Louis’ Federal Reserve Economic Data(FRED) website at https://fred.stlouisfed.org and download the following three data series in to an Excel file. You can then do the subsequent analysis using Excel, Stata, or other software of your choice: Real Gross Domestic Product (GDPC1), percent change from year ago, deviation from trend....
1. Why is application integration an important part of running an online business? 2. Why would...
1. Why is application integration an important part of running an online business? 2. Why would database management software be an important component of an online business Web site’s technology? 3. What is the key function of a content management system as used in an online business? 4. Name four types of information that might be useful inputs to a customer relationship management (CRM) system. 5.Briefly explain why mobile advertising is growing so rapidly. 6.Explain what online text ads are...
Why would someone use the Coase Theorem to minimize the impact of externalities?
Why would someone use the Coase Theorem to minimize the impact of externalities?
1) explain how the Covid would affect the major risks of banks and its potential impact...
1) explain how the Covid would affect the major risks of banks and its potential impact on the IRM of banks. 2) Banks are regulated in terms of capital adequacy. Briefly discuss how this requirement would help banks maintain safety during the Covid-19 pandemic.
1. Should Unilever's stockholders endorse its sustainability plan? Why or why not? 2. Are there business...
1. Should Unilever's stockholders endorse its sustainability plan? Why or why not? 2. Are there business advantages to using sustainable or green suppliers? If so,what are they? If not,do you think a traditional return on investment analysis captures all possible benefits of going green? 3. Are there any ethical criticisms of unilever's sustainable living strategy? If so,what are they?
provide a hypothetical situation involving “dirty data” and discuss how data pre-processing would address this issue
provide a hypothetical situation involving “dirty data” and discuss how data pre-processing would address this issue
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT