Question

In: Computer Science

You are the data analyst on the project team building a data warehouse for an insurance...

You are the data analyst on the project team building a data warehouse for an insurance company. List the possible data sources from which you will bring the data into your data warehouse. State your assumptions. Support with research.

Solutions

Expert Solution

Before thinking of harvesting the data and looking for the source we should think of what kind of data we are looking at and how it is going to benefit us

assumption: i am assuming the company  to be a general insurance company , but while defining the data i will give examples of the specific types of the companies relying on those type of data , the stats i will be providing are all from researches

first of all lets look at how data can help us . Data can help us in taking various decision .

  • assessing the risk : for eg we can analyse whether a person buying a car insurance is prone to accident risk base on his driving decision , or whether a person buying health insurance is prone to Arrhythmia or not
  • detecting fraud : so we can analyse previous harvested records of people who have taken insurance to detect a fraud pattern
  • customers behaviour and their Insight : these help in shaping customer policy and decision making , we know their behaviour and the basis of their decision .
  • Marketing : once we know the customer behaviour and their segmentation we can use that to target our product and services , based on their behaviour , products can also be personalised  for example targeted ads on facebook(Trattner and Kappe)

now we know what kind of data we need now we will see their sources

lets say we are a health insurance based company

  1. assessing the risk : first of all for accessing the risk we will need clients medical record , this can be taken either from the client or from Medical association . we can also use fitbit devices to take physical activity reading from customers . Use of data in risk analysis is going to grow to 77% by 2021
  2. detecting fraud: for detecting fraud we will need fraudulent data from various other insurance companies and use those data to determine whether a customer is fraudulent or not , we can also use the social network data of those fraud people to determine risk using statistical measure . these researches are based on fraud detection .Identify the signs of fraudulent accounts and the patterns of fraudulent transactions(Quah and Sriganesh), Identification of fraudulent financial statement (Kirkos)
  3. customers behaviour and their Insight : for this we can have an app on customer device through which we can take permission and accumulate user data to provide him with personalised experience , this data can include user gps log or his social network activity and etc but this should not affect users privacy . Assess disease outbreaks from tweetsAssess disease outbreaks from tweets(Bodnar and Salathé) , Detect public health events(Fisichella)
  4. this is also done through taking customerpersonal data over their choices and preferences , this data can be taken through application or web interface to provide them personalised  experiences (Viral marketing in social networks mar at el)
  5. other than this we can also take data from private sectors and academic researchers . we can also take data that are harvested by government.

all his data can be imported in data hive or a database through various channel some of which can be IOT based like fitbit some of can be web and app interface by using firebase db and some can be direct interactive device based .we can even use the old data stored by the company by importing it into our data base and digitalizing it


Related Solutions

Assume you are a Data Analyst in an international economic consultancy firm. Your team leader has...
Assume you are a Data Analyst in an international economic consultancy firm. Your team leader has given you a research task to investigate the empirical relationship between China’s export volumes and per capita GDP (Gross Domestic Product). Relevant Variables: China’s Export volume index and China’s GDP per capita (constant 2010 US$). (Annual time series data (for the period 1980 – 2018) from the World Bank - World development indicators database) The data are stored in the file named “ASSIGNMENTDATA.XLSX” in...
Assume that you are part of a development team that is working on a new warehouse...
Assume that you are part of a development team that is working on a new warehouse management system. You have the task of investigating software packages that are available through ASPs. Using the World Wide Web, identify at least two potential sources of such software. What are the pros and cons of this approach to obtaining a software package? Write in complete sentences. The answer may not exceed 100 words.
You are a member in a project team and the project manager asked you to purchase...
You are a member in a project team and the project manager asked you to purchase 4 laptops with certain specifications (Lenovo ThinkPad T series, Core i7, 2 Tera Storage, 32 GB Ram, 15.6" size, NVIDIA® graphics). How do you set your procurement management to do that?
you are a member in a project team and the project manager asked you to purchase...
you are a member in a project team and the project manager asked you to purchase 4 laptops with certain specifications (lenovo thinkpad T series,core i7,2 tera storage ,32 GB Ram ,15.6 size , NVIDIA graphics ) how do you set your procurement management to do that?
A Project plan has been developed by the project team. The scheduled data are given in...
A Project plan has been developed by the project team. The scheduled data are given in the following table Activity predecessor Normal time Crash Time Normal cost Crash COST COST SLOPE A - 12 7 3000 5000 400 B A 8 5 2000 3500 500 C A 4 3 4000 7000 3000 D B,C 12 9 50000 71000 7000 E B,C 4 1 500 1100 200 F E 4 1 500 1100 200 G D,F 4 3 15000 22000 7000...
You are part of a team that is evaluating the feasibility of building a standardized nursing...
You are part of a team that is evaluating the feasibility of building a standardized nursing language into the electronic medical record that will be used by your department. Your group is reviewing the 12 ANA recognized terminologies. The goal is to make recommendations about one or more terminologies that could be built into EMR for use by nurses. Choose one of the 12 ANA recognized terminologies to evaluate (read materials for this unit and visit the website of the...
You are a project team manager, and your team members report each day to you to...
You are a project team manager, and your team members report each day to you to receive their primary assignments.   Not every team member is as efficient as another with particular kinds of tasks.    Time required (hours) to complete tasks Task Task complexity Team member 1 - Jones Team member 2 - Nguyen Team member 3 - Walpita Team member 4 - Manderas Task A Very high 3 5 4 3 Task A High 2 1 3 2 Task...
You have been hired as an analyst for Melvin Bank and your team is working on...
You have been hired as an analyst for Melvin Bank and your team is working on an independent assessment of TWINKY, which is a firm that specializes in the production and distribution of ice and glass products in Sweden. Your assistant has provided you with the following data about the company and its industry. You analysis should include intra-company, inter-company, and industry benchmark comparisons. What can you say about the firm's overall management in terms of the following? (Be as...
You have been hired as an analyst for Bank WA and your team is working on...
You have been hired as an analyst for Bank WA and your team is working on an independent assessment of a firm that specializes in the production of freshly imported farm products from New Zealand. Your assistant has provided you with the following data about the company and its industry. Ratio 2019 2018 2017 2019- Industry Average Long-term debt 0.45 0.40 0.35 0.35 Inventory Turnover 62.65 42.42 32.25 53.25 Depreciation/Total Assets 0.25 0.014 0.018 0.015 Days’ sales in receivables 113...
1. You have been hired as an analyst for an advisory company and your team is...
1. You have been hired as an analyst for an advisory company and your team is working on an independent assessment of G-Aviation. G-Aviation is a firm that specializes in the production of aviation material. Your assistant has provided you with the following data for G-Aviation and their industry. Ratio 2019 2018 2017 2019- Industry Average Long-term debt 0.45 0.40 0.35 0.35 Inventory Turnover 62.65 42.42 32.25 53.25 Depreciation/Total Assets 0.25 0.014 0.018 0.015 Days’ sales in receivables 113 98...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT