Question

In: Computer Science

1. You're processing data to be uploaded into a database. In what stage of data preprocessing...

1. You're processing data to be uploaded into a database. In what stage of data preprocessing would you deal with fields where there is no data, or data is missing? A.Data Consolidation B.Data Cleaning C. Data Transformation D. Data Reduction

2. You're company, Outdoor Excursions just acquired another local tour company, Excursions Inc. and you've been tasked with merging Excursions Inc's database with yours. During datapreprocessing, you encounter inconsistencies in the "marital status" column. Some values indicate "married", "single", or "widowed", others are represented by their first letters "m", "s", and "w", and others are left blank. What are some ways you could deal with that data? Select all that apply. Delete the column Recode the existing values Do Nothing Fill in missing values All of the above

3. Explain the term "model fitting". Why is it important in data mining and machine learning?

Solutions

Expert Solution

1.) Choice(B) is correct, i.e., data cleaning. In data cleaning, the raw is scanned for all incorrect,irrelevant, null data, missing data and it is handles accordingly.

2.) Recoding the existing data such as where m or f or w is written, writing code to replace those characters with male, female or widow would match attribute format correctly. It will ensure the process of handling the data of the attribute would become easier.

3.) Model fitting is a measuring factor which tracks how perfectly a machine learning model can be able to generalise and interpret the data which is similar to the data used in training. A good model fitting is determined when output is accurate when some unseen and untrained(data not used in training ML) data is passed as input. Fitting is adjusting the parameter so to improve the accuracy of determining the output with unseen input. This together helps to used data mine in an optimized way which in turn improves machine learning.


Related Solutions

Need answers for Normalization, Physical Design, Sql, And Security exam. 1. The database you're creating will...
Need answers for Normalization, Physical Design, Sql, And Security exam. 1. The database you're creating will be installed on a group of three servers. What feature of an enterprise RDBMS will allow one server to pick up the processing work if the main server becomes nonoperational? A. Failover B. Business intelligence C. Data warehouse D. Load balancing 2. What type of clause must you always use with DELETE or UPDATE to avoid inadvertently changing data elsewhere in the database? A....
1. Explain the purpose of resource pooling in regards to cloud computing. 2.When data is uploaded...
1. Explain the purpose of resource pooling in regards to cloud computing. 2.When data is uploaded to a cloud service provider's infrastructure, it is often said the organization loses physical control over their data. Explain how this is not entirely true. 3. Select the cloud technology that best fits the following scenario. A VM is allocated 8GB of memory and averages 3 to 4GB of memory utilization. Another VM is started and needs an additional 4GB of memory. The hypervisor...
You're caring for a mother in the transition stage of labor when she begins crying and...
You're caring for a mother in the transition stage of labor when she begins crying and says, “It hurts so much. I don’t know if I can take this anymore.” As you were admitting her, when contractions were less frequent and intense, the client told you it was very important to her that she deliver the baby without taking any pain medication and that she would feel like a failure if she gave in and took a narcotic. Will you...
1. for 1 molecule of glucose (6 C-atoms), the stage of pyruvate processing generates NADH, CO2...
1. for 1 molecule of glucose (6 C-atoms), the stage of pyruvate processing generates NADH, CO2 and Acetyl CoA ATP, H+, oxaloacetate 2NADH, 2CO2 and 2Acetyl CoA 2ATP, 2H+, 2 oxaloacetate 2. how many reduced electron carriers after glycolysis, pyruvate processing and citric acid cycle are available to make the ET work 5 NADH, 1 FADH2 4 ATP, 5NADH, 1 FADH2 4 ATP, 10NADH, 2 FADH2 10 NADH, 2 FADH2
What are the similarities and differences between database, data warehouse, and data mining?
What are the similarities and differences between database, data warehouse, and data mining?
1- How can database systems improve data quality and data integrity? 2- Discuss database constraints: Primary...
1- How can database systems improve data quality and data integrity? 2- Discuss database constraints: Primary key, check, and referential integrity constraints? Give an example for each.
Define data processing and explain the steps to be followed for data processing
Define data processing and explain the steps to be followed for data processing
Database design is the process of producing a detailed data model of database. This data model...
Database design is the process of producing a detailed data model of database. This data model contains all the needed logical and physical design choices and physical storage parameters needed to generate a design in a data definition language, which can then be used to create a database. (Wikipedia). Using a diagram/chart software, elaborate a database design Requirements: Define your database objective Explain your database's type of table relationship Explain and design your database elements and datatypes (tables, fields, etc,)....
Use the uploaded price data to answer the questions that follow. Assume that the S&P 500...
Use the uploaded price data to answer the questions that follow. Assume that the S&P 500 is the proxy for the market portfolio. • A: Price data for the American Funds Growth Fund of America (ticker symbol AGTHX) and S&P 500 is given in the Excel Upload file. Each month, calculate the excess returns for both AGTHX and the S&P 500. • B: Using the excess returns, estimate the fund’s alpha and beta (assume that the CAPM is the appropriate...
What is a Data Dictionary? What is a Database Engine? What is a Query Processor/Analyzer? What...
What is a Data Dictionary? What is a Database Engine? What is a Query Processor/Analyzer? What is a Forms Generator? What is a Reports writer? What is a DBMS? What is the difference between DB and DBMS?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT