Question

In: Computer Science

1.List the 3 categories of Data Management and 3 Layers of Data Architecture Explain the link...

1.List the 3 categories of Data Management and 3 Layers of Data Architecture

Explain the link between each of them, you can include drawings of Data Integration Flow to show that.

2. List the 5 categories of requirements for a BI project?

Provide 1 example of each requirement category for a BI system for any business.

3. Discuss the differences between structured and Unstructured Data.

Provide 3 examples of each type of data?

4.

Write R commands for below queries, assume the data is in file named input.csv.

Also explain your answer

id	name	salary	start_date	dept
1	Rick	623.3	1/01/2012	IT
2	Dan	515.2	23/09/2013	Operations
3	Michelle	611	15/11/2014	IT
4	Ryan	729	11/05/2014	HR
5	Gary	843.25	27/03/2015	Finance
6	Nina	578	21/05/2013	IT
7	Simon	632.8	30/07/2013	Operations
8	Guru	722.5	17/06/2014	Finance

Open the file and Get the max salary from data frame
Get the detail of person having max salary
Get all the people working in IT department
Get the persons in IT department whose salary is greater than 600

Write down the output of the following R commands. If not data is provided you can use any example data. Explain what does the command do?

category <- 'A'

price <- 10

if (category =='A'){

cat('A vat rate of 8% is applied.','The total price is',price *1.08)

} else{

cat('A vat rate of 10% is applied.','The total price is',price *1.10)

}

a = c(5,7,2,9)

ifelse(a %% 2 == 0,"even","odd")

Expert Solution

1. Data Management:

A database is a collection of data or records. Database management systems are designed to manage databases. A database management system (DBMS) is a software system that uses a standard method to store and organize data. The data can be added, updated, deleted, or traversed using various algorithms and queries by SQL.

Types of Database Management Systems:

There are several types of database management systems. Here is a list of seven common database management systems:

Hierarchical databases
Network databases
Relational databases
Object-oriented databases
NoSQL databases

Database architecture

Database architecture is an extension of the 2-tier architecture. 3-tier architecture has following layers

Presentation layer (your PC, Tablet, Mobile, etc.)
Application layer (server)
Data layer

Presentation Tier- The presentation tier is the front end layer in the 3-tier system and consists of the user interface. This user interface is often a graphical one accessible through a web browser or web-based application and which displays content and information useful to an end user. This tier is often built on web technologies such as HTML5, JavaScript, CSS, or through other popular web development frameworks, and communicates with others layers through API calls.
Application Tier- The application tier contains the functional business logic which drives an application’s core capabilities. It’s often written in Java, .NET, C#, Python, C++, etc.
Data Tier- The data tier comprises of the database/data storage system and data access layer. Examples of such systems are MySQL, Oracle, PostgreSQL, Microsoft SQL Server, MongoDB, etc. Data is accessed by the application layer via API calls.
The goal of Three-teir architecture is:
To separate the user applications and physical database
Proposed to support DBMS characteristics
Program-data independence
Support of multiple views of the data
Example of Three-teir Architecture is Any large website on the internet
2.BI REQUIREMENTS:
1.Functional Requirements:
Some functionalities, like projects or workspaces, help teams or departments work more effectively, together or apart. Collaboration tools such as messaging, comment threads, email or Slack integrations make it easy to start important conversations and keep them going.

Globalization Support
Projects or Workspaces
Collaboration and Information Sharing
Decentralized Analytics Environment
Write to Transactional Applications
2.Dashboarding and Data Visualization
Dashboards are a staple of business intelligence frankly because they work: they reveal the underlying value of data in a format that people can look at and understand in seconds. It’s no surprise then that data visualization is one of the most important requirements of BI software; by translating insights into a visual medium, data visualization turns complex results into easily understandable conclusions for the user to interpret, customize and share with others.
Dashboards
Storyboarding
Interactive Data Visualizations
Filtering
Drill-Down and Drill-Up Capabilities
Auto-Charting
Geospatial Visualizations and Maps
Animations
Advanced Visualizations using Python and R
Auto-refresh and Real-Time Updates
Pre-Built Templates
Web Accessibility and Embeddability
3.Data Source Connectivity:
you will be able to import all your data into the platform, whether it lives in Excel files, a cloud storage system, an on-premises server – or a combination of all of the above. Doing so ensures that your BI tool will deliver full visibility into all your operations and processes.

Standard Files (i.e. Excel, CSV, XML, JSON, PDF and more)
Statistical Files
Relational and NoSQL Databases
JDBC, ODBC and Parameterized Connections
Big Data Ecosystems
Enterprise BI and ERP Platforms
CRM, Customer Success and Marketing Platforms
E-Commerce and Accounting Platforms
Social Media, SEO and Web Analytics Platforms
Cloud File Storage Systems
Project Management and Enterprise Messaging Platforms
SFTP/FTP Support
4.Data Management

help users prepare, collect and organize data to ensure greater visibility and more accurate results overall.

Data Exploration
Data Modeling
Data Preparation
Data Blending
Extract, Transform, Load (ETL) Tool
Metadata Management and Data Catalog
OLAP and Multi-Dimensional Analysis
Data Governance
Advanced Data Preparation using Python and R
5.Data Querying

A query is a request for data written in a special syntax, often Structured Query Language (SQL), from a database that extracts information and formats it for consumption and analysis. Data querying can perform calculations, automate tasks or dig deeper through data mining, which uncovers hidden trends and relationships between data points. Though more specialized for the fields of data science and big data than business intelligence specifically, it is certainly a feature you can consider depending on your business needs.

Query Multiple Data Sources
Complex Queries
Scheduled Queries
Readable and Modifiable SQL
Multi-pass SQL
Batch Updates
Visual Querying
In-Memory Analysis
Live Connection
3.Structured data vs unstructured data:
Structured data is highly specific and is stored in a predefined format, where unstructured data is a conglomeration of many varied types of data that are stored in their native formats. This means that structured data takes advantage of schema-on-write and unstructured data employs schema-on-read.

Structured data is commonly stored in data warehouses and unstructured data is stored in data lakes. Both have cloud-use potential, but structured data allows for less storage space and unstructured data requires more.

The last difference could potentially have the most impact.
Unstructured data is most often categorized as qualitative data, and it cannot be processed and analyzed using conventional tools and methods.

Examples of unstructured data include text, video, audio, mobile activity, social media activity, satellite imagery, surveillance imagery – the list goes on and on.

Unstructured data is difficult to deconstruct because it has no pre-defined model, meaning it cannot be organized in relational databases. Instead, non-relational, or NoSQL databases, are best fit for managing unstructured data.

Another way to manage unstructured data is to have it flow into a data lake, allowing it to be in its raw, unstructured format.
4.queries:

i.

[Create a data frame]
data <- read.csv("input.csv")

[ Get the max salary from data frame.]
sal <- max(data$salary)
print(sal)
output:

843.25

ii.

[Create a data frame.]
data <- read.csv("input.csv")

[ Get the max salary from data frame.]
sal <- max(data$salary)

[ Get the person detail having max salary.]
retval <- subset(data, salary == max(salary))
print(retval)
When we execute the above code, it produces the following result −

id name salary start_date dept
5 NA Gary 843.25 2015-03-27 Finance
iii. Get all the people working in IT department

[ Create a data frame.]
data <- read.csv("input.csv")

retval <- subset( data, dept == "IT")
print(retval)
When we execute the above code, it produces the following result −

id name salary start_date dept
1 1 Rick 623.3 2012-01-01 IT
3 3 Michelle 611.0 2014-11-15 IT
6 6 Nina 578.0 2013-05-21 IT
iv.Get the persons in IT department whose salary is greater than 600

[ Create a data frame.]
data <- read.csv("input.csv")

info <- subset(data, salary > 600 & dept == "IT")
print(info)
When we execute the above code, it produces the following result −

id name salary start_date dept
1 1 Rick 623.3 2012-01-01 IT
3 3 Michelle 611.0 2014-11-15 IT

venereology answered 1 year ago

Answer the following questions: 1. List and explain the 3 layers of meninges? 2. Which parts...

Answer the following questions: 1. List and explain the 3 layers of meninges? 2. Which parts of the brain would work together to achieve the following: A). Good eye-hand coordination B) Concentrating on homework when TV is playing C) Avoiding dark alleys while walking home at night D) Keeping blood pressure constant 3. Write one (1) function of the parts of the brain: A) Cerebrum B) Cerebellum C) Thalamus D) Hypothalamus E) Midbrain F) Pons G) Medulla Oblongata 4) Which...

List and explain the racial and ethnic categories created by the Office of Management and Budget....

List and explain the racial and ethnic categories created by the Office of Management and Budget. How does cultural play a role in a person’s behaviors (including risk behaviors) and beliefs about health.

INTERNATIONAL Human Resource Management: List and explain five categories of external risk assessment which need to...

INTERNATIONAL Human Resource Management: List and explain five categories of external risk assessment which need to be addressed by a multinational enterprise?

a) List 3 characteristics that can different broiler breeders from normal layers

Define the following terms Caponization Ovulation Egg sequence Clutch First in sequence and terminal laid in sequence eggs a) List 3 characteristics that can different broiler breeders from normal layers b) Indicate 5 problems associated with broiler breeder reproduction as result of poormanagement c) Suggest 2 management practices to reduce reproductive failure in broiler breeders a) Draw a normal poultry sperm and label parts appropriately. Indicate clearly the tail,mitochondria, nucleus and acrosome regions b) Briefly describe the concept behind...

1. Based on your understanding of the innate immune system list 3 broad categories (and AT...

1. Based on your understanding of the innate immune system list 3 broad categories (and AT LEAST ONE specific example in each category) that constitute the first line of defense mechanisms by the innate immune system. (Give an approximate time line for this type of defense) 2. If this defense system is breached describe the process that follows (second line of innate defense). Explain how pathogens are detected and dealt with at this time with specific pathogen groups in mind,...

List and explain 3 characteristics of an entrepreneurship? subject. small business management

1. A. List the components of each of the three layers that make up a typical...

1. A. List the components of each of the three layers that make up a typical arterial blood vessel. B. Describe the differences in the middle layer structure between elastic and muscular arteries plus arterioles. C. Describe the two pathways leading to the formation of a clot and the factors that initiate each of these pathways. 2. A. List the four different layers or tunics associated with the digestive tract from the esophagus to the rectum and the components found...

IN JAVA Objectives Practice Link list, List node, compareTo, user defined data type, interfaces Movie List...

IN JAVA Objectives Practice Link list, List node, compareTo, user defined data type, interfaces Movie List Create a link list of the movies along with the rating, number of the people who watched the movie and the genre of the movie. Required classes Movie class ListNode class MovieList class List interface Driver class Movie class implements Comparable Attributes: movie’s name, genre, rating, number of people watched Methods: constructor, getter, setter, equals, compreTo, toString ListNode class Attributes: each node has two...

. Describe (in detail) the Spaulding Classification system and list the 3 device categories.

Explain what is data and list the different types of data? List and explain the different...

Explain what is data and list the different types of data? List and explain the different methods to collect data.