Question

In: Computer Science

1.List the 3 categories of Data Management and 3 Layers of Data Architecture Explain the link...

1.List the 3 categories of Data Management and 3 Layers of Data Architecture

Explain the link between each of them, you can include drawings of Data Integration Flow to show that.

2. List the 5 categories of requirements for a BI project?

Provide 1 example of each requirement category for a BI system for any business.

3. Discuss the differences between structured and Unstructured Data.

Provide 3 examples of each type of data?

4.

  1. Write R commands for below queries, assume the data is in file named input.csv.

Also explain your answer

id

name

salary

start_date

dept

1

Rick

623.3

1/01/2012

IT

2

Dan

515.2

23/09/2013

Operations

3

Michelle

611

15/11/2014

IT

4

Ryan

729

11/05/2014

HR

5

Gary

843.25

27/03/2015

Finance

6

Nina

578

21/05/2013

IT

7

Simon

632.8

30/07/2013

Operations

8

Guru

722.5

17/06/2014

Finance

  1. Open the file and Get the max salary from data frame
  2. Get the detail of person having max salary
  3. Get all the people working in IT department   
  4. Get the persons in IT department whose salary is greater than 600           

  1. Write down the output of the following R commands. If not data is provided you can use any example data. Explain what does the command do?

  1. category <- 'A'   

price <- 10

if (category =='A'){

    cat('A vat rate of 8% is applied.','The total price is',price *1.08)

} else{

      cat('A vat rate of 10% is applied.','The total price is',price *1.10)

}

  1. a = c(5,7,2,9)   

ifelse(a %% 2 == 0,"even","odd")

Solutions

Expert Solution

1. Data Management:

A database is a collection of data or records. Database management systems are designed to manage databases. A database management system (DBMS) is a software system that uses a standard method to store and organize data. The data can be added, updated, deleted, or traversed using various algorithms and queries by SQL.

Types of Database Management Systems:

There are several types of database management systems. Here is a list of seven common database management systems:

  1. Hierarchical databases
  2. Network databases
  3. Relational databases
  4. Object-oriented databases
  5. NoSQL databases

Database architecture

Database architecture is an extension of the 2-tier architecture. 3-tier architecture has following layers

  1. Presentation layer (your PC, Tablet, Mobile, etc.)
  2. Application layer (server)
  3. Data layer
  • Presentation Tier- The presentation tier is the front end layer in the 3-tier system and consists of the user interface. This user interface is often a graphical one accessible through a web browser or web-based application and which displays content and information useful to an end user. This tier is often built on web technologies such as HTML5, JavaScript, CSS, or through other popular web development frameworks, and communicates with others layers through API calls.
  • Application Tier- The application tier contains the functional business logic which drives an application’s core capabilities. It’s often written in Java, .NET, C#, Python, C++, etc.
  • Data Tier- The data tier comprises of the database/data storage system and data access layer. Examples of such systems are MySQL, Oracle, PostgreSQL, Microsoft SQL Server, MongoDB, etc. Data is accessed by the application layer via API calls.
  • The goal of Three-teir architecture is:

  • To separate the user applications and physical database
  • Proposed to support DBMS characteristics
  • Program-data independence
  • Support of multiple views of the data
  • Example of Three-teir Architecture is Any large website on the internet

  • 2.BI REQUIREMENTS:

  • 1.Functional Requirements:

  • Some functionalities, like projects or workspaces, help teams or departments work more effectively, together or apart. Collaboration tools such as messaging, comment threads, email or Slack integrations make it easy to start important conversations and keep them going.

    Globalization Support
    Projects or Workspaces
    Collaboration and Information Sharing
    Decentralized Analytics Environment
    Write to Transactional Applications

  • 2.Dashboarding and Data Visualization

  • Dashboards are a staple of business intelligence frankly because they work: they reveal the underlying value of data in a format that people can look at and understand in seconds. It’s no surprise then that data visualization is one of the most important requirements of BI software; by translating insights into a visual medium, data visualization turns complex results into easily understandable conclusions for the user to interpret, customize and share with others.

  • Dashboards
    Storyboarding
    Interactive Data Visualizations
    Filtering
    Drill-Down and Drill-Up Capabilities
    Auto-Charting
    Geospatial Visualizations and Maps
    Animations
    Advanced Visualizations using Python and R
    Auto-refresh and Real-Time Updates
    Pre-Built Templates
    Web Accessibility and Embeddability

  • 3.Data Source Connectivity:

  • you will be able to import all your data into the platform, whether it lives in Excel files, a cloud storage system, an on-premises server – or a combination of all of the above. Doing so ensures that your BI tool will deliver full visibility into all your operations and processes.

    Standard Files (i.e. Excel, CSV, XML, JSON, PDF and more)
    Statistical Files
    Relational and NoSQL Databases
    JDBC, ODBC and  Parameterized Connections
    Big Data Ecosystems
    Enterprise BI and ERP Platforms
    CRM, Customer Success and Marketing Platforms
    E-Commerce and Accounting Platforms
    Social Media, SEO and Web Analytics Platforms
    Cloud File Storage Systems
    Project Management and Enterprise Messaging Platforms
    SFTP/FTP Support

  • 4.Data Management

    help users prepare, collect and organize data to ensure greater visibility and more accurate results overall.

    Data Exploration
    Data Modeling
    Data Preparation
    Data Blending
    Extract, Transform, Load (ETL) Tool
    Metadata Management and  Data Catalog
    OLAP and Multi-Dimensional Analysis
    Data Governance
    Advanced Data Preparation using Python and R

  • 5.Data Querying

    A query is a request for data written in a special syntax, often Structured Query Language (SQL), from a database that extracts information and formats it for consumption and analysis. Data querying can perform calculations, automate tasks or dig deeper through data mining, which uncovers hidden trends and relationships between data points. Though more specialized for the fields of data science and big data than business intelligence specifically, it is certainly a feature you can consider depending on your business needs.

    Query Multiple Data Sources
    Complex Queries
    Scheduled Queries
    Readable and Modifiable SQL
    Multi-pass SQL
    Batch Updates
    Visual Querying
    In-Memory Analysis
    Live Connection

  • 3.Structured data vs unstructured data:

  • Structured data is highly specific and is stored in a predefined format, where unstructured data is a conglomeration of many varied types of data that are stored in their native formats. This means that structured data takes advantage of schema-on-write and unstructured data employs schema-on-read.

    Structured data is commonly stored in data warehouses and unstructured data is stored in data lakes. Both have cloud-use potential, but structured data allows for less storage space and unstructured data requires more.

    The last difference could potentially have the most impact.

  • Unstructured data is most often categorized as qualitative data, and it cannot be processed and analyzed using conventional tools and methods.

    Examples of unstructured data include text, video, audio, mobile activity, social media activity, satellite imagery, surveillance imagery – the list goes on and on.

    Unstructured data is difficult to deconstruct because it has no pre-defined model, meaning it cannot be organized in relational databases. Instead, non-relational, or NoSQL databases, are best fit for managing unstructured data.

    Another way to manage unstructured data is to have it flow into a data lake, allowing it to be in its raw, unstructured format.

  • 4.queries:

    i.

    [Create a data frame]
    data <- read.csv("input.csv")

    [ Get the max salary from data frame.]
    sal <- max(data$salary)
    print(sal)
    output:

    843.25

    ii.

    [Create a data frame.]
    data <- read.csv("input.csv")

    [ Get the max salary from data frame.]
    sal <- max(data$salary)

    [ Get the person detail having max salary.]
    retval <- subset(data, salary == max(salary))
    print(retval)
    When we execute the above code, it produces the following result −

    id name salary start_date dept
    5 NA Gary 843.25 2015-03-27 Finance
    iii. Get all the people working in IT department

    [ Create a data frame.]
    data <- read.csv("input.csv")

    retval <- subset( data, dept == "IT")
    print(retval)
    When we execute the above code, it produces the following result −

    id name salary start_date dept
    1 1 Rick 623.3 2012-01-01 IT
    3 3 Michelle 611.0 2014-11-15 IT
    6 6 Nina 578.0 2013-05-21 IT
    iv.Get the persons in IT department whose salary is greater than 600

    [ Create a data frame.]
    data <- read.csv("input.csv")

    info <- subset(data, salary > 600 & dept == "IT")
    print(info)
    When we execute the above code, it produces the following result −

    id name salary start_date dept
    1 1 Rick 623.3 2012-01-01 IT
    3 3 Michelle 611.0 2014-11-15 IT



Related Solutions

Answer the following questions: 1. List and explain the 3 layers of meninges? 2. Which parts...
Answer the following questions: 1. List and explain the 3 layers of meninges? 2. Which parts of the brain would work together to achieve the following: A). Good eye-hand coordination B) Concentrating on homework when TV is playing C) Avoiding dark alleys while walking home at night D) Keeping blood pressure constant 3. Write one (1) function of the parts of the brain: A) Cerebrum B) Cerebellum C) Thalamus D) Hypothalamus E) Midbrain F) Pons G) Medulla Oblongata 4) Which...
List and explain the racial and ethnic categories created by the Office of Management and Budget....
List and explain the racial and ethnic categories created by the Office of Management and Budget. How does cultural play a role in a person’s behaviors (including risk behaviors) and beliefs about health.
INTERNATIONAL Human Resource Management: List and explain five categories of external risk assessment which need to...
INTERNATIONAL Human Resource Management: List and explain five categories of external risk assessment which need to be addressed by a multinational enterprise?
a) List 3 characteristics that can different broiler breeders from normal layers
  Define the following terms Caponization Ovulation Egg sequence Clutch First in sequence and terminal laid in sequence eggs a) List 3 characteristics that can different broiler breeders from normal layers b) Indicate 5 problems associated with broiler breeder reproduction as result of poormanagement c) Suggest 2 management practices to reduce reproductive failure in broiler breeders a) Draw a normal poultry sperm and label parts appropriately. Indicate clearly the tail,mitochondria, nucleus and acrosome regions b) Briefly describe the concept behind...
1.   Based on your understanding of the innate immune system list 3 broad categories (and AT...
1.   Based on your understanding of the innate immune system list 3 broad categories (and AT LEAST ONE specific example in each category) that constitute the first line of defense mechanisms by the innate immune system. (Give an approximate time line for this type of defense) 2.   If this defense system is breached describe the process that follows (second line of innate defense). Explain how pathogens are detected and dealt with at this time with specific pathogen groups in mind,...
List and explain 3 characteristics of an entrepreneurship? subject. small business management
List and explain 3 characteristics of an entrepreneurship? subject. small business management
1. A. List the components of each of the three layers that make up a typical...
1. A. List the components of each of the three layers that make up a typical arterial blood vessel. B. Describe the differences in the middle layer structure between elastic and muscular arteries plus arterioles. C. Describe the two pathways leading to the formation of a clot and the factors that initiate each of these pathways. 2. A. List the four different layers or tunics associated with the digestive tract from the esophagus to the rectum and the components found...
IN JAVA Objectives Practice Link list, List node, compareTo, user defined data type, interfaces Movie List...
IN JAVA Objectives Practice Link list, List node, compareTo, user defined data type, interfaces Movie List Create a link list of the movies along with the rating, number of the people who watched the movie and the genre of the movie. Required classes Movie class ListNode class MovieList class List interface Driver class Movie class implements Comparable Attributes: movie’s name, genre, rating, number of people watched Methods: constructor, getter, setter, equals, compreTo, toString ListNode class Attributes: each node has two...
. Describe (in detail) the Spaulding Classification system and list the 3 device categories.
. Describe (in detail) the Spaulding Classification system and list the 3 device categories.
Explain what is data and list the different types of data? List and explain the different...
Explain what is data and list the different types of data? List and explain the different methods to collect data.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT