There are four V’s commonly used to describe four aspects of big data:
……………… for the scale of data, ……………… for analysis of streaming data, ……………… for different forms of data & ……………… for the uncertainty of data.
Spark uses parallel ……………… to create RDDs from data in stable storage & performs various ……………… on RDDs (working dataset) in memory at high speed.
Three commonly used cloud service models in the service-oriented architecture are: ………………, ………………, & ……………….
Hadoop can be set up in 3 different modes: ………………, ………………, & ……………….
Oozie coordination engine can trigger workflows by ………………, & supports a wide range of job types, including ………………, ………………, & ……………….
Pig Latin is a ……………… language, where a data stream is defined & a series of transformations are applied to the data as it flows through the application.
Please fill in the blanks
Answer:
There are four V’s commonly used to describe four aspects of big data:
Volume for the scale of data, Velocity for analysis of streaming data, Variety for different forms of data & Veracity for the uncertainty of data.
Spark uses parallel transformations to create RDDs from data in stable storage & performs various operations on RDDs (working dataset) in memory at high speed.
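To make the Spark statement concrete: in Spark's terminology, RDDs are built lazily by transformations (such as map and filter) and only evaluated when an action (such as collect or count) is called. The sketch below imitates that pattern in plain Python. It is a toy, not the Spark API: `ToyRDD` and the sample log lines are made up for illustration, and nothing here is distributed or fault tolerant.

```python
import functools


class ToyRDD:
    """Toy stand-in for an RDD (hypothetical class, not part of Spark)."""

    def __init__(self, source, steps=()):
        self._source = source  # callable returning an iterable ("stable storage")
        self._steps = steps    # recorded transformations, applied lazily

    # Transformations: return a new ToyRDD and compute nothing yet.
    def map(self, fn):
        return ToyRDD(self._source, self._steps + (lambda it: (fn(x) for x in it),))

    def filter(self, pred):
        return ToyRDD(self._source, self._steps + (lambda it: (x for x in it if pred(x)),))

    def _compute(self):
        # Replay the recorded transformations over the source data.
        it = iter(self._source())
        for step in self._steps:
            it = step(it)
        return it

    # Actions: force evaluation and return a result to the caller.
    def collect(self):
        return list(self._compute())

    def count(self):
        return sum(1 for _ in self._compute())

    def reduce(self, fn):
        return functools.reduce(fn, self._compute())


lines = ["INFO start", "ERROR disk full", "INFO ok", "ERROR timeout"]
rdd = ToyRDD(lambda: lines)

# Two transformations chained; no work happens until an action runs.
errors = rdd.filter(lambda s: s.startswith("ERROR")).map(lambda s: s.split(" ", 1)[1])

print(errors.collect())  # ['disk full', 'timeout']
print(errors.count())    # 2
```

Real Spark additionally partitions the data across a cluster and can keep an RDD in memory between actions; this toy recomputes the chain on every action.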
Three commonly used cloud service models in the service-oriented architecture are: Software as a Service (SaaS), Platform as a Service (PaaS), & Infrastructure as a Service (IaaS).
Hadoop can be set up in 3 different modes: Standalone mode, Pseudo-Distributed mode & Fully-Distributed mode.
Oozie coordination engine can trigger workflows by time (and also by data availability), & supports a wide range of job types, including Pig, Hive, & MapReduce.
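As a rough illustration of time-based triggering: a coordinator is defined with a start time, an end time, and a frequency, and a workflow run is scheduled at each interval. The sketch below only does the date arithmetic in plain Python. `nominal_times` is a hypothetical helper, not an Oozie API, and treating the end time as exclusive is an assumption of this sketch.

```python
from datetime import datetime, timedelta


def nominal_times(start, end, frequency):
    """Yield the scheduled run times from start up to (not including) end."""
    t = start
    while t < end:  # assumption: the end time is exclusive
        yield t
        t += frequency


runs = list(nominal_times(
    start=datetime(2024, 1, 1, 0, 0),
    end=datetime(2024, 1, 1, 6, 0),
    frequency=timedelta(hours=2),
))

for t in runs:
    print("trigger workflow at", t.isoformat())
```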
Pig Latin is a dataflow language, where a data stream is defined & a series of transformations are applied to the data as it flows through the application.
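The dataflow idea can be sketched in plain Python: define a stream of records, then pass it through one transformation after another. The steps loosely mirror the Pig Latin operators LOAD, FILTER, GROUP, and FOREACH ... GENERATE. This is not Pig Latin itself; the function names and the sample `visits` data are made up for illustration.

```python
from collections import defaultdict


def load(rows):
    """Start the data stream (stands in for LOAD)."""
    for row in rows:
        yield row


def filter_rows(stream, pred):
    """Keep only records that satisfy pred (stands in for FILTER)."""
    return (row for row in stream if pred(row))


def group_by(stream, key):
    """Collect records under a key (stands in for GROUP)."""
    groups = defaultdict(list)
    for row in stream:
        groups[key(row)].append(row)
    yield from groups.items()


def foreach(groups, fn):
    """Emit one output record per group (stands in for FOREACH ... GENERATE)."""
    for k, rows in groups:
        yield fn(k, rows)


# Hypothetical records: (user, url, seconds_on_page)
visits = [
    ("alice", "/home", 12),
    ("bob", "/cart", 3),
    ("alice", "/cart", 40),
    ("carol", "/home", 25),
    ("bob", "/home", 31),
]

stream = load(visits)
long_visits = filter_rows(stream, lambda r: r[2] >= 10)
grouped = group_by(long_visits, key=lambda r: r[0])
counts = dict(foreach(grouped, lambda user, rows: (user, len(rows))))

print(counts)  # {'alice': 2, 'carol': 1, 'bob': 1}
```

Each step consumes the output of the previous one, so the program reads as a description of how data flows rather than as a single declarative query.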