Big Data Computing Assignment 5 Answers 2021

Big Data Computing Assignment 5 Answers 2021:- We do not claim 100% surety of answers, these answers are based on our sole knowledge, and by posting these answers we are just trying to help students, so we urge do your assignment on your own.

NPTEL ALL WEEK ASSIGNMENT ANSWERS:-

Q1. Columns in HBase are organized to___________________________ 

(A) Column group 

(B) Column list 

(C) Column base 

(D) Column families

Ans:- (D) Column families

Q2. HBase is a distributed ________ database built on top of the Hadoop file system 

(A) Row-oriented 

(B) Tuple-oriented 

(C) Column-oriented 

(D) None of the mentioned

Ans:- (C) Column-oriented 

Q3. A small chunk of data residing in one machine which is part of a cluster of machines holding one HBase table is known as__________________ 

(A) Region 

(B) Split 

(C) Rowarea 

(D) Tablearea

Ans:- (A) Region 

Q4. In HBase, __________________is a combination of row, column family, column qualifier and contains a value and a timestamp. 

(A) Cell 

(B) Stores 

(C) HMaster 

(D) Region Server

Ans:- (A) Cell 

Q5. HBase architecture has 3 main components: 

(A) Client, Column family, Region Server 

(B) Cell, Rowkey, Stores 

(C) HMaster, Region Server, Zookeeper 

(D) HMaster, Stores, Region Server

Ans:- (C) HMaster, Region Server, Zookeeper 

Q6. HBase stores data in_______________________ 

(A) As many filesystems as the number of region servers 

(B) One filesystem per column family 

(C) A single filesystem available to all region servers 

(D) One filesystem per table

Ans:- (C) A single filesystem available to all region servers 

Q7. Kafka is run as a cluster comprised of one or more servers each of which is called___________________ 

(A) cTakes 

(B) Chunks 

(C) Broker 

(D) None of the mentioned

Ans:- For Answer Click Here

Q8. True or False ?

Statement 1: Batch Processing provides ability to process and analyze data at-rest (stored data). 

Statement 2: Stream Processing provides ability to ingest, process and analyze data in-motion in real or near-real-time. 

(A) Only Statement 1 is true 

(B) Only Statement 2 is true 

(C) Both Statements are true 

(D) Both Statements are false

Ans:- (C) Both Statements are true 

Q9. ________________is a central hub to transport and store event streams in real time. 

(A) Kafka Core 

(B) Kafka Connect 

(C) Kafka Streams 

(D) None of the mentioned

Ans:- (C) Kafka Streams 

Q10. What are the parameters defined to specify window operation ? 

(A) State size, window length 

(B) State size, sliding interval 

(C) Window length, sliding interval 

(D) None of the mentioned

Ans:- For Answer Click Here

Q11. Consider the following dataset Customers:

big Data computing 5

Using the Customers table answer the following using spark streaming fundamentals:

Using the following pseudo code, find the rank of each customer visiting the supermarket

val wSpec3 = Window.partitionBy(“name”).orderBy(“date”)

customers.withColumn( “rank”, rank().over(wSpec3) ).show()

(A)

big Data computing 5

(B)

big Data computing 5

(C)

big Data computing 5

(D)

Big Data Computing Assignment 5 Answers 2021

Ans:- (B)

Q12. ________________is a Java library to process event streams live as they occur. 

(A) Kafka Core 

(B) Kafka Connect 

(C) Kafka Streams 

(D) None of the mentioned

Ans:- (C) Kafka Streams 

Big Data Computing Assignment 5 Answers 2021:- We do not claim 100% surety of answers, these answers are based on our sole knowledge, and by posting these answers we are just trying to help students, so we urge do your assignment on your own.

JOIN US ON YOUTUBE

Leave a Comment