ACA-BigData1 ACA Big Data Associate

Loading demo links...

Showing 1–3 of 10 questions

Question 1

Which of the following Hadoop ecosystem componets can you choose to setup a streaming log

analysis system?(Number of correct answers: 3)

Score 2

Select all that apply, then click Submit answer.

  • Apache Flume

  • Apache Kafka

  • Apache Spark

  • Apache Lucene


Question 2

In DataWorks, a task should be instantiated first before a

scheduled task is running every time, that is, generating a

corresponding instance which is executed for running the

scheduled task. The status is different in each phase of the

scheduling process, including ________. (Number of correct

answers: 3)

Select all that apply, then click Submit answer.

  • Not running

  • Running

  • Running Successfully


Question 3

A distributed file system like GFS and Hadoop are design to have much larger block(or chunk) size

like 64MB or 128MB, which of the following descriptions are correct? (Number of correct answers: 4)

Score 2

Select all that apply, then click Submit answer.

  • It reduces clients' need to interact with the master because reads and writes on the same block( or
    chunck) require only one initial request to the master for block location information

  • Since on a large block(or chunk), a client is more likely to perform many operations on a given block, it can reduce network overhead by keeping a persistent TCP connection to the metadata server over an extended period of time

  • It reduces the size of the metadata stored on the master

  • The servers storing those blocks may become hot spots if many clients are accessing the same small
    files

  • If necessary to support even larger file systems, the cost of adding extra memory to the meta data
    server is a big price