HDPCD Exam - Free Hortonworks Questions and Answers

Question #1 (Topic: )

In a MapReduce job with 500 map tasks, how many map task attempts will there be?

A. It depends on the number of reduces in the job. B. Between 500 and 1000. C. At most 500. D. At least 500. E. Exactly 500.

Answer: D

Question #2 (Topic: )

Workflows expressed in Oozie can contain:

A. Sequences of MapReduce and Pig. These sequences can be combined with other actions including forks, decision points, and path joins. B. Sequences of MapReduce job only; on Pig on Hive tasks or jobs. These MapReduce sequences can be combined with forks and path joins. C. Sequences of MapReduce and Pig jobs. These are limited to linear sequences of actions with exception handlers but no forks. D. Iterntive repetition of MapReduce jobs until a desired answer or state is reached.

Answer: A

Question #3 (Topic: )

Consider the following two relations, A and B.
[Hortonworks-HDPCD-5.0/Hortonworks-HDPCD-3_2.png]
Which Pig statement combines A by its first field and B by its second field?

A. C = DOIN B BY a1, A by b2; B. C = JOIN A by al, B by b2; C. C = JOIN A a1, B b2; D. C = JOIN A SO, B $1;

Answer: B

Question #4 (Topic: )

You need to perform statistical analysis in your MapReduce job and would like to call
methods in the Apache Commons Math library, which is distributed as a 1.3 megabyte
Java archive (JAR) file. Which is the best way to make this library available to your
MapReducer job at runtime?

A. Have your system administrator copy the JAR to all nodes in the cluster and set its location in the HADOOP_CLASSPATH environment variable before you submit your job. B. Have your system administrator place the JAR file on a Web server accessible to all cluster nodes and then set the HTTP_JAR_URL environment variable to its location. C. When submitting the job on the command line, specify the libjars option followed by the JAR file path. D. Package your code and the Apache Commands Math library into a zip file named JobJar.zip

Answer: C

Question #5 (Topic: )

Which one of the following statements is true about a Hive-managed table?

A. Records can only be added to the table using the Hive INSERT command. B. When the table is dropped, the underlying folder in HDFS is deleted. C. Hive dynamically defines the schema of the table based on the FROM clause of a SELECT query. D. Hive dynamically defines the schema of the table based on the format of the underlying data.

Answer: B

Hortonworks HDPCD - Hortonworks Data Platform Certified Developer Exam