PdfPrep.com

HCAHD Apache Hadoop Developer Online Free Test

With our Apache Hadoop Developer practice test, you don’t need to look for other online testing engine that are often obsolete. In most of the cases, people looking for Apache Hadoop Developer prep questions online from us for their certification prep requirements. Our top ranked Apache Hadoop Developer prep questions usually searched on the internet using different search terms like specified below.

Page 1 of 5

Loading... Loading...
1. Which one of the following statements is FALSE regarding the communication between DataNodes and a federation of NameNodes in Hadoop 2.0?

Loading... Loading...
2. Your client application submits a MapReduce job to your Hadoop cluster. Identify the Hadoop daemon on which the Hadoop framework will look for an available slot schedule a MapReduce operation.

Loading... Loading...
3. Which process describes the lifecycle of a Mapper?

Loading... Loading...
4. Which two of the following are true about this trivial Pig program' (choose Two)

Loading... Loading...
5. Examine the following Pig commands:



Which one of the following statements is true?

Loading... Loading...
6. Which two of the following statements are true about Pig's approach toward data? Choose 2 answers

Loading... Loading...
7. You want to perform analysis on a large collection of images. You want to store this data in HDFS and process it with MapReduce but you also want to give your data analysts and data scientists the ability to process the data directly from HDFS with an interpreted high-level programming language like Python.

Which format should you use to store this data in HDFS?

Loading... Loading...
8. What does the following command do?

register '/piggyban):/pig-files.jar';

Loading... Loading...
9. Review the following 'data' file and Pig code.



Which one of the following statements is true?

Loading... Loading...
10. Given the following Hive command:

INSERT OVERWRITE TABLE mytable SELECT * FROM myothertable;

Which one of the following statements is true?

Page 2 of 5

Loading... Loading...
11. Which best describes how TextInputFormat processes input files and line breaks?

Loading... Loading...
12. You have written a Mapper which invokes the following five calls to the OutputColletor.collect method:

output.collect (new Text (“Apple”), new Text (“Red”) ) ;

output.collect (new Text (“Banana”), new Text (“Yellow”) ) ;

output.collect (new Text (“Apple”), new Text (“Yellow”) ) ;

output.collect (new Text (“Cherry”), new Text (“Red”) ) ;

output.collect (new Text (“Apple”), new Text (“Green”) ) ;

How many times will the Reducer’s reduce method be invoked?

Loading... Loading...
13. To process input key-value pairs, your mapper needs to lead a 512 MB data file in memory.

What is the best way to accomplish this?

Loading... Loading...
14. Which one of the following statements is true about a Hive-managed table?

Loading... Loading...
15. Consider the following two relations, A and B.



What is the output of the following Pig commands?

X = GROUP A BY S1;

DUMP X;

Loading... Loading...
16. What is a SequenceFile?

Loading... Loading...
17. You use the hadoop fs Cput command to write a 300 MB file using and HDFS block size of 64 MB.

Just after this command has finished writing 200 MB of this file, what would another user see when trying to access this life?

Loading... Loading...
18. Indentify the utility that allows you to create and run MapReduce jobs with any executable

or script as the mapper and/or the reducer?

Loading... Loading...
19. You have just executed a MapReduce job. Where is intermediate data written to after being emitted from the Mapper’s map method?

Loading... Loading...
20. A combiner reduces:

Page 3 of 5

Loading... Loading...
21. How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapReduce?

Loading... Loading...
22. All keys used for intermediate output from mappers must:

Loading... Loading...
23. Which one of the following statements describes a Pig bag. tuple, and map, respectively?

Loading... Loading...
24. You want to run Hadoop jobs on your development workstation for testing before you submit them to your production cluster.

Which mode of operation in Hadoop allows you to most closely simulate a production cluster while using a single machine?

Loading... Loading...
25. Which HDFS command uploads a local file X into an existing HDFS directory Y?

Loading... Loading...
26. In Hadoop 2.0, which TWO of the following processes work together to provide automatic failover of the NameNode? Choose 2 answers

Loading... Loading...
27. To use a lava user-defined function (UDF) with Pig what must you do?

Loading... Loading...
28. When is the earliest point at which the reduce method of a given Reducer can be called?

Loading... Loading...
29. Which one of the following statements describes the relationship between the ResourceManager and the ApplicationMaster?

Loading... Loading...
30. Which HDFS command copies an HDFS file named foo to the local filesystem as localFoo?

Page 4 of 5

Loading... Loading...
31. You need to perform statistical analysis in your MapReduce job and would like to call methods in the Apache Commons Math library, which is distributed as a 1.3 megabyte Java archive (JAR) file.

Which is the best way to make this library available to your MapReducer job at runtime?

Loading... Loading...
32. In a MapReduce job with 500 map tasks, how many map task attempts will there be?

Loading... Loading...
33. You want to count the number of occurrences for each unique word in the supplied input data. You’ve decided to implement this by having your mapper tokenize each word and emit a literal value 1, and then have your reducer increment a counter for each literal 1 it receives. After successful implementing this, it occurs to you that you could optimize this by specifying a combiner.

Will you be able to reuse your existing Reduces as your combiner in this case and why or why not?

Loading... Loading...
34. What data does a Reducer reduce method process?

Loading... Loading...
35. Given a directory of files with the following structure: line number, tab character, string:

Example:

1abialkjfjkaoasdfjksdlkjhqweroij

2kadfjhuwqounahagtnbvaswslmnbfgy

3kjfteiomndscxeqalkzhtopedkfsikj

You want to send each line as one record to your Mapper.

Which InputFormat should you use to complete the line: conf.setInputFormat (____.class) ; ?

Loading... Loading...
36. Examine the following Hive statements:



Assuming the statements above execute successfully, which one of the following statements is true?

Loading... Loading...
37. When can a reduce class also serve as a combiner without affecting the output of a MapReduce program?

Loading... Loading...
38. What does the following WebHDFS command do?

Curl -1 -L “http://host:port/webhdfs/v1/foo/bar?op=OPEN”

Loading... Loading...
39. You need to run the same job many times with minor variations. Rather than hardcoding all job configuration options in your drive code, you’ve decided to have your Driver subclass org.apache.hadoop.conf.Configured and implement the org.apache.hadoop.util.Tool interface.

Indentify which invocation correctly passes.mapred.job.name with a value of Example to Hadoop?

Loading... Loading...
40. Determine which best describes when the reduce method is first called in a MapReduce job?

Page 5 of 5

Loading... Loading...
41. You have a directory named jobdata in HDFS that contains four files: _first.txt, second.txt, .third.txt and #data.txt.

How many files will be processed by the FileInputFormat.setInputPaths () command when it's given a path object representing this directory?

Loading... Loading...
42. In a large MapReduce job with m mappers and n reducers, how many distinct copy operations will there be in the sort/shuffle phase?

Loading... Loading...
43. Which Hadoop component is responsible for managing the distributed file system metadata?

Loading... Loading...
44. Review the following data and Pig code.

M,38,95111

F,29,95060

F,45,95192

M,62,95102

F,56,95102

A = LOAD 'data' USING PigStorage('.') as (gender:Chararray, age:int, zlp:chararray);

B = FOREACH A GENERATE age;

Which one of the following commands would save the results of B to a folder in hdfs named myoutput?

Loading... Loading...

 

Exit mobile version