Which InputFormat should you use to complete the line: conf.setInputFormat (____.class) ; ?


Given a directory of files with the following structure: line number, tab character, string:

Example:

1abialkjfjkaoasdfjksdlkjhqweroij

2kadfjhuwqounahagtnbvaswslmnbfgy

3kjfteiomndscxeqalkzhtopedkfsikj

You want to send each line as one record to your Mapper.

Which InputFormat should you use to complete the line: conf.setInputFormat (____.class) ; ?
A . SequenceFileAsTextInputFormat
B . SequenceFileInputFormat
C . KeyValueFileInputFormat
D . BDBInputFormat

Answer: C

Explanation:

http://stackoverflow.com/questions/9721754/how-to-parse-customwritable-from-text-in-hadoop

Leave a Reply

Your email address will not be published.