Skip to main content

Hadoop Interview Questions


1)   Define Hadoop?


A> Hadoop is a distributed computing platform which was developed using Java

2)  What Kind of Hardware does hadoop support?

A>Hadoop can run on a dual processor/ dual core machines with 4-8 GB RAM

3) What are the common input formats defined in Hadoop?

A> The common input formats are

                                                    1.     TextInputFormat
                                               2.     KeyValueInputFormat
                                               3.     SequenceFileInputFormat
4)  What is InputSplit ?

A> When hadoop runs job, it splits input files into chunks and assign each split to a mapper for processing which is called InputSplit.

5)  How many InputSplits does Hadoop Framework make?

A> Hadoop will make 5 splits as following:

                                                 1. One split for 64K files
                                                 2. Two splits for 65MB files, and
                                                 3. Two splits for 127MB files

6)  Define the use of RecordReader ?

A> InputSplit is assigned with a work but doesn't know how to access it the record   holder class is totally responsible for loading the data from its source and convert it   into keys pair suitable for reading by the Mapper the RecordReaders instance can be defined by the Input Format

7)  What is JobTracker ?

A>JobTracker is a service within Hadoop which runs MapReduce jobs on the cluster.

8)  Define TaskTracker ?

A>TaskTracker is a node in the cluster that accepts tasks like MapReduce and Shuffle operations from a JobTracker.

9) What is Hadoop Streaming?

A>Hadoop streaming is a utility which allows you to create and run map/reduce job ita generic API that allows programs written in any languages to be used as Hadoop   mapper.

10)What is a combiner?


A> Combiner is a mini-reduce process which operates only on data generated by a   mapper when mapper emits the data, combiner receives it as input and sends the   output to reducer.


Think above interview questions would be useful for you.To Become Expert in Hadoop Enroll With HadoopTraining in Chennai Who offer best Hadoop training in Chennai  with real time example and Best placement Support.






  


Comments

Popular posts from this blog

Why Is Python So Famous?

Have you ever come across word the python anywhere? If no, let me brief you about “Python”. Python is an interpreted, object oriented and high level programming languages more over it has built in data structures which makes it attractive for rapid application development as well suitable for scripting language. The applications of python languages are wider it used to create GUI based desktop applications, web applications and business applications. So the opportunities for python are overwhelming, so grab the opportunity by enrolling Python Training in Chennai Advantages of python Python has numerous third party modules which make it capable for interacting with other language and platform Python is open source software which makes it free to use and distribute Python provides excellent readability which helps beginners to learn the language fastly Python offers user friendly data structures Python increases the speed of application developmen...

Why Certification in Hadoop?

Today the hot word in the IT industry is “ Hadoop ”. Since every companies are dealing with large amount of data. Many of the IT recruiters say there would be larger demand for hadoop trained professional.  Many of the professional who want to make career in hadoop are planning to get trained in Hadoop and they want to have certification in hadoop certification moreover   hadoop is gaining popularity in all over world as discussed many of industries like financial service,utilities,media ,retail and paramedical are moving towards hadoop for processing the big data Today many corporate are looking for hadoop trained professional. Having certification would give them assurance that you are master in hadoop and more over having certification normally would be an added advantage and it would help you to get job easily and you can demand your dream salary since you have certification Let me brief you about the power of certification in hadoop IT recruiter’s p...

Software Testing Interview Questions

1) What is PDCA cycle and where testing fits in? There are four steps in a normal software development process. In short, these steps are referred as PDCA. PDCA stands for Plan, Do, Check, Act. Plan:  It defines the goal and the plan for achieving that goal. Do/ Execute:  It depends on the plan strategy decided during the plan stage. It is done according to this phase. Check:  This is the testing part of the software development phase. It is used to ensure that we are moving according to plan and getting the desired result. Act:  This step is used to solve if there any issue is occurred during the check cycle. It takes appropriate action accordingly and revises the plan again. The developers do the "planning and building" of the project while testers do the "check" part of the project.   2) What is the difference among white box, black box and gray box testing? Black box Testing:  The s...