Skip to main content

Frequently Asked Hadoop Interview Questions

1) What is Hadoop?
Hadoop is a distributed computing platform. It is written in Java. It consist of the features like Google File System and MapReduce.
2.) What is Hadoop Map Reduce ?
For processing large data sets in parallel across a hadoop cluster, Hadoop MapReduce framework is used. Data analysis uses a two-step map and reduce process.
3).How Hadoop MapReduce works?
In MapReduce, during the map phase it counts the words in each document, while in the reduce phase it aggregates the data as per the document spanning the entire collection. During the map phase the input data is divided into splits for analysis by map tasks running in parallel across Hadoop framework.
4.)Explain what is sqoop in Hadoop ?
To transfer the data between Relational database management (RDBMS) and Hadoop HDFS a tool is used known as Sqoop. Using Sqoop data can be transferred from RDMS like MySQL or Oracle into HDFS as well as exporting data from HDFS files to RDBMS
5.) What is the Hadoop MapReduce API contract for a key and value Class?
The Key must implement the org.apache.hadoop.io.WritableComparable interface. The value must implement the org.apache.hadoop.io.Writable interface.
6) What is big data?
Big Data is nothing but an assortment of such a huge and complex data that it becomes very tedious to capture, store, process, retrieve and analyze it with the help of on-hand database management tools or traditional data processing techniques.
7 .Explain what is sqoop in Hadoop ?
To transfer the data between Relational database management (RDBMS) and Hadoop HDFS a tool is used known as Sqoop. Using Sqoop data can be transferred from RDMS like MySQL or Oracle into HDFS as well as exporting data from HDFS file to RDBMS
8. What is the Hadoop MapReduce API contract for a key and value Class?
A.The Key must implement the org.apache.hadoop.io.WritableComparable interface. The value must implement the org.apache.hadoop.io.Writable interface.
The above questions are frequently asked interview question. Besant Technologies offer best Hadoop Training in Chennai with best placement support 


Comments

Popular posts from this blog

Why Is Python So Famous?

Have you ever come across word the python anywhere? If no, let me brief you about “Python”. Python is an interpreted, object oriented and high level programming languages more over it has built in data structures which makes it attractive for rapid application development as well suitable for scripting language. The applications of python languages are wider it used to create GUI based desktop applications, web applications and business applications. So the opportunities for python are overwhelming, so grab the opportunity by enrolling Python Training in Chennai Advantages of python Python has numerous third party modules which make it capable for interacting with other language and platform Python is open source software which makes it free to use and distribute Python provides excellent readability which helps beginners to learn the language fastly Python offers user friendly data structures Python increases the speed of application developmen...

Why Certification in Hadoop?

Today the hot word in the IT industry is “ Hadoop ”. Since every companies are dealing with large amount of data. Many of the IT recruiters say there would be larger demand for hadoop trained professional.  Many of the professional who want to make career in hadoop are planning to get trained in Hadoop and they want to have certification in hadoop certification moreover   hadoop is gaining popularity in all over world as discussed many of industries like financial service,utilities,media ,retail and paramedical are moving towards hadoop for processing the big data Today many corporate are looking for hadoop trained professional. Having certification would give them assurance that you are master in hadoop and more over having certification normally would be an added advantage and it would help you to get job easily and you can demand your dream salary since you have certification Let me brief you about the power of certification in hadoop IT recruiter’s p...

Software Testing Interview Questions

1) What is PDCA cycle and where testing fits in? There are four steps in a normal software development process. In short, these steps are referred as PDCA. PDCA stands for Plan, Do, Check, Act. Plan:  It defines the goal and the plan for achieving that goal. Do/ Execute:  It depends on the plan strategy decided during the plan stage. It is done according to this phase. Check:  This is the testing part of the software development phase. It is used to ensure that we are moving according to plan and getting the desired result. Act:  This step is used to solve if there any issue is occurred during the check cycle. It takes appropriate action accordingly and revises the plan again. The developers do the "planning and building" of the project while testers do the "check" part of the project.   2) What is the difference among white box, black box and gray box testing? Black box Testing:  The s...