Latest Hadoop Interview Questions-part 3

Q22. Whats is Distributed Cache in Hadoop
Distributed Cache is a facility provided by the Map/Reduce framework to cache files (text, archives, jars and so on) needed by applications during execution of the job. The framework will copy the necessary files to the slave node before any tasks for the job are executed on that node.

Q23. What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it  
This is because distributed cache is much faster. It copies the file to all trackers at the start of the job. Now if the task tracker runs 10 or 100 mappers or reducer, it will use the same copy of distributed cache. On the other hand, if you put code in file to read it from HDFS in the MR job then every mapper will try to access it from HDFS hence if a task tracker run 100 map jobs then it will try to read this file 100 times from HDFS. Also HDFS is not very efficient when used like this.

Q.24 What mechanism does Hadoop framework provides to synchronize changes made in Distribution Cache during runtime of the application  
This is a trick questions. There is no such mechanism. Distributed Cache by design is read only during the time of Job execution

Q25. Have you ever used Counters in Hadoop. Give us an example scenario
Anybody who claims to have worked on a Hadoop project is expected to use counters

Q26. Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job  
Yes, The input format class provides methods to add multiple directories as input to a Hadoop job

Q27. Is it possible to have Hadoop job output in multiple directories. If yes then how  
Yes, by using Multiple Outputs class

Q28. What will a hadoop job do if you try to run it with an output directory that is already present? Will it
- overwrite it
- warn you and continue
- throw an exception and exit
The hadoop job will throw an exception and exit.

Q29. How can you set an arbitary number of mappers to be created for a job in Hadoop  
This is a trick question. You cannot set it

Q30. How can you set an arbitary number of reducers to be created for a job in Hadoop  
You can either do it progamatically by using method setNumReduceTasksin the JobConfclass or set it up as a configuration setting

41 comments:

  1. This information you provided in the blog that was really unique I love it!!, Thanks for sharing such a great blog..Keep posting..

    Hadoop Training in Chennai

    ReplyDelete
  2. This comment has been removed by a blog administrator.

    ReplyDelete
  3. Your posts is really helpful for me.Thanks for your wonderful post. I am very happy to read your post.AWS Training in chennai | AWS Training chennai | AWS course in chennai

    ReplyDelete
  4. Good Post with interesting assessment. . It’s really helpful for who want to attend Hadoop interviews. The author has done really great job. I will share you link related to Hadoop interview Question with answers just look, I hope I will helpful for your career: Hadoop Interview Q/A's

    ReplyDelete
  5. It is amazing and wonderful to visit your site.Thanks for sharing this information,this is useful to me...
    SEO Company in India

    ReplyDelete
  6. It is very excellent blog and useful article thank you for sharing with us , keep posting learn more about Big Data Hadoop important information thank you providing this important information onBig Data Hadoop Online Training

    ReplyDelete
  7. Hi,
    Thank you so much for sharing Hadoop questions, these are most helpful and informative to face the interview. Keep updating with latest questons
    Thank You
    Hari prasad

    ReplyDelete
  8. My rather long internet look up has at the end of the day been compensated with pleasant insight to talk about with my family and friends.


    ccna training in chennai



    ccna training in bangalore


    ccna training in pune

    ReplyDelete
  9. The site was so nice, I found out about a lot of great things. I like the way you make your blog posts. Keep up the good work and may you gain success in the long run.
    java training in chennai | java training in bangalore

    java training in tambaram | java training in velachery

    java training in omr

    ReplyDelete
  10. This comment has been removed by the author.

    ReplyDelete
  11. Hello! This is my first visit to your blog! We are a team of volunteers and starting a new initiative in a community in the same niche. Your blog provided us useful information to work on. You have done an outstanding job.


    AWS Training in Rajaji Nagar | Amazon Web Services Training in Rajaji Nagar


    Amazon Web Services Training in Pune | Best AWS Training in Pune

    AWS Online Training | Online AWS Certification Course - Gangboard

    ReplyDelete
  12. What a great effort, it was superb blog and thanks for sharing...! Big Data Hadoop Training in Pune

    ReplyDelete
  13. I really appreciate this post. I’ve been looking all over for this! Thank goodness I found it on Bing. You’ve made my day! Thx again!
    python training in velachery
    python training institute in chennai

    ReplyDelete
  14. I really appreciate this post. I’ve been looking all over for this! Thank goodness I found it on Bing. You’ve made my day! Thx again!
    python training in velachery
    python training institute in chennai

    ReplyDelete
  15. Fantastic work! This is the type of information that should follow collective approximately the web. Embarrassment captivating position Google for not positioning this transmit higher! Enlarge taking place greater than and visit my web situate
    Data Science training in rajaji nagar | Data Science Training in Bangalore | Data Science with Python training in chennai

    Data Science training in electronic city | Data Science training in USA

    Data science training in pune | Data science training in kalyan nagar

    ReplyDelete
  16. Thank you for taking the time and sharing this information with us. It was indeed very helpful and insightful while being straight forward and to the point.
    Java training in Chennai | Java training in Annanagar | Java training in Chennai

    Java training in Chennai | Java training in Bangalore | Java training in Electronic city

    ReplyDelete
  17. Good Post! Thank you so much for sharing this pretty post, it was so good to read and useful to improve my knowledge as updated one, keep blogging.
    Online DevOps Certification Course - Gangboard | Java online training

    ReplyDelete
  18. I have visited this blog first time and i got a lot of informative data from here which is quiet helpful for me indeed.
    apple service center chennai | apple iphone service center chennai | apple ipad service center chennai | apple mac service center chennai | ipad service center

    ReplyDelete

  19. It's interesting that many of the bloggers to helped clarify a few things for me as well as giving.Most of ideas can be nice content.The people to give them a good shake to get your point and across the command
    Data Science training in Chennai
    Data science training in Bangalore
    Data science training in pune
    Data science online training
    Data Science Interview questions and answers
    Data Science Tutorial
    Data science training in bangalore

    ReplyDelete
  20. Some us know all relating to the compelling medium you present powerful steps on this blog and therefore strongly encourage contribution from other ones on this subject while our own child is truly discovering a great deal. Have fun with the remaining portion of the year.
    fire and safety course in chennai

    ReplyDelete
  21. This comment has been removed by the author.

    ReplyDelete
  22. Informative post indeed, I’ve being in and out reading posts regularly and I see alot of engaging people sharing things and majority of the shared information is very valuable and so, here’s my fine read.



    Big Data Hadoop Training In Chennai | Big Data Hadoop Training In anna nagar | Big Data Hadoop Training In omr | Big Data Hadoop Training In porur | Big Data Hadoop Training In tambaram | Big Data Hadoop Training In velachery


    ReplyDelete
  23. I prefer to study this kind of material. Nicely written information in this post, the quality of content is fine and the conclusion is lovely. Things are very open and intensely clear explanation of issues...
    Salesforce Training in Chennai

    Salesforce Online Training in Chennai

    Salesforce Training in Bangalore

    Salesforce Training in Hyderabad

    Salesforce training in ameerpet

    Salesforce Training in Pune

    Salesforce Online Training

    Salesforce Training

    ReplyDelete
  24. Everything is very open with a clear clarification of the issues. It was truly informative. Your site is useful. Thank you for sharing!|
    data scientist training and placement

    ReplyDelete
  25. Really an awesome blog and informative content. Keep posting more blogs with us. Thank you.
    Data Science Course Training in Hyderabad

    ReplyDelete
  26. North section friend to fly. Ability about key ask popular nice.information

    ReplyDelete