Saturday, 14 July 2012

How to create a Hadoop project in Eclipse



Most of us prefer an IDE for development because it makes life easier. So here we will see how to create a Hadoop project in Eclipse.
Hadoop on Eclipse:
1) Follow the installation setup of a Hadoop single node cluster on Ubuntu (see my previous blog post).
2) Download Hadoop-eclipse-plugin-0.20.3-SNAPSHOT.jar from the following site



(I prefer this plugin and it works well for me).
3) Copy that jar file into /usr/lib/eclipse/plugins/
4) Start Eclipse from a terminal

     $ cd /usr/lib/eclipse
     $ ./eclipse -clean

5) Create a new Map/Reduce project
     a) File -> New -> Map/Reduce Project
     b) Select "Configure Hadoop install directory" or "Specify Hadoop library location" and give the path of the Hadoop folder in which you configured the single node cluster.

6) Open Window -> Show View -> Map/Reduce Locations.
     a) In the Map/Reduce Locations view, add a Hadoop location using the "New Hadoop location" icon (the blue elephant symbol). The location name can be anything you like.
     b) In the General tab
           1) Give Map/Reduce Master as
              Host -> localhost
              Port -> 54311
           2) Give HDFS Master as
              Host -> localhost
              Port -> 54310
           3) Refer to the /conf/*-site.xml files for these values.
7) Refresh the DFS Locations entry in the Project Explorer tab.
      a) You can upload files to the DFS location and also download them.
      b) This location is useful for the input and output of Hadoop map-reduce programs.
8) Finally, you can run your project on Hadoop.
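
The host and port values entered in step 6 come from the *-site.xml files, so it can help to read them out programmatically instead of by eye. Below is a minimal sketch of that idea in plain Java. The class name SiteXmlReader and its two helper methods are hypothetical, and the property names fs.default.name and mapred.job.tracker are an assumption based on a typical Hadoop 0.20 single node setup; check your own files for the names actually used there. The sample XML strings simply mirror the ports used in this post.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of Hadoop or the Eclipse plugin.
final class SiteXmlReader {

    // Parse a Hadoop-style <configuration> document into a name -> value map.
    static Map<String, String> readProperties(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            Map<String, String> props = new HashMap<>();
            NodeList nodes = doc.getElementsByTagName("property");
            for (int i = 0; i < nodes.getLength(); i++) {
                Element p = (Element) nodes.item(i);
                String name = p.getElementsByTagName("name").item(0).getTextContent().trim();
                String value = p.getElementsByTagName("value").item(0).getTextContent().trim();
                props.put(name, value);
            }
            return props;
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse configuration XML", e);
        }
    }

    // Split "hdfs://host:port" or "host:port" into {host, port}.
    static String[] hostAndPort(String address) {
        String stripped = address.replaceFirst("^[a-zA-Z]+://", ""); // drop the scheme, if any
        int slash = stripped.indexOf('/');
        if (slash >= 0) {
            stripped = stripped.substring(0, slash); // drop any trailing path
        }
        int colon = stripped.lastIndexOf(':');
        if (colon < 0) {
            return new String[] { stripped, "" }; // no port given
        }
        return new String[] { stripped.substring(0, colon), stripped.substring(colon + 1) };
    }

    public static void main(String[] args) {
        // Sample contents; in practice, read the text of your own *-site.xml files.
        String coreSite = "<configuration><property><name>fs.default.name</name>"
                + "<value>hdfs://localhost:54310</value></property></configuration>";
        String mapredSite = "<configuration><property><name>mapred.job.tracker</name>"
                + "<value>localhost:54311</value></property></configuration>";

        String[] hdfs = hostAndPort(readProperties(coreSite).get("fs.default.name"));
        String[] mr = hostAndPort(readProperties(mapredSite).get("mapred.job.tracker"));

        System.out.println("HDFS Master       -> host " + hdfs[0] + ", port " + hdfs[1]);
        System.out.println("Map/Reduce Master -> host " + mr[0] + ", port " + mr[1]);
    }
}
```

If the values printed for your files differ from 54310 and 54311, use the ones from your files in the General tab.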

Now create your own map/reduce program in Eclipse and enjoy Hadoop.
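
If you have not written a map/reduce program before, here is a rough sketch of the shape such a program takes, using word count as the example. Note that this is plain Java and deliberately does not use the Hadoop API, so it runs without any Hadoop jars; the class WordCountSketch and its methods are made up for illustration. In a real Hadoop job the map and reduce steps would live in Mapper and Reducer classes, and the grouping step in between is done by the framework.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: simulates the map -> group -> reduce flow in memory.
final class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every whitespace-separated token in one line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(token, 1));
            }
        }
        return pairs;
    }

    // Reduce phase: sum all the counts that were emitted for one word.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int c : counts) {
            sum += c;
        }
        return sum;
    }

    // Driver: map every line, group the emitted values by key, then reduce each group.
    static Map<String, Integer> run(List<String> lines) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String line : lines) {
            for (Map.Entry<String, Integer> pair : map(line)) {
                grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
            }
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            result.put(e.getKey(), reduce(e.getKey(), e.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(run(Arrays.asList("hello hadoop", "hello eclipse")));
    }
}
```

Running main prints {eclipse=1, hadoop=1, hello=2}. Once this flow is clear, the same logic can be moved into a real job inside the Map/Reduce project created in step 5.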

Cheers,
Kiran.



