#title Apache Hadoop MapReduce ÀÌÀü À§Å° ³»¿ëÀÌ ¿µ¾î ¹ßÇ¥ÀڷḦ ±×´ë·Î ¿Å°Ü Àû¾î ³í °ÍÀ̶ó óÀ½ Á¢ÇϽô ºÐµé¿¡°Ô º° µµ¿òÀÌ ¾ÈµÉ°Í °°¾Æ¼­ ¾ÆÆÄÄ¡ ÇϵÓÀÇ °ø½Ä À§Å°¸¦ ¹ø¿ªÇÑ °ÍÀÔ´Ï´Ù. ¼­ºê ¸Þ´ºµéµµ ««È÷ ¹ø¿ªÇÏ°Ú½À´Ï´Ù. ÀÌ ÆäÀÌÁöÀÇ ¿øº»Àº [http://wiki.apache.org/lucene-hadoop/ Lucene-hadoop Wiki]ÀÔ´Ï´Ù. = ¾ÆÆÄÄ¡ ÇÏµÓ (Apache Hadoop) = [http://hadoop.apache.org/ ¾ÆÆÄÄ¡ ÇϵÓ]Àº ´Ù¼öÀÇ Àú°¡ ¼­¹ö·Î ±¸¼ºµÈ Ŭ·¯½ºÅ͸¦ ÀÌ¿ëÇؼ­ ¾îÇø®ÄÉÀ̼ÇÀ» ½ÇÇàÇÏ´Â ÇÁ·¹ÀÓ¿öÅ©(framework)ÀÌ´Ù. ÇÏµÓ ÇÁ·¹ÀÓ¿öÅ©¸¦ ÀÌ¿ëÇÏ¸é ¾îÇø®ÄÉÀ̼ÇÀº ¼Õ½±°Ô ½Å·Ú¼º°ú µ¥ÀÌÅÍ ¿îµ¿¼ºÀ» È®º¸ÇÒ ¼ö ÀÖ´Ù. ÇϵÓÀº [http://wiki.apache.org/hadoop/HadoopMapReduce ¸Ê/¸®µà½º(Map/Reduce)]¶ó´Â °è»ê Æз¯´ÙÀÓ(paradigm)À» ±¸ÇöÇϴµ¥, ¸Ê/¸®µà½º¿¡¼­ ÇÑ ¾îÇø®ÄÉÀ̼ÇÀº Ŭ·¯½ºÅÍ »óÀÇ ÀÓÀÇÀÇ ÇÑ ¼­¹ö¿¡¼­ ½ÇÇàµÇ´Â ÀÛÀº ´ÜÀ§ÀÇ ÀÏ ¿©·¯ °³·Î ÂÉ°³Á®¼­ ½ÇÇàµÈ´Ù. Ãß°¡·Î, ÇÏµÓ ÇÁ·¹ÀÓ¿öÅ©´Â [http://wiki.apache.org/hadoop/DFS HDFS]¶ó´Â ºÐ»ê ÆÄÀÏ ½Ã½ºÅÛÀ» Æ÷ÇÔÇÏ°í Àִµ¥, ÀÌ´Â °è»ê ¼­¹öµé¿¡ µ¥ÀÌÅ͸¦ ÀúÀåÇϸ鼭 Ŭ·¯½ºÅÍ Àü¿ª¿¡ °ÉÃÄ ¿ì¼öÇÑ ¼º´ÉÀ» º¸¿©ÁØ´Ù. ¸Ê/¸®µà½º¿Í ºÐ»ê ÆÄÀÏ ½Ã½ºÅÛ ¸ðµÎ Ŭ·¯½ºÅÍ ³»ÀÇ ÀϺΠ¼­¹ö °íÀå¿¡ ´ëÇØ ÇÁ·¹ÀÓ¿öÅ©°¡ ÀÚµ¿À¸·Î ´ëóÇϵµ·Ï ¼³°èÇÏ¿´´Ù. == Àü¹ÝÀûÀÎ Á¤º¸ == * [http://hadoop.apache.org/ ¾ÆÆÄÄ¡ ÇÏµÓ °ø½Ä À¥»çÀÌÆ®]: ´Ù¿î·Îµå, ¹ö±× Æ®·¢Å·, ¸ÞÀϸµ ¸®½ºÆ® µî * [http://wiki.apache.org/hadoop/ProjectDescription ¾ÆÆÄÄ¡ ÇÏµÓ °³¿ä] * [http://wiki.apache.org/hadoop/FAQ ÀÚÁÖÇÏ´Â Áú¹®µé] * [http://wiki.apache.org/hadoop/HadoopIsNot Çϵӿ¡ °üÇÑ ¿ÀÇØ] * [http://wiki.apache.org/hadoop/Distribution ÇÏµÓ ¹èÆ÷ÆÇ] * ÇÏµÓ °ü·Ã [http://wiki.apache.org/hadoop/HadoopPresentations ¹ßÇ¥ÀÚ·á], [http://wiki.apache.org/hadoop/Books °ü·Ã¼­Àû], [http://wiki.apache.org/hadoop/HadoopArticles ±â»ç], [http://wiki.apache.org/hadoop/Papers ³í¹®] * [http://wiki.apache.org/hadoop/PoweredBy ÆÄ¿öµå¹ÙÀÌ]: ¾ÆÆÄÄ¡ ÇϵÓÀ» ÀÌ¿ëÇÏ´Â »çÀÌÆ®¿Í ¾îÇø®ÄÉÀ̼ÇÀÇ ¸ñ·Ï * Áö¿ø * [http://wiki.apache.org/hadoop/Help ÇÏµÓ Ä¿¹Â´ÏƼ] * [http://wiki.apache.org/hadoop/Support °í¿ë °¡´ÉÇÑ ÀÎÀç¿Í ±â¾÷] * ÇÏµÓ Ä¿¹Â´ÏƼ À̺¥Æ®¿Í ÇÐȸ * [http://wiki.apache.org/hadoop/HadoopUserGroups HadoopUserGroups (HUGs)] * [http://wiki.apache.org/hadoop/HadoopSummit HadoopSummit] * [http://developer.yahoo.com/hadoop/tutorial/ Yahoo! ÇÏµÓ Æ©Å丮¾ó ]: ÇÏµÓ ¼³Á¤, HDFS, ¸Ê/¸®µà½º¸¦ Æ÷ÇÔÇÏ´Â Æ©Å丮¾ó * [http://www.cloudera.com/hadoop-training-basic Cloudera ¿Â¶óÀÎ ÇÏµÓ ±³À°]: ºñµð¿À ±³À°, ½Ç½À, ¹Ì¸® ¼³Á¤µÈ [http://www.cloudera.com/hadoop-training-virtual-machine °¡»ó ¸Ó½Å] Á¦°ø. ¼ö¾÷Àº [http://www.cloudera.com/hadoop-training-programming-with-hadoop ÇϵÓ], [http://www.cloudera.com/hadoop-training-mapreduce-algorithms ¸Ê/¸®µà½º], [http://www.cloudera.com/hadoop-training-hive-introduction ÇÏÀ̺ê(Hive)], [http://www.cloudera.com/hadoop-training-pig-introduction ÇÈ(Pig)] µîÀ» Æ÷ÇÔÇÔ == »ç¿ëÀÚ ¹®¼­ == * [http://wiki.apache.org/hadoop/ImportantConcepts Áß¿ä °³³ä] * [http://wiki.apache.org/hadoop/GettingStartedWithHadoop ÇÏµÓ ½ÃÀÛ] * [http://wiki.apache.org/hadoop/QuickStart ¼Ó¼º¹Ý] * [http://hadoop.apache.org/core/docs/current/commands_manual.html ÇÏµÓ ½© ½ºÅ©¸³Æ®¸¦ À§ÇÑ ÄÁ¸àµå ¶óÀÎ ¿É¼Ç] * [http://wiki.apache.org/hadoop/HadoopOverview ÇÏµÓ ÄÚµå °³¿ä] * [http://wiki.apache.org/hadoop/TroubleShooting ¹®Á¦ÇØ°á] * Ŭ·¯½ºÅÍ ¼Â¾÷ * [http://wiki.apache.org/hadoop/Running_Hadoop_On_Ubuntu_Linux_%28Single-Node_Cluster%29 ¿ìºÐÅõ ¸®´ª½º¿¡¼­ ÇÏµÓ ½ÇÇà (¸Ó½Å Çϳª·Î ±¸¼ºµÈ Ŭ·¯½ºÅÍ)] (¸Ó½Å ÇѴ븦 ÀÌ¿ëÇؼ­ ÇϵÓÀ» ¼³Ä¡, ¼³Á¤, ½ÇÇàÇÏ´Â Æ©Å丮¾ó) * [http://wiki.apache.org/hadoop/Running_Hadoop_On_OS_X_10.5_64-bit_%28Single-Node_Cluster%29 OS X 10.5 64-bit¿¡¼­ ÇÏµÓ ½ÇÇà (¸Ó½Å Çϳª·Î ±¸¼ºµÈ Ŭ·¯½ºÅÍ)] * [http://wiki.apache.org/hadoop/HowToConfigure ÇÏµÓ ¼³Á¤ÇÏ´Â ¹ý] * [http://wiki.apache.org/hadoop/WebApp%20URLs WebApp¸¦ ÀÌ¿ëÇÑ ½Ã½ºÅÛ ¸ð´ÏÅ͸µ * [http://wiki.apache.org/hadoop/NameNodeFailover ³×ÀÓ³ëµå Àå¾Ö ´ëó] * [http://wiki.apache.org/hadoop/GangliaMetrics How to get metrics into ganglia] * [http://wiki.apache.org/hadoop/LargeClusterTips ´ë±Ô¸ð Ŭ·¯½ºÅÍ ¿î¿ë ÆÁ] * [http://wiki.apache.org/hadoop/VirtualCluster °¡»ó ¸Ó½ÅÀ» ÀÌ¿ëÇÑ Å¬·¯½ºÅÍ ±¸¼º] * [http://wiki.apache.org/hadoop/DiskSetup µð½ºÅ© ¼Â¾÷¿¡ °üÇÑ Á¶¾ð] * [http://wiki.apache.org/hadoop/PerformanceTuning ¼º´É] ¼º´É Æ©´× * [http://v-lad.org/Tutorials/Hadoop/00%20-%20Intro.html ÇÏµÓ À©µµ¿ì/ÀÌŬ¸³½º Æ©Å丮¾ó] * ¸Ê/¸®µà½º * [http://wiki.apache.org/hadoop/HadoopMapReduce ÇÏµÓ ¸Ê/¸®µà½º] * [http://wiki.apache.org/hadoop/HadoopMapRedClasses ÇÏµÓ ¸Ê/¸®µà½º Ŭ·¡½º] * [http://wiki.apache.org/hadoop/HowManyMapsAndReduces ÇÊ¿äÇÑ ¸Ê, ¸®µà½º °³¼ö Ãß»ê¹ý] * [http://wiki.apache.org/hadoop/TaskExecutionEnvironment ½ÇÇàȯ°æ] * [http://wiki.apache.org/hadoop/HowToDebugMapReducePrograms ¸Ê/¸®µà½º ÇÁ·Î±×·¥ µð¹ö±ë] * ¿¹Á¦ * [http://wiki.apache.org/hadoop/WordCount WordCount] * [http://wiki.apache.org/hadoop/PythonWordCount Python Word Count] * [http://wiki.apache.org/hadoop/C++WordCount C/C++ Word Count] * [http://wiki.apache.org/hadoop/Grep Grep] * [http://wiki.apache.org/hadoop/Sort Sort] * [http://wiki.apache.org/hadoop/RandomWriter RandomWriter] * [http://wiki.apache.org/hadoop/HadoopDfsReadWriteExample HDFS¿¡ ÀÐ°í ¾²±â] * ¾Æ¸¶Á¸(Amazon) * [http://wiki.apache.org/hadoop/AmazonEC2 AmazonEC2¸¦ ÀÌ¿ëÇÑ ÇÏµÓ ½ÇÇà] * [http://wiki.apache.org/hadoop/AmazonS3 AmazonS3¸¦ ÀÌ욯ÇÑ ÇÏµÓ ½ÇÇà] * º¥Ä¡¸¶Å© * [http://wiki.apache.org/hadoop/HardwareBenchmarks Çϵå¿þ¾î º¥Ä¡¸¶Å©] * [http://wiki.apache.org/hadoop/DataProcessingBenchmarks µ¥ÀÌÅÍ Ã³¸® º¥Ä¡¸¶Å©] * ¼­ºê ÇÁ·ÎÁ§Æ® * [http://wiki.apache.org/hadoop/Hbase :Hbase], a Bigtable-like structured storage system for Hadoop HDFS * [http://wiki.apache.org/pig/ Apache Pig] is a high-level data-flow language and execution framework for parallel computation. It is built on top of Hadoop Core. * [http://wiki.apache.org/hadoop/Hive Hive] a data warehouse infrastructure which allows sql-like adhoc querying of data (in any format) stored in Hadoop * [http://wiki.apache.org/hadoop/ZooKeeper ZooKeeper] is a high-performance coordination service for distributed applications. * Contrib * [http://wiki.apache.org/hadoop/HadoopStreaming HadoopStreaming] (Useful for using Hadoop with other programming languages) * [http://wiki.apache.org/hadoop/DistributedLucene DistributedLucene], a Proposal for a distributed Lucene index in Hadoop * [http://wiki.apache.org/hadoop/MountableHDFS MountableHDFS], Fuse-DFS & other Tools to mount HDFS as a standard filesystem on Linux (and some other Unix OSs) * [http://wiki.apache.org/hadoop/HDFS-APIs HDFS-APIs] in perl, python, php, etc * [http://wiki.apache.org/hadoop/Chukwa Chukwa] a data collection, storage, and analysis framework == °³¹ßÀÚ ¹®¼­ == * [http://wiki.apache.org/hadoop/Roadmap Roadmap], listing release plans. * [http://wiki.apache.org/hadoop/HowToContribute HowToContribute] * [http://wiki.apache.org/hadoop/HowToDevelopUnitTests HowToDevelopUnitTests] * [http://wiki.apache.org/hadoop/HowToSetupYourDevelopmentEnvironment HowToSetupYourDevelopmentEnvironment] * [:CodeReviewChecklist: HowToCodeReview] * [http://wiki.apache.org/hadoop/CodeReviewChecklist Jira] usage guidelines * [http://wiki.apache.org/hadoop/HowToCommit HowToCommit] * [http://wiki.apache.org/hadoop/HowToRelease HowToRelease] * [http://wiki.apache.org/hadoop/HudsonBuildServer HudsonBuildServer] * [http://wiki.apache.org/hadoop/DevelopmentHints DevelopmentHints] * [http://wiki.apache.org/hadoop/ProjectSuggestions ProjectSuggestions] * [http://wiki.apache.org/hadoop/HadoopUnderIDEA Building/Testing under IntelliJ IDEA] == °ü·Ã ¸®¼Ò½º == * [http://wiki.apache.org/nutch/NutchHadoopTutorial Nutch Hadoop Tutorial] (Useful for understanding Hadoop in an application context) * [http://www.alphaworks.ibm.com/tech/mapreducetools IBM MapReduce Tools for Eclipse] (An Eclipse plug-in that simplifies the creation and deployment of MapReduce programs) * Hadoop IRC channel is #hadoop at irc.freenode.net. * [http://www.tom-doehler.de/wordpress/index.php/2007/12/19/spring-and-hadoop/ Using Spring and Hadoop] (Discussion of possibilities to use Hadoop and Dependency Injection with Spring) * [http://wiki.apache.org/hama Hama], a Distributed Matrix Computational Package based on Hadoop Map/Reduce * [http://heart.korea.ac.kr Heart], a Planet-Scale RDF Data Store and a Distributed Processing Engine * [http://lucene.apache.org/mahout Mahout], scalable Machine Learning algorithms using Hadoop * [http://opensolaris.org/os/project/livehadoop/ Live Hadoop] A three-node, distributed Hadoop cluster running on an !OpenSolaris live CD * [https://rc.usf.edu/trac/hadoop/wiki/SGEIntegration SGE Integration] A guide on tight-integration of Hadoop on Sun Gridengine