Hadoop Wordscount Exam

Tech

Hadoop Wordscount Exam

glqdlt 2016. 9. 8. 11:01

#Word Count Exam

> cd hadoop/bin

하둡을 설치하고 bin 폴더로 들어간다.

> ./hadoop fs -mkdir input

대상을 담을 input 을 만들자.

> vi /opt/test.txt

opt 폴더에 test.txt 란 검색 docu를 만들어본다.

[test.txt]

hello

hadoop

hello hadoop

hello glqdlt

...

> w q

이런 식으로 기입하고 저장한다.

> ./hadoop fs -put /opt/test.txt input

test.txt 를 input 에 넣는다.

> ./hadoop jar ../hadoop-example ~~ .jar wordcount input output

아까 만들고 넣었던 input 을 hadoop-example ~~ .jar 의 wordcount 란 메소드로 실행하겠다는 의미 input은 대상이고 output은 로그가 나올 대상임

샬라샬라하고 job scedule이 실행될거

> ./hadoop dfs -ls output

로 잘됬는가 확인해본다. (part-r-00000) 이라고 나오면 요놈이 word count 결과 로그임

> ./hadoop dfs -cat output/part-r-00000

cat으로 결과 로그를 한번 찍어봄

hello 5

hadoop 2

jhun 1

호우! 이렇게 나올거다, 우왕굳

> ./hadoop dfs - get output/part-r-00000 /opt/test.txt.result

결과를 로그파일(test.txt.result)로 get 해보자

대충 이런식.

저작자표시 비영리 변경금지 (새창열림)

'Tech' 카테고리의 다른 글

Virtualizaion 이란 (0)	2016.11.09
ELK 힙사이즈 확장 (0)	2016.11.09
'automatic updates download' has encountered a problem. an err occurred while colletcing items to be installed (0)	2016.09.02
Cuckoo 1.2 On Ubuntu 16.04 (0)	2016.08.02
실행 창에서 설치 프로그램 실행하기(bat 파일 제작 응용) (0)	2016.02.23

현재글Hadoop Wordscount Exam

Freesia

github : https://github.com/glqdlt

Today :
Yesterday :

일	월	화	수	목	금	토
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Freesia