Hive basic operations and applications

Running WordCount with Hive on Hadoop

Start Hadoop

cd /usr/local/hadoop
./sbin/start-dfs.sh
cd /usr/local/hive/lib
service mysql start
start-all.sh

Create a directory on HDFS

hdfs dfs -mkdir test
hdfs dfs -ls /user/hadoop
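
The upload step expects a local file named test.txt in the current directory. The original does not show its contents, so the lines below are a hypothetical sample, given only as a minimal sketch for trying the walkthrough end to end.

```shell
# Hypothetical sample input: the original post does not show what test.txt contains.
# Each line becomes one row in the docs table once it is loaded into Hive.
printf '%s\n' 'hello world' 'hello hive' 'hello hadoop' > test.txt

# Show the file to confirm what will be uploaded.
cat test.txt
```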

Upload the file to HDFS

hdfs dfs -put ./test.txt test
hdfs dfs -ls /user/hadoop/test

Start Hive

Create the raw document table

hive
create table docs(line string);

Load the file contents into table docs and view them

load data inpath '/user/hadoop/test/test.txt' overwrite into table docs;
select * from docs;

Count word frequencies with HQL and store the result in table word_count

create table word_count as select word,count(1) as count from (select explode(split(line," ")) as word from docs) word group by word order by word;
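
To see what the query does, the same pipeline can be mimicked locally with standard shell tools. This is only a rough sketch for cross-checking the Hive result, not how Hive executes the query. The file names sample.txt and counts.txt and the sample text are assumptions made for illustration.

```shell
# Hypothetical sample input standing in for test.txt (contents are an assumption).
printf '%s\n' 'hello world' 'hello hive' 'hello hadoop' > sample.txt

# tr replaces every space with a newline, giving one word per line,
#   which plays the role of explode(split(line," ")).
# sort puts identical words next to each other in order,
#   which plays the role of "group by word order by word".
# uniq -c counts each run of identical words, which plays the role of count(1).
# awk swaps the columns so each output line reads: word <TAB> count.
tr ' ' '\n' < sample.txt | LC_ALL=C sort | uniq -c | awk '{print $2 "\t" $1}' > counts.txt

cat counts.txt
```

Running the same commands on the real input before it is loaded gives a list that can be compared with the rows of word_count.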

View the results

show tables;
select * from word_count;

Source: http://www.bubuko.com/infodetail-2603979.html
