黑猴子的家:Kylin 快速入门之数据准备

2020-02-07  本文已影响0人  黑猴子的家

在Hive中创建数据,分别创建部门和员工外部表,并向表中导入数据。

1、原始数据

1)dept

10  ACCOUNTING  1700
20  RESEARCH    1800
30  SALES   1900
40  OPERATIONS  1700

2)emp

7369    SMITH   CLERK   7902    1980-12-17  800.00      20
7499    ALLEN   SALESMAN    7698    1981-2-20   1600.00 300.00  30
7521    WARD    SALESMAN    7698    1981-2-22   1250.00 500.00  30
7566    JONES   MANAGER 7839    1981-4-2    2975.00     20
7654    MARTIN  SALESMAN    7698    1981-9-28   1250.00 1400.00 30
7698    BLAKE   MANAGER 7839    1981-5-1    2850.00     30
7782    CLARK   MANAGER 7839    1981-6-9    2450.00     10
7788    SCOTT   ANALYST 7566    1987-4-19   3000.00     20
7839    KING    PRESIDENT       1981-11-17  5000.00     10
7844    TURNER  SALESMAN    7698    1981-9-8    1500.00 0.00    30
7876    ADAMS   CLERK   7788    1987-5-23   1100.00     20
7900    JAMES   CLERK   7698    1981-12-3   950.00      30
7902    FORD    ANALYST 7566    1981-12-3   3000.00     20
7934    MILLER  CLERK   7782    1982-1-23   1300.00     10

2、Hive 表操作

1)选中数据库

[alex@hadoop102 hive]$ bin/hive
hive> show databases;
hive> use default;

2)创建部门表

create external table if not exists default.dept(
deptno int,
dname string,
loc int
)
row format delimited fields terminated by '\t';

3)创建员工表

create external table if not exists default.emp(
empno int,
ename string,
job string,
mgr int,
hiredate string,
sal double, 
comm double,
deptno int)
row format delimited fields terminated by '\t';

4)查看Hive创建的表

hive (default)> show tables;
OK
tab_name
dept
emp

5)向外部表中导入数据
导入数据

hive (default)> load data local inpath '/opt/module/data/dept.txt' into table default.dept;
hive (default)> load data local inpath '/opt/module/data/emp.txt' into table default.emp;

6)查看表数据

hive (default)> select * from emp;
hive (default)> select * from dept;
上一篇 下一篇

猜你喜欢

热点阅读