Hive操作HBase外表

更新时间：2025-08-21

创建 Hbase 表

Plain Text

1hbase shell

创建 hbase 表 hbase_hive_table：

Plain Text

1hbase(main):004:0> create 'hbase_hive_table', 'cf'
2Created table hbase_hive_table
3Took 1.3137 seconds

Hive 操作

进入 hive 环境

Plain Text

1hive

设置引擎为 mr MapReduce 引擎和本集群的 Hbase 环境已经调好，如果使用其他集群的 Hbase，可以用 add file hbase-site.xml 添加其他集群的配置文件（需要服务器打通）。TEZ 引擎需要额外的 Hbase 相关的配置，需使用 add jar 的方式把 hbase 的相关 jar 包放到执行环境里。

Plain Text

1set hive.execution.engine=mr;

创建外部表

Plain Text

1CREATE EXTERNAL TABLE hbase_hive_table (key int, value string) 
2STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
3WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf:val")
4TBLPROPERTIES ("hbase.table.name" = "hbase_hive_table", "hbase.mapred.output.outputtable" = "hbase_hive_table");

插入数据

Plain Text

1INSERT OVERWRITE TABLE HBASE_HIVE_TABLE values(98, 'abc');

检索数据

Plain Text

1select * from HBASE_HIVE_TABLE;
2OK
398	abc
4Time taken: 0.246 seconds, Fetched: 1 row(s)

Join 测试

创建表

Plain Text

1create table t1(t1_c1 int, t1_c2 string);
2insert into t1 values(1,'t1_key1'),(2,'t1_key2'),(3,'t1_key3');
3
4create table t2(t2_c1 int, t2_c2 string);
5insert into t2 values(2,'t2_key2'),(3,'t2_key3'),(4,'t2_key4');

插入数据

Plain Text

1INSERT OVERWRITE TABLE HBASE_HIVE_TABLE
2select t1_c1, concat(t1_c2, t2_c2) 
3from t1 join t2 on t1_c1 = t2_c1;

检索结果

Plain Text

1select * from HBASE_HIVE_TABLE;
2OK
32	t1_key2t2_key2
43	t1_key3t2_key3
598	abc
6Time taken: 0.141

Hive迁移

基础使用

MapReduce BMR

MapReduce BMR

Hive操作HBase外表

创建 Hbase 表

Hive 操作

Join 测试