Hbase0.96与Java的交互访问

作者：swa乄ycat曼颜 | 来源：互联网 | 2023-09-24 10:35

hbase-0.96.x相对hbase-0.94.x的改变hadoop:hadoop-2.2.0hbase:hbase-0.96.01.org.apache.hadoop.hbase.client.

hbase-0.96.x相对hbase-0.94.x的改变

hadoop:hadoop-2.2.0
hbase:hbase-0.96.0
1.org.apache.hadoop.hbase.client.Put
<1>取消了无参的构造方法
<2>Put类不再继承Writable类
0.94.6时public class Put extends Mutation implements HeapSize, Writable, Comparable
0.96.0时public class Put extends Mutation implements HeapSize, Comparable
解决方法：
由public class MonthUserLoginTimeIndexReducer extends Reducer {
改public class MonthUserLoginTimeIndexReducer extends Reducer {
2.org.apache.hadoop.hbase.client.Mutation.familyMap
org.apache.hadoop.hbase.client.Mutation.familyMap类型改变：
/**
* 0.94.6
* protected Map> familyMap
*
* 0.96.*
* protected NavigableMap> familyMap
* org.apache.hadoop.hbase.Cell hbase-0.94.*中是没有的
*/

org.apache.hadoop.hbase.KeyValue的改变：
/**
* 0.94.*
* public class KeyValue extends Object implements Writable, HeapSize
*
* 0.96.0
* public class KeyValue extends Object implements Cell, HeapSize, Cloneable
*/
解决方法：将代码中的List改成List
3. org.apache.hadoop.hbase.KeyValue
0.96.0中方法getFamily已被弃用（Deprecated），改成方法getFamilyArray()
4.org.apache.hadoop.hbase.HTableDescriptor
类org.apache.hadoop.hbase.HTableDescriptor的构造方法public HTableDescriptor(String name)已被弃用（Deprecated）
解决方法：使用public HTableDescriptor(TableName name)
旧：HTableDescriptor tableDesc = new HTableDescriptor(tableName);
新：HTableDescriptor tableDesc = new HTableDescriptor(TableName.valueOf(tableName));
5.org.apache.hadoop.hbase.client.HTablePool
类org.apache.hadoop.hbase.client.HTablePool整个被弃用（Deprecated）
解决方法：使用HConnection.getTable(String)代替，HConnection是个接口，类CoprocessorHConnection是它唯一的实现类：
HRegionServer hRegiOnServer= new HRegionServer(conf) ;
HConnection cOnnection= HConnectionManager.createConnection(conf);
hCOnnection= new CoprocessorHConnection(connection,hRegionServer);
6.org.apache.hadoop.hbase.client.Result
方法public KeyValue[] raw()被弃用（Deprecated），建议使用public Cell[] rawCells()
方法getRow被弃用（Deprecated）
方法getFamily被弃用（Deprecated）
方法getQualifier被弃用（Deprecated）
方法getValue被弃用（Deprecated）
方法public List getColumn(byte[] family,byte[] qualifier)被弃用（Deprecated）
方法public KeyValue getColumnLatest(byte[] family,byte[] qualifier)被弃用（Deprecated）
Cell中：改成以下方法
getRowArray()
getFamilyArray()
getQualifierArray()
getValueArray()
Result中：增加如下方法
public List getColumnCells(byte[] family,byte[] qualifier)
public KeyValue getColumnLatestCell(byte[] family,byte[] qualifier)
改动：所有ipeijian_data中凡是和【新增用户活跃用户流失用户】相关的都做如下变化：
旧代码：if (value.raw().length == 1
新代码：if (value.rawCells().length == 1
7.job中设置TableInputFormat.SCAN
0.96.0中去掉了方法：public void write(DataOutput out)throws IOException
之前版本使用conf.set(TableInputFormat.SCAN, StatUtils.convertScanToString(scan));进行设置
StatUtils.convertScanToString的具体实现为:
public static String convertScanToString(Scan scan) throws IOException {
ByteArrayOutputStream out = new ByteArrayOutputStream();
DataOutputStream dos = new DataOutputStream(out);
scan.write(dos);
return Base64.encodeBytes(out.toByteArray());
}
该方法的实现与TableMapReduceUtil.convertScanToString(Scan scan)是一样的。
但是当hbase升级到了0.96.*是对于类Scan弃用（不仅仅是Deprecated，而是Deleted）了方法write，所以上面
的实现变为不正确
hbase0.96.*中对该方法进行了重新的实现：
public static String convertScanToString(Scan scan) throws IOException {
ClientProtos.Scan proto = ProtobufUtil.toScan(scan)；
return Base64.encodeBytes(proto.toByteArray());
}
所以做如下更改：
StatUtils类中方法convertScanToString的实现做如上更改以适配hbase0.96.*
8.cn.m15.ipj.db.hbase.MyPut
自定义的Put类，比传统的Put类多一个length，原版和新版代码比较：
原版：（红色字体为API变为新版时报错的地方）

public class MyPut extends Put {
public MyPut(byte[] row, int length) {
   //原因是put的无参构造方法已经在新本中消失
if (row == null || length > HConstants.MAX_ROW_LENGTH) {
throw new IllegalArgumentException(“Row key is invalid”);
}
this.row = Arrays.copyOf(row, length);
this.ts = HConstants.LATEST_TIMESTAMP;
}
public MyPut add(byte[] family, byte[] qualifier, long ts, byte[] value,int length) {
List list = getKeyValueList(family);
KeyValue kv = createPutKeyValue(family, qualifier, ts, value, length);
list.add(kv);
familyMap.put(kv.getFamily(), list);
  //familyMap的类型已经改变
return this;
}
private List getKeyValueList(byte[] family) {
List list = familyMap.get(family);
  //familyMap的类型已经改变
if (list == null) {
list = new ArrayList(0);
}
return list;
}
private KeyValue createPutKeyValue(byte[] family, byte[] qualifier,long ts, byte[] value, int length) {
return new KeyValue(this.row, 0, this.row.length, family, 0,
family.length, qualifier, 0, qualifier.length, ts,
KeyValue.Type.Put, value, 0, length);
}
}

更改之后:

public MyPut(byte[] row, int length) {
super(row,length);
//新增加
if (row == null || length > HConstants.MAX_ROW_LENGTH) {
throw new IllegalArgumentException(“Row key is invalid”);
}
this.row = Arrays.copyOf(row, length);
this.ts = HConstants.LATEST_TIMESTAMP;
}
public MyPut add(byte[] family, byte[] qualifier, long ts, byte[] value,int length) {
List<Cell> list = getCellsList(family);
KeyValue kv = createPutKeyValue(family, qualifier, ts, value, length);
list.add(kv);
familyMap.put(CellUtil.cloneFamily(kv), list);
return this;
}
private List getCellsList(byte[] family) {
List<Cell> list = familyMap.get(family);
if (list == null) {
list = new ArrayList<Cell>(0);
}
return list;
}
private KeyValue createPutKeyValue(byte[] family, byte[] qualifier,long ts, byte[] value, int length) {
return new KeyValue(this.row, 0, this.row.length, family, 0,family.length, qualifier, 0, qualifier.length, ts,
KeyValue.Type.Put, value, 0, length);
}
}

package com.test.hbase;

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.CellUtil;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.HColumnDescriptor;
import org.apache.hadoop.hbase.HTableDescriptor;
import org.apache.hadoop.hbase.MasterNotRunningException;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.ZooKeeperConnectionException;
import org.apache.hadoop.hbase.client.Delete;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.HBaseAdmin;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.hbase.Cell;

public class HbaseTest {

    private static Configuration cOnf= null;
    /**
     * 初始化配置
     */
    static {
        conf = HBaseConfiguration.create();
    }

    /**
     * 创建表操作
     * 
     * @throws IOException
     * @throws ZooKeeperConnectionException
     * @throws MasterNotRunningException
     */
    public void createTable(String tableName, String[] cfs)
            throws MasterNotRunningException, ZooKeeperConnectionException,
            IOException {
        HBaseAdmin admin = new HBaseAdmin(conf);
        if (admin.tableExists(tableName)) {
            System.out.println("表已经存在!");
        } else {
            HTableDescriptor tableDesc = new HTableDescriptor(
                    TableName.valueOf(tableName));
            for (int i = 0; i ) {
                tableDesc.addFamily(new HColumnDescriptor(cfs[i]));
            }
            admin.createTable(tableDesc);
            admin.close();
            System.out.println("表创建成功！");
        }
    }

    /**
     * 删除表操作
     */
    public void deleteTable(String tableName) {
        HBaseAdmin admin;
        try {
            admin = new HBaseAdmin(conf);
            admin.disableTable(tableName);
            admin.deleteTable(tableName);
            admin.close();
            System.out.println("表删除成功！");
        } catch (MasterNotRunningException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        } catch (ZooKeeperConnectionException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }

    }

    /**
     * 插入一行记录
     */
    public void writeRow(String tableName, String[] cfs) {
        try {
            HTable table = new HTable(conf, tableName);
            Put put = new Put(Bytes.toBytes("row1"));
            for (int j = 0; j ) {
                put.add(Bytes.toBytes(cfs[j]),
                        Bytes.toBytes(String.valueOf(1)),
                        Bytes.toBytes("value_1"));
            }
            table.close();
            System.out.println("添加成功！");
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }
    }

    /**
     * 删除一行记录
     */
    public void deleteRow(String tableName, String rowKey) {
        try {
            HTable table = new HTable(conf, tableName);
            Delete dl = new Delete(rowKey.getBytes());
            table.delete(dl);
            table.close();
            System.out.println("删除行成功!");
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }

    }

    /**
     * 查找一条记录
     */
    public static void selectRow(String tableName, String rowKey) {
        try {
            HTable table = new HTable(conf, tableName);
            Get g = new Get(rowKey.getBytes());
            Result rs = table.get(g);
            System.out.println(rs);
            table.close();
            System.out.println("查询完成！");
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }
    }

    /**
     * 查询表中所有的行
     */
    public void scanner(String tableName) {
        try {
            HTable table = new HTable(conf, tableName);
            Scan s = new Scan();
            ResultScanner rs = table.getScanner(s);
            for (Result r : rs) {
                // keyvalue
                Cell[] cell = r.rawCells();
                System.out.println("长度：" + cell.length);
                for (int i = 0; i ) {
                    System.out.println("信息:"
                            + new String(CellUtil.cloneFamily(cell[i])) + " "
                            + new String(CellUtil.cloneQualifier(cell[i]))
                            + "  " + new String(CellUtil.cloneValue(cell[i]))
                            + " " + new String(CellUtil.cloneRow(cell[i])));
                }
                System.out.println("\n-----------------------");
            }
            table.close();
            System.out.println("执行结束！");
        } catch (IOException e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        }
    }

    public static void main(String[] args) {
        HbaseTest hbase = new HbaseTest();
        String tableName = "test";
        hbase.scanner(tableName);
    }
}

推荐阅读

ip
Socket 编程基础入门

本文介绍了一个基本的同步Socket程序，演示了如何实现客户端与服务器之间的简单消息传递。此外，文章还概述了Socket的基本工作流程，并计划在未来探讨同步与异步Socket的区别。 ... [详细]

蜡笔小新 2024-11-25 12:35:34
ip
如何使用 org.apache.tinkerpop.gremlin.structure.VertexProperty 的 key 方法

本文详细介绍了 `org.apache.tinkerpop.gremlin.structure.VertexProperty` 类中的 `key()` 方法，并提供了多个实际应用的代码示例。通过这些示例，读者可以更好地理解该方法在图数据库操作中的具体用途。 ... [详细]

蜡笔小新 2024-11-21 17:38:10
ip
C# Winform 实现局域网文件传输

本文介绍了如何使用C# Winform开发局域网内的文件传输功能，详细描述了从用户界面到后端网络通信的具体实现。 ... [详细]

蜡笔小新 2024-11-25 20:10:16
ip
Java中利用POI库读取Doc和Docx文件的方法

近期在研究Java IO流技术时，遇到了一个关于如何正确读取Doc文档而不出现乱码的问题。本文将详细介绍使用Apache POI库处理Doc和Docx文件的具体方法，包括必要的库引入和示例代码。 ... [详细]

蜡笔小新 2024-11-25 19:51:38
ip
Oracle RMAN 增量备份详解：差异增量与累积增量

本文详细介绍了Oracle RMAN中的增量备份机制，重点解析了差异增量和累积增量备份的概念及其在不同Oracle版本中的实现。通过对比两种备份方式的特点，帮助读者选择合适的备份策略。 ... [详细]

蜡笔小新 2024-11-25 19:07:53
ip
PHP中处理HTTP头部信息的方法与技巧

本文详细介绍了在PHP中如何获取和处理HTTP头部信息，包括通过cURL获取请求头信息、使用header函数发送响应头以及获取客户端HTTP头部的方法。同时，还探讨了PHP中$_SERVER变量的使用，以获取客户端和服务器的相关信息。 ... [详细]

蜡笔小新 2024-11-24 16:12:27
ip
使用UploadProgress实现文件上传进度条

本文介绍了如何通过安装和配置php_uploadprogress扩展来实现文件上传时的进度条显示功能。通过一个简单的示例，详细解释了从安装扩展到编写具体代码的全过程。 ... [详细]

蜡笔小新 2024-11-24 00:04:13
ip
物联网学习之旅：微信小程序控制STM32（三）STM32代码编写

STM32代码编写STM32端不需要写关于连接MQTT服务器的代码，连接的工作交给ESP8266来做，STM32只需要通过串口接收和发送数据，间接的与服务器交互。串口三配置串口一已 ... [详细]

蜡笔小新 2024-11-23 21:37:55
ip
解决JSP页面中文乱码问题的方法

本文详细探讨了在Windows 98环境下安装Apache 1.3.9、JServ、GNUJSP 1.0、JDK 1.2.2及JSDK 2.0后遇到的中文显示问题，并提供了多种有效的解决方案。 ... [详细]

蜡笔小新 2024-11-23 19:28:03
range
将datatable导出为excel的三种方式（转）

一、使用Microsoft.Office.Interop.Excel.DLL需要安装Office代码如下：2publicstaticboolExportExcel(S ... [详细]

蜡笔小新 2024-11-23 16:37:33
数组
Java实现优先队列：二叉堆详解

本文详细介绍了二叉堆的概念及其在Java中的实现方法。二叉堆是一种特殊的完全二叉树，具有堆性质，常用于实现优先队列。 ... [详细]

蜡笔小新 2024-11-19 12:52:35
object
Java线程停止策略详解

本文探讨了Java中有效停止线程的多种方法，包括使用标志位、中断机制及处理阻塞I/O操作等，旨在帮助开发者避免使用已废弃的危险方法，确保线程安全和程序稳定性。 ... [详细]

蜡笔小新 2024-11-25 19:16:54
ip
2022年4月15日算法练习题

2022年4月15日的算法练习题，包括最长公共子序列和线段树的应用。 ... [详细]

蜡笔小新 2024-11-25 19:10:45
input
Java程序设计复习题

1、编写一个Java程序在屏幕上输出“你好！”。programmenameHelloworld.javapublicclassHelloworld{publicst ... [详细]

蜡笔小新 2024-11-22 20:05:03
数组
探讨Java中将图像对象转换为文件和字节数组的方法

本文详细探讨了在Java中如何将图像对象转换为文件和字节数组（Byte[]）的技术。虽然网络上存在大量相关资料，但实际操作时仍需注意细节。本文通过使用JMSL 4.0库中的图表对象作为示例，提供了一种实用的方法。 ... [详细]

蜡笔小新 2024-11-21 21:42:59

swa乄ycat曼颜

这个家伙很懒，什么也没留下！

Tags | 热门标签

RankList | 热门文章