Found 4 posts matching the search.
2017-11-06
Viewing a Hive Table's Storage Path on HDFS
1. Run `hive` to enter the Hive shell.
2. Run `show databases;` to list all databases.
3. Run `use origin_ennenergy_onecard;` to switch to the origin_ennenergy_onecard database.
4. Run `show create table M_BD_T_GAS_ORDER_INFO_H;` to see where the table is stored on HDFS:

```
hive (origin_ennenergy_onecard)> show create table M_BD_T_GAS_ORDER_INFO_H;
OK
CREATE TABLE `M_BD_T_GAS_ORDER_INFO_H`(
  `fguid` string,
  `fstationno` string,
  `fstationname` string,
  `fgunno` int,
  `fserialno` int,
  `fgas` double,
  `fprice` double,
  `fmoney` double,
  `fsumgas` double,
  `ftradedatetime` date,
  `fstopdatetime` date,
  `fsavedatetime` date,
  `ffueltype` string,
  `recorddate` date)
ROW FORMAT DELIMITED
  FIELDS TERMINATED BY '\t'
STORED AS INPUTFORMAT
  'org.apache.hadoop.mapred.TextInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION
  'hdfs://mycluster/user/hive/warehouse/origin_ennenergy_onecard.db/m_bd_t_gas_order_info_h'   <-- this is the HDFS path
TBLPROPERTIES (
  'COLUMN_STATS_ACCURATE'='true',
  'numFiles'='6',
  'numRows'='3546198',
  'rawDataSize'='435279808',
  'totalSize'='438826006',
  'transient_lastDdlTime'='1468831756')
Time taken: 0.279 seconds, Fetched: 30 row(s)
```

Other handy Hive commands:

- `show functions;` lists all Hive functions.
- `desc tablename;` shows a table's schema.

Thanks to "sborgite" for the reminder!
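When the same lookup needs to be scripted rather than read off the console, the path can be pulled out of the `show create table` output with a small text scan. A minimal sketch, assuming the DDL text has already been captured (for example via `hive -e 'show create table ...'`); the helper name is illustrative, not a Hive API:

```python
import re

def extract_hdfs_location(ddl):
    """Return the quoted path following the LOCATION keyword, or None if absent."""
    match = re.search(r"LOCATION\s+'([^']+)'", ddl, re.IGNORECASE)
    return match.group(1) if match else None

# Abbreviated DDL in the shape produced by "show create table"
ddl = """CREATE TABLE `M_BD_T_GAS_ORDER_INFO_H`(
  `fguid` string)
LOCATION
  'hdfs://mycluster/user/hive/warehouse/origin_ennenergy_onecard.db/m_bd_t_gas_order_info_h'
TBLPROPERTIES ('numFiles'='6')"""

print(extract_hdfs_location(ddl))
# prints the hdfs://mycluster/... warehouse path
```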
0 reads · 0 comments · 0 likes
2017-11-03
Hive: Deleting Table Data with TRUNCATE
DELETE: removes rows matching a given condition; with no condition it removes every row in the table.
TRUNCATE: removes all rows; for data tracked in the Hive metastore this operation is irreversible.
DROP: removes the table itself from Hive.

TRUNCATE cannot be used on external tables, because Hive does not manage an external table's data (the metastore only holds its metadata).

Syntax: `truncate table table_name;`
Example: `truncate table employees;`
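Since TRUNCATE fails on external tables, a script that issues it can check the table type first. A rough sketch that inspects the text printed by `describe formatted <table>` (the function names here are hypothetical helpers, not part of any Hive client library):

```python
def is_external(describe_output):
    """'describe formatted <table>' prints a line like 'Table Type: EXTERNAL_TABLE'."""
    return "EXTERNAL_TABLE" in describe_output

def truncate_statement(table, describe_output):
    """Build a TRUNCATE statement, refusing external tables up front."""
    if is_external(describe_output):
        raise ValueError(
            f"cannot TRUNCATE external table {table}; "
            "Hive does not manage its data files")
    return f"TRUNCATE TABLE {table};"

print(truncate_statement("employees", "Table Type: MANAGED_TABLE"))
# prints: TRUNCATE TABLE employees;
```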
2 reads · 0 comments · 0 likes
2017-10-24
Testing INSERT, UPDATE, and DELETE in Hive 0.14
Questions this post answers:

1. INSERT fails during testing: how do you fix it?
2. DELETE and UPDATE fail: how do you fix them?
3. Under what conditions are DELETE and UPDATE allowed at all?

First, create a table with the most ordinary DDL:

```
hive> create table test(id int, name string) row format delimited fields terminated by ',';
```

Test INSERT:

```
hive> insert into table test values (1,'row1'),(2,'row2');
```

It fails:

```
java.io.FileNotFoundException: File does not exist: hdfs://127.0.0.1:9000/home/hadoop/git/hive/packaging/target/apache-hive-0.14.0-SNAPSHOT-bin/apache-hive-0.14.0-SNAPSHOT-bin/lib/curator-client-2.6.0.jar
    at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1128)
    at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1120)
    at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
    at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1120)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:288)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.getFileStatus(ClientDistributedCacheManager.java:224)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestamps(ClientDistributedCacheManager.java:99)
    at org.apache.hadoop.mapreduce.filecache.ClientDistributedCacheManager.determineTimestampsAndCacheVisibilities(ClientDistributedCacheManager.java:57)
    at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:265)
    at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:301)
    at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:389)
    at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1285)
    at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1282)
    at java.security.AccessController.doPrivileged(Native Method)
    ......
```

Hive is apparently looking for its jars on HDFS. A small problem: just upload the contents of lib to that HDFS path:

```
hadoop fs -mkdir -p /home/hadoop/git/hive/packaging/target/apache-hive-0.14.0-SNAPSHOT-bin/apache-hive-0.14.0-SNAPSHOT-bin/lib/
hadoop fs -put $HIVE_HOME/lib/* /home/hadoop/git/hive/packaging/target/apache-hive-0.14.0-SNAPSHOT-bin/apache-hive-0.14.0-SNAPSHOT-bin/lib/
```

INSERT now runs without problems. Next, test DELETE:

```
hive> delete from test where id = 1;
```

It errors out:

```
FAILED: SemanticException [Error 10294]: Attempt to do update or delete using transaction manager that does not support these operations.
```

The transaction manager in use does not support UPDATE and DELETE. Supporting them requires some extra configuration; see https://cwiki.apache.org/conflue ... tersforTransactions. Following the hints, configure hive-site.xml:

```
hive.support.concurrency – true
hive.enforce.bucketing – true
hive.exec.dynamic.partition.mode – nonstrict
hive.txn.manager – org.apache.hadoop.hive.ql.lockmgr.DbTxnManager
hive.compactor.initiator.on – true
hive.compactor.worker.threads – 1
```

With that in place things should have run smoothly, but instead a new error appeared:

```
FAILED: LockException [Error 10280]: Error communicating with the metastore
```

Something is wrong with the metastore. Set the log level to DEBUG to see the specific error:

```
2014-11-04 14:20:14,367 DEBUG [Thread-8]: txn.CompactionTxnHandler (CompactionTxnHandler.java:findReadyToClean(265)) - Going to execute query <select cq_id, cq_database, cq_table, cq_partition, cq_type, cq_run_as from COMPACTION_QUEUE where cq_state = 'r'>
2014-11-04 14:20:14,367 ERROR [Thread-8]: txn.CompactionTxnHandler (CompactionTxnHandler.java:findReadyToClean(285)) - Unable to select next element for cleaning, Table 'hive.COMPACTION_QUEUE' doesn't exist
2014-11-04 14:20:14,367 DEBUG [Thread-8]: txn.CompactionTxnHandler (CompactionTxnHandler.java:findReadyToClean(287)) - Going to rollback
2014-11-04 14:20:14,368 ERROR [Thread-8]: compactor.Cleaner (Cleaner.java:run(143)) - Caught an exception in the main loop of compactor cleaner, MetaException(message: Unable to connect to transaction database com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: Table 'hive.COMPACTION_QUEUE' doesn't exist
    at sun.reflect.GeneratedConstructorAccessor19.newInstance(Unknown Source)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:526)
    at com.mysql.jdbc.Util.handleNewInstance(Util.java:409)
```

The COMPACTION_QUEUE table cannot be found in the metastore database, and checking MySQL confirms it really does not exist. Why not? After searching for a long time without finding a cause, it was time to read the source. The TxnDbUtil class under org.apache.hadoop.hive.metastore.txn contains the DDL for these tables, and following the thread leads to the method that invokes it:

```
private void checkQFileTestHack() {
    boolean hackOn = HiveConf.getBoolVar(conf, HiveConf.ConfVars.HIVE_IN_TEST) ||
        HiveConf.getBoolVar(conf, HiveConf.ConfVars.HIVE_IN_TEZ_TEST);
    if (hackOn) {
        LOG.info("Hacking in canned values for transaction manager");
        // Set up the transaction/locking db in the derby metastore
        TxnDbUtil.setConfValues(conf);
        try {
            TxnDbUtil.prepDb();
        } catch (Exception e) {
            // We may have already created the tables and thus don't need to redo it.
            if (!e.getMessage().contains("already exists")) {
                throw new RuntimeException("Unable to set up transaction database for" +
                    " testing: " + e.getMessage());
            }
        }
    }
}
```

In other words, the DDL only runs when HIVE_IN_TEST or HIVE_IN_TEZ_TEST is set: DELETE and UPDATE only work in a test environment. Understandable, since the feature was not yet fully developed. Having finally found the cause, the fix is simple: add the following to hive-site.xml:

```
<property>
    <name>hive.in.test</name>
    <value>true</value>
</property>
```

OK, restart the service and run DELETE again:

```
hive> delete from test where id = 1;
```

Another error:

```
FAILED: SemanticException [Error 10297]: Attempt to do update or delete on table default.test that does not use an AcidOutputFormat or is not bucketed
```

The table targeted by DELETE must use an AcidOutputFormat and must be bucketed. Searching online confirms this, and at the moment only ORCFileFormat supports AcidOutputFormat; on top of that, the table must be created with ('transactional'='true'). Quite a lot of hoops. Recreate the table following the examples online:

```
hive> create table test(id int, name string) clustered by (id) into 2 buckets stored as orc TBLPROPERTIES ('transactional'='true');
```

INSERT:

```
hive> insert into table test values (1,'row1'),(2,'row2'),(3,'row3');
```

DELETE:

```
hive> delete from test where id = 1;
```

UPDATE:

```
hive> update test set name = 'Raj' where id = 2;
```

OK! Everything runs, although it seems rather slow: each statement takes about 30 s. There is presumably room for optimization; something to investigate further.
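The three prerequisites discovered above (bucketing, ORC storage, and the transactional table property) can be sanity-checked before attempting DML. A rough sketch that does a plain text scan of a CREATE TABLE statement; this is an illustrative helper, not a real Hive validator:

```python
def acid_dml_ready(create_stmt):
    """Return the ACID prerequisites missing from a CREATE TABLE statement."""
    s = create_stmt.lower()
    missing = []
    if "clustered by" not in s:
        missing.append("bucketing (CLUSTERED BY ... INTO n BUCKETS)")
    if "stored as orc" not in s:
        missing.append("ORC storage (only ORC supports AcidOutputFormat)")
    # Ignore spacing differences around the = sign in TBLPROPERTIES
    if "'transactional'='true'" not in s.replace(" ", ""):
        missing.append("TBLPROPERTIES ('transactional'='true')")
    return missing

ddl = ("create table test(id int, name string) clustered by (id) into 2 buckets "
       "stored as orc TBLPROPERTIES ('transactional'='true')")
print(acid_dml_ready(ddl))  # → [] (all prerequisites present)
```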
0 reads · 0 comments · 0 likes
2017-10-24
Connecting Oracle SQL Developer to HiveServer
Download SQL Developer 4.1.5 from http://www.oracle.com/technetwork/developer-tools/sql-developer/downloads/index.html and unzip it. Download the Hive JDBC Driver for Oracle SQL Developer from http://www.cloudera.com/downloads/connectors/hive/jdbc/2-5-15.html, unzip it, enter the extracted directory, and unzip Cloudera_HiveJDBC4_2.5.15.1040.zip. Open sqldeveloper.exe, go to Tools → Preferences, and under Database → Third Party JDBC Drivers add the Hive JDBC driver. Restart sqldeveloper after adding it. When sqldeveloper reopens, clicking "New Connection" now offers a "Hive" database type, and you can connect to Hive.
1 read · 0 comments · 0 likes