从hdfs上导入表

从 HDFS 文件中导入数据到 SequoiaDB 表

  1. hive> insert overwrite table sdb_tab select * from hdfs_tab;
  2. Total MapReduce jobs = 1
  3. Launching Job 1 out of 1
  4. Number of reduce tasks is set to 0 since there's no reduce operator
  5. Starting Job = job_201310172156_0010, Tracking URL = http://bl465-5:50030/jobdetails.jsp?jobid=job_201310172156_0010
  6. Kill Command = /opt/hadoop-hive/hadoop-1.2.1/libexec/../bin/hadoop job -kill job_201310172156_0010
  7. Hadoop job information for Stage-0: number of mappers: 1; number of reducers: 0
  8. 2013-10-18 04:44:47,733 Stage-0 map = 0%, reduce = 0%
  9. 2013-10-18 04:44:49,763 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.85 sec
  10. 2013-10-18 04:44:50,777 Stage-0 map = 100%, reduce = 0%, Cumulative CPU 1.85 sec
  11. 2013-10-18 04:44:51,795 Stage-0 map = 100%, reduce = 100%, Cumulative CPU 1.85 sec
  12. MapReduce Total cumulative CPU time: 1 seconds 850 msec
  13. Ended Job = job_201310172156_0010
  14. 10 Rows loaded to sdb_tab
  15. MapReduce Jobs Launched:
  16. Job 0: Map: 1 Cumulative CPU: 1.85 sec HDFS Read: 2301 HDFS Write: 0 SUCCESS
  17. Total MapReduce CPU Time Spent: 1 seconds 850 msec
  18. OK
  19. Time taken: 12.201 seconds

Note:

在导入数据到 SequoiaDB 表之前,请确保已经创建基于 HDFS 文件的 hdfs_tab 数据表,并 load 了数据。