facing problem in twitter influencer project


5 Answer(s)


attaching the contents of my hive-site.xml



Hi Subhra,

Check the highligted lines in the output:

hive> LOAD DATA INPATH '/user/cloudera/tweets/2016/10/16/17/' INTO TABLE tweets_partitioned;
Loading data to table class7.tweets_partitioned
Table class7.tweets_partitioned stats: [num_partitions: 0, num_files: 0, num_rows: 0, total_size: 0, raw_data_size: 0]
OK
Time taken: 0.515 seconds

hive> select * from tweets_partitioned;
OK
Failed with exception java.io.IOException:java.io.IOException: Not a file: hdfs://localhost.localdomain:8020/user/cloudera/tweets/2016
Time taken: 0.171 seconds

While writing the file into tweets_partitoned, there were no record written.

Could you please share the complete set of query you used and your flume.conf file you used for fetch tweeter data.

Thanks.


please find attached

flume.conf --> the keys are not the exact ones I am using

data_download.txt --> contains the twitter data downloaded into hdfs

table.txt --> The create table script that I have used

 

Thanks



I am able to solve it.

The problem occurred because I had created the hive table without a partitioned.

Now I am able to load data as well as query from the table


Hi Subhra,

Thanks for your confirmation.