Data Lost in HDFS


#1

Hi Team,

Can anyone please help me?

I uploaded data (CCA175.zip and ml-10m.zip), extracted it, and started working with it in Hue.
ml-10m.zip contains 3 CSV files. I created a Hive table and tried to load data into it with the queries below:

CREATE DATABASE IF NOT EXISTS movielense;
CREATE EXTERNAL TABLE IF NOT EXISTS movielense.movie_data(MovieID Int,Title String,Genres String)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '::'
LINES TERMINATED BY '\n' STORED AS textfile LOCATION '/user/shashireddy408417';
-- successfully created table
LOAD DATA INPATH '/user/shashireddy408417/ml-10m/ml-10M100K/movies.dat' OVERWRITE INTO TABLE movielense.movie_data;

After running the LOAD DATA command, I got this error:
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

I was really shocked: when I looked at the HDFS location, there was no data at all, not a single file.

Please let me know what the issue was.
What should I do if this happens while I'm working in an organisation and we lose the raw data as well?

Regards,
Shashi


#2

Please check whether you kept the data in HDFS or on the local file system.


#3

Please note that “LOAD DATA INPATH” moves the data from its HDFS location into the location of the table; it does not copy it.

It is good that you created an external table. Had you created a normal (managed) table, dropping the table would also delete that data.
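
A minimal sketch of a safer setup, under assumptions not confirmed in this thread. The table gets its own directory (the hive/movie_data path is hypothetical) instead of the home directory that also holds the raw files, and OVERWRITE is dropped. One plausible reading of the failure in #1 is that OVERWRITE cleared the table location, which contained the source file, before the move ran; treat that as a guess rather than a diagnosis. Because LOAD DATA INPATH still moves the file, it may be worth keeping a copy of the raw data first, for example with hdfs dfs -cp.

```sql
CREATE DATABASE IF NOT EXISTS movielense;

-- Hypothetical dedicated directory for this table, kept separate from
-- the home directory where the raw ml-10m files live.
-- The '::' delimiter is kept from the original post; ROW FORMAT DELIMITED
-- may honor only a single character, so check the parsed columns.
CREATE EXTERNAL TABLE IF NOT EXISTS movielense.movie_data (
  MovieID INT,
  Title   STRING,
  Genres  STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '::'
LINES TERMINATED BY '\n'
STORED AS TEXTFILE
LOCATION '/user/shashireddy408417/hive/movie_data';

-- No OVERWRITE: existing files in the table location are left alone.
-- The source file is still moved (not copied) into the table location.
LOAD DATA INPATH '/user/shashireddy408417/ml-10m/ml-10M100K/movies.dat'
INTO TABLE movielense.movie_data;

-- Quick sanity check on the loaded rows.
SELECT * FROM movielense.movie_data LIMIT 5;
```

If the table from #1 still exists, IF NOT EXISTS leaves its old location in place, so the table would need to be dropped or altered before this sketch takes effect.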