Hi,
I am looking for guidance on running a Python program on CloudxLab, given the following setup:
- My Python code, which may be a custom map/reduce process, is called mypython.py.
- My input folder is /myinputfolder.
- My input file is /myinputfolder/myinputdata.txt.
Can someone provide the right command to run this Python program using the Hadoop streaming JAR file?
The following is an example of a command I used; the system came back with a "file not found" error:
python /user/drarmankanooni3849/RatingsBreakdown.py -r hadoop --hadoop-streaming-jar /hdp/apps/2.3.4.0-3485/mapreduce/hadoop-streaming.jar /user/drarmankanooni3849/movielens/u.data
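To make the question more concrete, here is a minimal sketch of the plain streaming form of the command I have in mind, written as a small Python helper that only assembles the argument list and does not run anything. It rests on several assumptions: that mypython.py is a local file which reads from stdin and switches between its map and reduce steps via hypothetical `map` and `reduce` arguments, that /myoutputfolder is a hypothetical output directory that does not exist yet, and that the JAR path is a placeholder to be replaced with the real location of the streaming JAR on the cluster.

```python
import os
import shlex


def build_streaming_command(streaming_jar, script, input_path, output_path):
    """Build (but do not run) the argument list for a Hadoop streaming job."""
    name = os.path.basename(script)
    return [
        "hadoop", "jar", streaming_jar,
        # Generic option, placed before the streaming options:
        # ships the local script to the task nodes.
        "-files", script,
        # The "map" / "reduce" arguments are hypothetical; they assume
        # the script picks its step from its first argument.
        "-mapper", "python " + name + " map",
        "-reducer", "python " + name + " reduce",
        "-input", input_path,
        "-output", output_path,
    ]


if __name__ == "__main__":
    cmd = build_streaming_command(
        "/path/to/hadoop-streaming.jar",      # placeholder, not a real path
        "mypython.py",
        "/myinputfolder/myinputdata.txt",
        "/myoutputfolder",                    # hypothetical output directory
    )
    # Print a copy-pasteable shell command.
    print(" ".join(shlex.quote(part) for part in cmd))
```

If this shape is roughly right, my remaining question is which of the paths should be local paths and which should be HDFS paths.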
Thank you,
Arman