In Spark class of 25/03/2018, you have said that, if half of the line stored in one node and other half in another node second node will communicate with first node and transmit the second half.
But I have heard that HDFS will be taking care such that, complete record will be stored in one node by checking EOF.