org.apache.spark.input
A class that allows DataStreams to be serialized and moved around by deferring their creation until they need to be read.
CombineFileSplit is not serializable, hence the splitBytes construct.
TaskAttemptContext is not serializable, hence the confBytes construct.
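The byte-array workaround described above can be illustrated with a small JDK-only sketch. All names here are hypothetical and not part of Spark: FakeSplit stands in for a Hadoop type such as CombineFileSplit that can write itself to a DataOutput but is not Serializable, and SplitHolder keeps only the encoded bytes so that the holder itself can be serialized.

```java
import java.io.*;

// Hypothetical stand-in for a Hadoop type such as CombineFileSplit:
// it is NOT java.io.Serializable, but it can write itself to a DataOutput.
class FakeSplit {
    String path = "";
    long length = 0L;

    FakeSplit() {}

    FakeSplit(String path, long length) {
        this.path = path;
        this.length = length;
    }

    void write(DataOutput out) throws IOException {
        out.writeUTF(path);
        out.writeLong(length);
    }

    void readFields(DataInput in) throws IOException {
        path = in.readUTF();
        length = in.readLong();
    }
}

// Serializable holder (illustrative): stores only the encoded bytes
// and rebuilds the split on demand.
class SplitHolder implements Serializable {
    private static final long serialVersionUID = 1L;
    private final byte[] splitBytes;

    SplitHolder(FakeSplit split) throws IOException {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        split.write(new DataOutputStream(baos));
        this.splitBytes = baos.toByteArray();
    }

    FakeSplit split() throws IOException {
        FakeSplit s = new FakeSplit();
        s.readFields(new DataInputStream(new ByteArrayInputStream(splitBytes)));
        return s;
    }
}

class SplitHolderDemo {
    // Round-trips a holder through Java serialization, roughly what
    // happens when an object is shipped to another JVM.
    static SplitHolder roundTrip(SplitHolder h) throws IOException, ClassNotFoundException {
        ByteArrayOutputStream baos = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(baos)) {
            oos.writeObject(h);
        }
        try (ObjectInputStream ois =
                 new ObjectInputStream(new ByteArrayInputStream(baos.toByteArray()))) {
            return (SplitHolder) ois.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        SplitHolder copy = roundTrip(new SplitHolder(new FakeSplit("/data/part-0", 42L)));
        FakeSplit s = copy.split();
        System.out.println(s.path + " " + s.length);
    }
}
```

The same idea would apply to the configuration carried by the task context: encode it to bytes once, and decode it again only where the stream is actually opened.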
Close the file (if it is currently open)
Create a new DataInputStream from the split and context
Read the file as a byte array
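The three operations above (open lazily, close if open, read everything as bytes) can be sketched with the JDK alone. This is a minimal illustration under assumed names, not Spark's implementation: LazyFileStream, isOpen, and the use of a local file path in place of a split and context are all hypothetical.

```java
import java.io.*;
import java.nio.file.*;

// Hypothetical sketch: the serializable state is just a path; the stream
// is transient and is created only when first requested.
class LazyFileStream implements Serializable {
    private static final long serialVersionUID = 1L;
    private final String path;
    private transient DataInputStream stream;

    LazyFileStream(String path) {
        this.path = path;
    }

    // Create a new DataInputStream on demand, reusing it while it is open.
    DataInputStream open() throws IOException {
        if (stream == null) {
            stream = new DataInputStream(
                new BufferedInputStream(Files.newInputStream(Paths.get(path))));
        }
        return stream;
    }

    // Close the file if it is currently open; otherwise do nothing.
    void close() throws IOException {
        if (stream != null) {
            stream.close();
            stream = null;
        }
    }

    boolean isOpen() {
        return stream != null;
    }

    // Read the file as a byte array, closing the stream afterwards
    // even if reading fails.
    byte[] toArray() throws IOException {
        try {
            InputStream in = open();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[8192];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } finally {
            close();
        }
    }
}

class LazyFileStreamDemo {
    public static void main(String[] args) throws Exception {
        Path tmp = Files.createTempFile("lazy", ".bin");
        Files.write(tmp, new byte[] {1, 2, 3});
        LazyFileStream s = new LazyFileStream(tmp.toString());
        System.out.println(s.toArray().length + " bytes, open=" + s.isOpen());
        Files.delete(tmp);
    }
}
```

Because the stream field is transient, an instance that has never been opened, or has been closed, carries only its path when serialized.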