问题描述:

I have a huge set of XML records split across different files. Now If a record starts in File 1 but does not end there. Instead it is continued in some other file say File10. How will the Map Reduce framework identify that the remaining pat of the record so that it is processed by the same mapper?

相关阅读:
Top