Hi,
I see the following messages on the console after issuing a DUMP in Pig. Before that, I loaded the data from a Cassandra table with over 15 million records into a Pig relation.
I'm running a 5-node Hadoop cluster, and all nodes display the same messages, shown below:
INFO 14:16:22,225 attempt_201310071219_0003_m_000001_0 NaN%
INFO 14:16:25,294 attempt_201310071219_0003_m_000001_0 NaN%
INFO 14:16:28,366 attempt_201310071219_0003_m_000001_0 NaN%
INFO 14:16:31,434 attempt_201310071219_0003_m_000001_0 NaN%
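My only guess so far, and it is just a guess, is that the progress is computed as a floating-point ratio where both the numerator and the denominator are zero, since in Java 0.0f / 0.0f evaluates to NaN. A minimal sketch of that idea follows; the class and method names are made up for illustration and are not taken from the Hadoop, Pig, or Cassandra source:

```java
public class ProgressSketch {
    // Hypothetical helper: progress as a percentage of rows read so far.
    // If totalRows is 0, the float division 0/0 yields NaN, not an exception.
    static float percent(long rowsRead, long totalRows) {
        return 100.0f * ((float) rowsRead / (float) totalRows);
    }

    // Formats the progress the way the log lines above appear to.
    static String format(long rowsRead, long totalRows) {
        return percent(rowsRead, totalRows) + "%";
    }

    public static void main(String[] args) {
        System.out.println(format(0, 0));   // prints "NaN%"
        System.out.println(format(5, 10));  // prints "50.0%"
    }
}
```

If something like this is what is happening, it would suggest the task does not know the total amount of input it has to process, but I have not confirmed that.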
Could someone please shed some light on this?
Also, looking at the job details in the web browser, I can see that the number of map tasks and the number of reduce tasks are both set to 1. I was under the impression that map and reduce tasks would be split across the 5 nodes rather than having one node do all the work. Is that not the case?
Thanks
Majd