I'm running the latest mapred svn and have the same problem; switching to httpclient helped.

RT> As you can see from the below the %age complete is very low until all
RT> of a sudden it jumps to fully complete. This started happening with some
RT> segments about a week ago. Others go through their full list of ~10 000
RT> urls. It appears to occur whether I use a generate.max.per.host
RT> directive or if I leave it out. Plugins are as defined by default.
RT> There are no errors logged at either the jobtracker or tasktracker.
RT> Happens whether I use a datanode/namenode configuration or local
RT> filesystem.

Michael
