After hours of troubleshooting and seeing numerous boxes go down to ridiculous io. This is how we felt:
We found that map generators like c10t create high amount of swap usage which locks the box up. I guess the new map saving changed how the chunks worked. If you use this, turn it OFF!
The process to watch out for is c10t. We stopped this by shutting down crond on all of our boxes.