improve send/recv handling for larger amounts of data
In various places (e.g., PstreamBuffers and file IO) we may want to send large amounts of data, but MPI is currently restricted to int32 counts. The count is the total number of elements in a message, not the number of bytes. Unfortunately, we largely rely on sending/receiving byte streams as the lowest common denominator for all contiguous data types, so the element count equals the byte count and a single message is capped at just under 2 GiB (INT_MAX bytes).
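
One possible way around the limit, sketched here only as an illustration and not as the approach taken by this change, is to split a large byte stream into pieces whose counts each fit in an int. The helper name splitIntoChunks and its maxCount parameter are hypothetical and not part of any existing API; the sketch assumes a 32-bit int.

```cpp
#include <cassert>
#include <climits>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical helper (illustration only): split nBytes into
// (offset, count) pieces so that every count fits in an int,
// which is the type MPI send/recv calls take for their count.
std::vector<std::pair<std::int64_t, int>>
splitIntoChunks(std::int64_t nBytes, int maxCount = INT_MAX)
{
    assert(nBytes >= 0 && maxCount > 0);

    std::vector<std::pair<std::int64_t, int>> chunks;

    for (std::int64_t offset = 0; offset < nBytes; /* advanced below */)
    {
        const std::int64_t remaining = nBytes - offset;

        // The last piece may be shorter than maxCount
        const int count =
            (remaining < maxCount) ? static_cast<int>(remaining) : maxCount;

        chunks.emplace_back(offset, count);
        offset += count;
    }

    return chunks;
}
```

Each (offset, count) pair could then be handed to a separate send/recv of bytes. Because the count is in elements rather than bytes, an alternative is to send the same buffer as a wider data type, which raises the byte limit by the size of that type but requires the buffer length to be a multiple of it.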