Furthermore, the recordings must be shared across devices (probably stored/processed on server).
Besides the formats in common, I'd like your recommendations for the right bitrate/configuration or so. I'm recording voice and I want to achieve minimum space but good/decent understanding of what people record (more or less the way WhatsApp works), so they understand each other and the file is light enough to travel relatively fast on slow connections.
I already went to this Android chart, but I know little about iOS (future development).
You can find the list of supported media techonlogies on iOS in the documentation. But it seems as if there isn't so much overlap, so you might want to go ahead and use an encoder like the open speex codec which is optimized for VoIP applications.
You can find the answer in my another post with snippet MPEG_4 format and AAC encoder audio format for iOS and Android