It's sad that Google and Facebook pulled out of federation a long time ago. We could have had bring-your-own-client, cross-network versions of all of this stuff. We still could, but realistically we won't.
WebRTC only defines how the browser supports audio/video/chat endpoints. A practical video calling system also needs servers to manage firewall traversal (STUN/TURN), a directory of users, and ways to administer the system.
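To make the firewall traversal part concrete, here is a rough sketch of how a client might be pointed at STUN/TURN servers. The `buildPeerConfig` helper, the hostnames, and the credentials are all made up for illustration; the object shape follows the `iceServers` configuration that the browser's `RTCPeerConnection` constructor accepts. The user directory and admin side aren't covered here, since WebRTC says nothing about them.

```typescript
// Local copies of the shapes the browser's RTCPeerConnection constructor
// accepts, so this sketch type-checks outside a browser too.
interface IceServer {
  urls: string | string[];
  username?: string;
  credential?: string;
}

interface PeerConfig {
  iceServers: IceServer[];
}

// Hypothetical helper (not part of WebRTC): builds a peer connection config
// from STUN and TURN hosts that you would have to run or rent yourself.
function buildPeerConfig(
  stunHost: string,
  turnHost: string,
  username: string,
  credential: string
): PeerConfig {
  return {
    iceServers: [
      // STUN lets a client discover its public address behind NAT.
      { urls: `stun:${stunHost}:3478` },
      // TURN relays media when no direct path between peers can be found.
      { urls: `turn:${turnHost}:3478`, username, credential },
    ],
  };
}

// Placeholder hosts and credentials, assumed for this example only.
const config = buildPeerConfig(
  "stun.example.org",
  "turn.example.org",
  "alice",
  "secret"
);

// In a browser you would then do something like:
//   const pc = new RTCPeerConnection(config);
```

The browser handles the ICE negotiation itself; the servers behind those URLs are the part you still have to provide.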