You're right, when the server receives two swipe events at the same time in opposite directions we assume that both devices are aligned at this point. It doesn't matter though where on the screen you pinch them together,.
We haven't figured out a way to do this automatically. Currently we prompt the user to enter the size of the device when the app is opened for the first time.
I don't think it's possible unless you add an app.
One solution would be create a companion app that determines the physical screen size, then makes that available to your network via web sockets.
There our a few steps I left out here but it's possible.
The app could be optional - if its installed that device will have fully automatic configuration. If it's not installed, you just fall back to your user prompt.
Based on the demo they showed, this isn't needed. As long as they know the orientation of the device relative to a flat plane they can move the virtual items within the bounds of that device's screen. When it crosses over on to another device, it only needs to calculate movement based on it's relative orientation.
I think this blog post is pointless. Of course every abstraction adds some overhead at first. If your problem is simple enough to fit in a few lines of code you don't have to think a lot how you structure it.