OpenMiDaS transmits a video stream from the device to a backend server where depth estimation is performed using MiDaS and returned to the device for a side by side comparison with the camera preview.
The work on MiDaS comes from the paper: https://arxiv.org/abs/1907.01341v3