Unfortunately that is how WI-FI webcams are: I encountered about 2 seconds delay too. To do actual Vision Navigation I had to write my own code on an RPi0 2W using the PI Cam in direct wired mode. This means that you have to use a different SD card with your own OS and OpenCV and Dynamixel SDK - quite a bit of work.