Sounds like you have the right ideas.
Ground wire is common to all devices. VTx and cameras may need different power supply voltages. Ground wire is used both for power supply ground and for video signal ground. 3S Lipo is close enough to 12V to use for that. 5V regulated from ESC works well if you need 5V. Be aware that a linear BEC inside the ESC may get warmer because you are drawing more power. Black/brown is ground. Red is + some voltage (usually 5 or 12). Yellow is video signal. Sometimes white is sound.
Getting 5V power from the FC ESC connection sounds like an easy and reasonably correct way to do it. The very worst that you might get would be some small amount video of video interference, but don't let that send you off to make wiring better when it just is not needed.
Be aware that if you are not nearsighted, then you will probably need reading glasses to use the goggles, especially if you are over 20 or so. Most goggles were designed to be small and designed by a young, possibly near sighted person. If you aren't near-sighted I recommend you research as to whether there is room in the goggles to wear glasses. Also consider mounting a set of reading glasses inside.
I made my favorite goggles by cutting a pair in half to make them about 100mm longer. They now focus where my eyes naturally focus and I could wear them for hours with no eye strain.
https://www.rcgroups.com/forums/showthread.php?p=36570916&postcount=3010https://www.rcgroups.com/forums/showthread.php?p=36594099&postcount=3029Buy the goggles you like. RCGroups.com often has threads with users comments that can help with many things about that. For your goggles, do you want:
- built in receiver
- diversity for receiver
- dvr to record videos with
- extra input or output connections