It depends on the video, and is it for marketing, enjoyment, professional usage, education, etc. Videos recorded by recreational coyote hunters vs what a call-manufacturer or what appears on the Outdoor Channel are produced for very different reasons.
Nighttime vs daytime videos play a role as well. There aren't a lot of people doing full production elements (with what some people call "b-roll") with NV and thermal for every stand, every kill, etc. When I make a daytime video, I have used things like drone footage, action-cam footage, and pre and post interviews, multiple camera angles of the kill, and a lot more. All of these things can be done at night as well, but they are just more challenging, and people rarely do it. Also, all these things take a lot of time. The hardest thing for me when hunting is good is to take the time to record all the "b-roll" that makes for a better overall video when all I want to do is get to the next stand.
I believe your question is more for nighttime hunting footage. Similar to what 6mm referenced, a lot depends on if the scope records audio. The more that audio is picked up by the recording device, where maybe they can hear other things happening, the more extra audio isn't as necessary to keep a person engaged with the video. A video that has absolutely no sound at all, with no gunshot, and no music is pretty boring for anyone watching especially for any length of time. Even if a person doesn't like the background music, it is something. So, I would highly recommend some type of sound be present on a video.
If a person or audience doesn't like audio, they can always mute it as some people have said they do.