I want to load a video that has a soundtrack, and I want it to take an audio clip of a single spoken word and I want the clip inserted at different points in the video. The output will be the new video. I also want to insert an image at certain points in the video (watermark).
Can I do this using FFMPEG or libvlcsharp?