I am creating a cutlist like this
file 't1.MP4'
inpoint 1.45067
outpoint 3.18133
file 't2.MP4'
inpoint 3.45067
outpoint 6.18133
...
and I use it like this
ffmpeg -y -f concat -safe 0 -i list.txt -c:v libx264 -c:a aac -af aresample=async=1 -pix_fmt yuv420p output.mp4
The output almost always contains a still image at the start, sometimes half the video time, sometimes less, showing the first frame then the final video starts. I noticed, the less inpoint
and outpoint
are apart, the longer the still image at the start. Maybe its also connected to the amount of inpoint
and outpoint
.
How do I solve this?
The concat demuxer is outputting more packets than that specified in the ranges, since video will need earlier frames to decode.
Try
I assume all files have videos and audios with same properties i.e. framerate, resolution, audio sampling rate, channels..etc