by `tsac-2024-04-08-win64` without cuda encoding and decoding took about 16 minutes on a ryzen7-5800x, so roughly realtime