fix visualization, interactions via file, I/O for subtitle text in videos, request frames during training