VASA-1 takes a single portrait photo and an audio file and converts ... They've got it generating 512x512-pixel images at 45 frames per second, and it can do so in about 2 minutes using a desktop ...