Generate lip-synced video from image or video and audio
Generate lip-synced videos from images and audio