Generate natural speech from Japanese or English text
Detect anime faces and landmarks in an image
Towards Unified Music Emotion Recognition across Dimensional