The model suffers from ADHD
#18
by
Candala
- opened
My impression is that the model suffers from ADHD. It is unable to formulate thoughts in a coherent manner, let alone generate a narrative: instead of a smooth narrative, the result is a somewhat chaotic collection of notes. However, compared to Ministral-3-14B-Base, this model is functional, as Ministral-3-14B-Base is completely dysfunctional.
It should be noted that this review is based on the performance of both models with German-language texts. It should be remembered that Mistral was previously truly multilingual. Unfortunately, this is no longer the case.