The model suffers from ADHD

#18
by Candala - opened

My impression is that the model suffers from ADHD. It cannot formulate thoughts coherently, let alone sustain a narrative: instead of a smooth story, the result is a somewhat chaotic collection of notes. Still, this model is at least functional, whereas Ministral-3-14B-Base is completely dysfunctional.

Note that this review is based on how both models perform on German-language texts. Mistral's models used to be truly multilingual. Unfortunately, this is no longer the case.
