Largest (as of 2024) machine translated Arabic educational corpus
Sultan Alrashed PRO
SultanR
AI & ML interests
Smol language modelling and Arabic!
Recent Activity
authored
a paper
about 1 hour ago
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+
Languages and Cultures
authored
a paper
about 1 hour ago
AraMix: Recycling, Refiltering, and Deduplicating to Deliver the Largest Arabic Pretraining Corpus
authored
a paper
about 1 hour ago
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data