Gurvaah Singh
ReallyFloppyPenguin
AI & ML interests
AI, GGUFing AI, AI, Running AI, Thinking about AI, and so on
Recent Activity
liked a model about 16 hours ago
kdcyberdude/w2v-bert-punjabi
liked a model 1 day ago
deepseek-ai/DeepSeek-V4-Flash-Base
liked a Space 1 day ago
kdcyberdude/HARvestGymOrganizations
MathRL
Note: The solution may not be in the `solution` or `answer` columns; it may instead appear inside `\boxed{ANSWER}`.
Datasets That Kill
Free AI!!!
Sikh Models
- HuggingFaceTB/SmolLM3-3B
  Text Generation • 3B • Updated • 164k • 944
- Qwen/Qwen3-4B
  Text Generation • Updated • 6.23M • 606
- meta-llama/Llama-3.1-8B-Instruct
  Text Generation • 8B • Updated • 9.39M • 5.77k
- mistralai/Mistral-7B-Instruct-v0.3
  7B • Updated • 3.2M • 2.54k
Revolutionary Models
GGUFs
Ultra Cool Models
Interesting Papers
- Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
  Paper • 2501.11425 • Published • 109
- Agent Laboratory: Using LLM Agents as Research Assistants
  Paper • 2501.04227 • Published • 95
- System Prompt Optimization with Meta-Learning
  Paper • 2505.09666 • Published • 71
- Visual Planning: Let's Think Only with Images
  Paper • 2505.11409 • Published • 57