Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • 725k downloads • 1.99k likes
- sentence-transformers/all-MiniLM-L6-v2
  Sentence Similarity • 164M downloads • 4.48k likes
- BAAI/bge-large-en-v1.5
  Feature Extraction • 0.3B • 4.97M downloads • 627 likes
- meta-llama/Llama-3.3-70B-Instruct
  Text Generation • 777k downloads • 2.66k likes