How to use from
SGLang
Install from pip and serve model
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "TomGrc/FusionNet_passthrough" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet_passthrough",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker images
docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "TomGrc/FusionNet_passthrough" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "TomGrc/FusionNet_passthrough",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Quick Links

FusionNet_passthrough

Fine-tuned model on English language using passthrough Fusion method.

Model description

This is an experiment with the passthrough Fusion method of FusionNet. This model has 21.2B parameters, and this model is fine-tuned. Enjoy!

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 65.94
AI2 Reasoning Challenge (25-Shot) 69.45
HellaSwag (10-Shot) 87.72
MMLU (5-Shot) 65.28
TruthfulQA (0-shot) 67.65
Winogrande (5-shot) 81.29
GSM8k (5-shot) 24.26
Downloads last month
208
Safetensors
Model size
21B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TomGrc/FusionNet_passthrough

Quantizations
3 models

Evaluation results