Mengzhao Chen's picture

Mengzhao Chen

ChenMnZ

·

https://chenmnz.github.io/

ChenMnZ

AI & ML interests

model compression

Recent Activity

upvoted an article about 2 months ago

The Optimal Architecture for Small Language Models

upvoted a paper about 2 months ago

mHC: Manifold-Constrained Hyper-Connections

upvoted a paper 3 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

View all activity

Organizations

None yet

ChenMnZ 's models 129

ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 5

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64

Text Generation • 2B • Updated Jul 22, 2024 • 4

ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ

Text Generation • 71B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128

Text Generation • 2B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-8b-EfficientQAT-w4g128

Text Generation • 2B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-EfficientQAT-w3g128

Text Generation • 2B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-8b-EfficientQAT-w2g64

Text Generation • 2B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-3-8b-EfficientQAT-w2g128

Text Generation • 2B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 3 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w3g128

Text Generation • 10B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ

Text Generation • 7B • Updated Jul 22, 2024 • 10

ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 4

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64

Text Generation • 8B • Updated Jul 22, 2024

ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS

Text Generation • 274B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128

Text Generation • 7B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ

Text Generation • 69B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS

Text Generation • 51B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 3

ChenMnZ/Llama-3-70b-EfficientQAT-w4g128

Text Generation • 11B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS

Text Generation • 51B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-3-70b-EfficientQAT-w3g128

Text Generation • 10B • Updated Jul 22, 2024 • 1 • 1

ChenMnZ/Llama-3-70b-EfficientQAT-w2g64

Text Generation • 8B • Updated Jul 22, 2024 • 1 • 1

ChenMnZ/Llama-3-70b-EfficientQAT-w2g128

Text Generation • 7B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-GPTQ

Text Generation • 13B • Updated Jul 22, 2024 • 1

ChenMnZ/Llama-2-7b-EfficientQAT-w4g128

Text Generation • 1B • Updated Jul 22, 2024 • 74

ChenMnZ/Llama-2-7b-EfficientQAT-w3g128

Text Generation • 1.0B • Updated Jul 22, 2024 • 2

ChenMnZ/Llama-2-7b-EfficientQAT-w2g64

Text Generation • 0.8B • Updated Jul 22, 2024 • 2