AlbertShi's picture

5 13

AlbertShi

AlbertShi

·

AI & ML interests

None yet

Organizations

upvoted an article 5 months ago

Article

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

Aug 5

•

7

upvoted a paper 5 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 17

upvoted a collection 9 months ago

🐕Small-Doges

Doge family of small language models! • 18 items • Updated Apr 21 • 11

upvoted a paper 11 months ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8

upvoted a collection 11 months ago

Doge

Doge family of small language models. • 12 items • Updated Mar 28 • 6