Travis King

travisking

AI & ML interests

have you heard of generative AI?

Recent Activity

liked a dataset 6 days ago

nvidia/Nemotron-PII

Organizations

None yet

upvoted a paper about 6 hours ago

Are We on the Right Way to Assessing LLM-as-a-Judge?

Paper • 2512.16041 • Published 5 days ago • 23

upvoted a paper 3 days ago

Hierarchical Dataset Selection for High-Quality Data Sharing

Paper • 2512.10952 • Published 11 days ago • 1

upvoted 2 papers 7 days ago

Causal Judge Evaluation: Calibrated Surrogate Metrics for LLM Systems

Paper • 2512.11150 • Published 11 days ago • 4

BEAVER: An Efficient Deterministic LLM Verifier

Paper • 2512.05439 • Published 18 days ago • 34

upvoted a paper 11 days ago

Towards a Science of Scaling Agent Systems

Paper • 2512.08296 • Published 14 days ago • 13

upvoted 2 collections 19 days ago

Skywork-Reward-V2

Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 25

Reward Models 10-2025

A collection of great reward models for research and production • 7 items • Updated 6 days ago • 12

upvoted a collection 29 days ago

Olmo 3 Pre-training

All artifacts related to Olmo 3 pre-training • 10 items • Updated 13 days ago • 30

upvoted a paper about 1 month ago

Mitigating Label Length Bias in Large Language Models

Paper • 2511.14385 • Published Nov 18 • 6

upvoted an article about 1 month ago

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Article • Nov 5 • 57

upvoted a collection about 1 month ago

Nemotron RAG

14 items • Updated about 6 hours ago • 51

upvoted a paper about 1 month ago

OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation

Paper • 2511.13655 • Published Nov 17 • 9

upvoted 3 articles about 1 month ago

🌳 QAT: The Art of Growing a Bonsai Model

Article • Nov 9 • 15

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

Article • Nov 15 • 12

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • Oct 30 • 70

upvoted 5 papers about 1 month ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12 • 93

LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Paper • 2511.09148 • Published Nov 12 • 16

Motif 2 12.7B technical report

Paper • 2511.07464 • Published Nov 7 • 39

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

Paper • 2511.07419 • Published Nov 10 • 25

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7 • 52