Felix Tuma's picture

Felix Tuma

floom

·

AI & ML interests

NLP

Recent Activity

liked a dataset 7 days ago

kachoio/polymarket-5-minute-crypto-up-down-markets

updated a collection 13 days ago

PotentialApplication

liked a model 22 days ago

numind/NuExtract3

View all activity

Organizations

None yet

upvoted a paper 2 months ago

Automating Database-Native Function Code Synthesis with LLMs

Paper • 2604.06231 • Published Apr 2 • 17

upvoted a paper 3 months ago

Structurally Aligned Subtask-Level Memory for Software Engineering Agents

Paper • 2602.21611 • Published Feb 25 • 1

upvoted 3 papers 6 months ago

Adaptation of Agentic AI

Paper • 2512.16301 • Published Dec 18, 2025 • 111

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Paper • 2512.19682 • Published Dec 22, 2025 • 19

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published Dec 21, 2025 • 25

upvoted 6 papers 8 months ago

NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning

Paper • 2510.18940 • Published Oct 21, 2025 • 9

Redefining Retrieval Evaluation in the Era of LLMs

Paper • 2510.21440 • Published Oct 24, 2025 • 9

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 232

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 32

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 103

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Paper • 2510.23595 • Published Oct 27, 2025 • 14

upvoted 3 papers 9 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 22

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published Sep 25, 2025 • 30

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8, 2025 • 64

upvoted 6 papers 10 months ago

Language Self-Play For Data-Free Training

Paper • 2509.07414 • Published Sep 9, 2025 • 31

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Paper • 2508.19229 • Published Aug 26, 2025 • 20

Hermes 4 Technical Report

Paper • 2508.18255 • Published Aug 25, 2025 • 54

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

Paper • 2508.09101 • Published Aug 12, 2025 • 8