medical
updated
Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
Paper
• 2411.12814
• Published
• 23
SegBook: A Simple Baseline and Cookbook for Volumetric Medical Image
Segmentation
Paper
• 2411.14525
• Published
• 19
MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation
towards Unannotated Modalities
Paper
• 2412.04106
• Published
• 5
PepTune: De Novo Generation of Therapeutic Peptides with
Multi-Objective-Guided Discrete Diffusion
Paper
• 2412.17780
• Published
• 5
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper
• 2412.18925
• Published
• 107
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and
Vision-Language Models Derived from Scientific Literature
Paper
• 2501.07171
• Published
• 55
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and
Understanding
Paper
• 2501.18362
• Published
• 23
A Study on the Performance of U-Net Modifications in Retroperitoneal
Tumor Segmentation
Paper
• 2502.00314
• Published
• 3
Current Pathology Foundation Models are unrobust to Medical Center
Differences
Paper
• 2501.18055
• Published
• 3
Homeomorphism Prior for False Positive and Negative Problem in Medical
Image Dense Contrastive Representation Learning
Paper
• 2502.05282
• Published
HealthGPT: A Medical Large Vision-Language Model for Unifying
Comprehension and Generation via Heterogeneous Knowledge Adaptation
Paper
• 2502.09838
• Published
• 11
SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question
Answering?
Paper
• 2502.13233
• Published
• 15
Preference Learning Unlocks LLMs' Psycho-Counseling Skills
Paper
• 2502.19731
• Published
• 7
Enhancing Abnormality Grounding for Vision Language Models with
Knowledge Descriptions
Paper
• 2503.03278
• Published
• 14
Multi Agent based Medical Assistant for Edge Devices
Paper
• 2503.05397
• Published
• 9
Gumbel-Softmax Flow Matching with Straight-Through Guidance for
Controllable Biological Sequence Generation
Paper
• 2503.17361
• Published
• 5
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning
with Large Language Models
Paper
• 2504.00869
• Published
• 10
Medical large language models are easily distracted
Paper
• 2504.01201
• Published
• 3
MedSAM2: Segment Anything in 3D Medical Images and Videos
Paper
• 2504.03600
• Published
• 10
Clinical ModernBERT: An efficient and long context encoder for
biomedical text
Paper
• 2504.03964
• Published
• 5
Latent Diffusion Autoencoders: Toward Efficient and Meaningful
Unsupervised Representation Learning in Medical Imaging
Paper
• 2504.08635
• Published
• 4
UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image
Segmentation
Paper
• 2504.06908
• Published
• 6
EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental
Health Safety
Paper
• 2504.09689
• Published
• 6
Filter2Noise: Interpretable Self-Supervised Single-Image Denoising for
Low-Dose CT with Attention-Guided Bilateral Filtering
Paper
• 2504.13519
• Published
• 1
SilVar-Med: A Speech-Driven Visual Language Model for Explainable
Abnormality Detection in Medical Imaging
Paper
• 2504.10642
• Published
• 2
CheXWorld: Exploring Image World Modeling for Radiograph Representation
Learning
Paper
• 2504.13820
• Published
• 16
Clinical knowledge in LLMs does not translate to human interactions
Paper
• 2504.18919
• Published
• 26
MediAug: Exploring Visual Augmentation in Medical Imaging
Paper
• 2504.18983
• Published
• 7
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health
Information
Paper
• 2505.06046
• Published
• 15
Multi-Objective-Guided Discrete Flow Matching for Controllable
Biological Sequence Design
Paper
• 2505.07086
• Published
• 1
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture,
Training and Dataset
Paper
• 2505.09568
• Published
• 99
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible
Long-term Tracking
Paper
• 2505.08581
• Published
• 9
Unifying Segment Anything in Microscopy with Multimodal Large Language
Model
Paper
• 2505.10769
• Published
• 2
MedCaseReasoning: Evaluating and learning diagnostic reasoning from
clinical case reports
Paper
• 2505.11733
• Published
• 7
HISTAI: An Open-Source, Large-Scale Whole Slide Image Dataset for
Computational Pathology
Paper
• 2505.12120
• Published
• 7
The Aloe Family Recipe for Open and Specialized Healthcare LLMs
Paper
• 2505.04388
• Published
• 26
NOVA: A Benchmark for Anomaly Localization and Clinical Reasoning in
Brain MRI
Paper
• 2505.14064
• Published
• 19
An Explainable Diagnostic Framework for Neurodegenerative Dementias via
Reinforcement-Optimized LLM Reasoning
Paper
• 2505.19954
• Published
Towards Scalable Language-Image Pre-training for 3D Medical Imaging
Paper
• 2505.21862
• Published
• 1
CXReasonBench: A Benchmark for Evaluating Structured Diagnostic
Reasoning in Chest X-rays
Paper
• 2505.18087
• Published
• 8
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at
Scale
Paper
• 2506.04405
• Published
• 7
Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric
Approach
Paper
• 2506.03238
• Published
• 1
Medical World Model: Generative Simulation of Tumor Evolution for
Treatment Planning
Paper
• 2506.02327
• Published
• 20
MIRIAD: Augmenting LLMs with millions of medical query-response pairs
Paper
• 2506.06091
• Published
• 11
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical
Understanding and Reasoning
Paper
• 2506.07044
• Published
• 113
MIRAGE: Multimodal foundation model and benchmark for comprehensive
retinal OCT image analysis
Paper
• 2506.08900
• Published
• 4
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust
MedVQA in Gastrointestinal Endoscopy
Paper
• 2506.09958
• Published
• 1
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical
Reasoning
Paper
• 2506.09513
• Published
• 101
Ming-Omni: A Unified Multimodal Model for Perception and Generation
Paper
• 2506.09344
• Published
• 31
Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs
Paper
• 2506.16962
• Published
• 10
An Agentic System for Rare Disease Diagnosis with Traceable Reasoning
Paper
• 2506.20430
• Published
• 8
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context
Learning
Paper
• 2506.21355
• Published
• 10
Gazal-R1: Achieving State-of-the-Art Medical Reasoning with
Parameter-Efficient Two-Stage Training
Paper
• 2506.21594
• Published
• 8
μ^2Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for
Radiology Report Generation
Paper
• 2507.00316
• Published
• 15
CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for
Multi-Organ Segmentation
Paper
• 2506.23121
• Published
• 2
Should We Still Pretrain Encoders with Masked Language Modeling?
Paper
• 2507.00994
• Published
• 81
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for
Empathetic Agents
Paper
• 2507.03112
• Published
• 33
MedGen: Unlocking Medical Video Generation by Scaling
Granularly-annotated Medical Videos
Paper
• 2507.05675
• Published
• 27
SAMed-2: Selective Memory Enhanced Medical Segment Anything Model
Paper
• 2507.03698
• Published
• 12
MedGemma Technical Report
Paper
• 2507.05201
• Published
• 16
Skywork-R1V3 Technical Report
Paper
• 2507.06167
• Published
• 73
UGPL: Uncertainty-Guided Progressive Learning for Evidence-Based
Classification in Computed Tomography
Paper
• 2507.14102
• Published
• 1
SegDT: A Diffusion Transformer-Based Segmentation Model for Medical
Imaging
Paper
• 2507.15595
• Published
• 6
RedDino: A foundation model for red blood cell analysis
Paper
• 2508.08180
• Published
• 2
MedSAMix: A Training-Free Model Merging Approach for Medical Image
Segmentation
Paper
• 2508.11032
• Published
• 2
End-to-End Agentic RAG System Training for Traceable Diagnostic
Reasoning
Paper
• 2508.15746
• Published
• 14
Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing
Paper
• 2508.17326
• Published
• 1
Baichuan-M2: Scaling Medical Capability with Large Verifier System
Paper
• 2509.02208
• Published
• 43
M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via
Self-Supervision
Paper
• 2509.01360
• Published
• 12
MedDINOv3: How to adapt vision foundation models for medical image
segmentation?
Paper
• 2509.02379
• Published
• 2
Does DINOv3 Set a New Medical Vision Standard?
Paper
• 2509.06467
• Published
• 38
Curia: A Multi-Modal Foundation Model for Radiology
Paper
• 2509.06830
• Published
• 21
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI
Paper
• 2509.11648
• Published
• 2
ROOM: A Physics-Based Continuum Robot Simulator for Photorealistic
Medical Datasets Generation
Paper
• 2509.13177
• Published
• 3
SAIL-VL2 Technical Report
Paper
• 2509.14033
• Published
• 44
MedReseacher-R1: Expert-Level Medical Deep Researcher via A
Knowledge-Informed Trajectory Synthesis Framework
Paper
• 2508.14880
• Published
• 15