Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
9
1
Isadora White
izzcw
Follow
ziadrone's profile picture
Jennny's profile picture
Pamela153's profile picture
3 followers
·
3 following
https://icwhite.github.io/website/
isadorcw
icwhite
isadora-c-white
AI & ML interests
LLMs, Reinforcement Learning, agents, embodiment, multi-agent collaboration
Recent Activity
upvoted
a
paper
about 1 month ago
RefineBench: Evaluating Refinement Capability of Language Models via Checklists
upvoted
a
paper
2 months ago
Steering Autoregressive Music Generation with Recursive Feature Machines
upvoted
a
paper
5 months ago
Group Sequence Policy Optimization
View all activity
Organizations
izzcw
's models
26
Sort: Recently updated
izzcw/dpo_model_3.1_8k
Updated
Jun 4, 2025
izzcw/qwen_large_crafting_sft_success
Text Generation
•
2B
•
Updated
Jun 1, 2025
•
4
izzcw/large_crafting_sft_success
Text Generation
•
2B
•
Updated
Jun 1, 2025
•
7
izzcw/trajectory_crafting_dpo_pairs
Updated
Jun 1, 2025
izzcw/trajectory_crafting_dpo_pairs.json
Updated
Jun 1, 2025
izzcw/llama_3.1_large_crafting_sft_success
Text Generation
•
8B
•
Updated
May 31, 2025
•
7
izzcw/llama_3b_crafting_sft_success_new_mem
Text Generation
•
3B
•
Updated
May 27, 2025
•
4
izzcw/mini_llama_crafting_sft_success_new_mem
Text Generation
•
1B
•
Updated
May 27, 2025
•
8
izzcw/cooking_sft_fail_new_mem
Text Generation
•
8B
•
Updated
May 24, 2025
•
7
izzcw/crafting_sft_fail_new_mem
Text Generation
•
8B
•
Updated
May 24, 2025
•
6
izzcw/cooking_sft_success_new_mem
Text Generation
•
8B
•
Updated
May 22, 2025
•
5
izzcw/large_cooking_sft_success
Text Generation
•
8B
•
Updated
May 13, 2025
•
7
•
1
izzcw/large_cooking_sft_fail
8B
•
Updated
May 11, 2025
•
4
izzcw/large_crafting_sft_fail
Text Generation
•
8B
•
Updated
May 8, 2025
•
5
izzcw/filtered_crafting_train_data_shorter_length
Text Generation
•
8B
•
Updated
May 8, 2025
•
7
izzcw/dpo_crafting_lora_from_sft
Updated
May 1, 2025
•
3
izzcw/dpo_crafting_lora_from_base
Updated
May 1, 2025
•
3
izzcw/dpo_crafting_lora
Updated
May 1, 2025
•
3
izzcw/dpo_cooking
Updated
Apr 30, 2025
izzcw/filtered_crafting_train_data
8B
•
Updated
Apr 28, 2025
•
6
izzcw/filtered_construction_train_data
8B
•
Updated
Apr 28, 2025
•
7
izzcw/filtered_cooking_train_data
8B
•
Updated
Apr 28, 2025
•
10
izzcw/llama_3_70b_lora_sft_construction
Updated
Apr 11, 2025
•
3
izzcw/llama_3_70b_lora_sft_crafting
Updated
Apr 11, 2025
•
4
izzcw/llama3_70b_lora_sft_cooking
Updated
Apr 3, 2025
•
5
izzcw/final_combined_mc_data
Text Generation
•
8B
•
Updated
Mar 25, 2025
•
8