All HF Hub posts

SeaWolf-AI posted an update 1 day ago
AI Is Training on Your Content Without Permission — Fight Back with Invisible Watermarks

FINAL-Bench/security-scan

Most generative AI training data is crawled without consent. Your text gets summarized, images reprocessed, videos clipped — with no way to prove you're the original creator. Existing watermarks are either visible or wiped out by a single AI preprocessing pass.

Detect Before, Track After

Pre-embed — Detect theft without any watermark. Text plagiarism check, image similarity analysis (perceptual hash, SSIM, color histogram, feature matching), and video temporal matching catch copies, edits, and excerpts.
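
The image-similarity side of pre-embed detection can be illustrated with an average hash, the simplest member of the perceptual-hash family mentioned above. This is a hypothetical stand-alone sketch, not the project's actual implementation:

```python
# Minimal average-hash (aHash) sketch: threshold each pixel against the
# mean, then compare bit vectors by Hamming distance. Small distances
# indicate perceptually similar images even after light edits.

def average_hash(pixels):
    """pixels: 2D list of grayscale values (pretend it was already resized)."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return [1 if p > mean else 0 for p in flat]

def hamming(h1, h2):
    return sum(a != b for a, b in zip(h1, h2))

# Two tiny 4x4 "images": the second is a slightly brightened copy.
original = [[10, 200, 10, 200],
            [200, 10, 200, 10],
            [10, 200, 10, 200],
            [200, 10, 200, 10]]
edited = [[p + 5 for p in row] for row in original]

distance = hamming(average_hash(original), average_hash(edited))
print(distance)  # 0: a uniform brightness shift does not change the hash
```

Because the hash depends only on each pixel's relation to the mean, global brightness or contrast changes leave it intact, which is why copies survive casual edits.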

Post-embed — Embed invisible multi-layer watermarks. If one layer is destroyed, others survive independently. Even full removal leaves forensic traces as evidence.

Text: 4 Independent Layers

Four mechanisms work simultaneously: zero-width Unicode characters at morpheme/word boundaries (Korean Kiwi + English NLP), style fingerprinting via synonym-ending-connective substitution, SHA-256 timestamped evidence packages, and punctuation-anchored micro-marks. Each layer uses a different Unicode category, so attacks on one cannot eliminate the others. Full bilingual support, zero readability impact.
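
The zero-width-character layer can be sketched in a few lines; the specific characters and placement below are illustrative, not the project's actual scheme:

```python
# Encode a bit string as zero-width characters inserted after spaces
# (a stand-in for the morpheme/word boundaries described above),
# then recover it from the marked text.

ZW0 = "\u200b"  # ZERO WIDTH SPACE      -> bit 0
ZW1 = "\u200c"  # ZERO WIDTH NON-JOINER -> bit 1

def embed(text, bits):
    out, i = [], 0
    for ch in text:
        out.append(ch)
        if ch == " " and i < len(bits):
            out.append(ZW1 if bits[i] == "1" else ZW0)
            i += 1
    return "".join(out)

def extract(text):
    return "".join("1" if c == ZW1 else "0" for c in text if c in (ZW0, ZW1))

marked = embed("the quick brown fox jumps", "1011")
print(extract(marked))  # "1011"
# The marked text renders identically but is not byte-identical:
print(marked == "the quick brown fox jumps")  # False
```

Using distinct Unicode categories per layer, as the post describes, means a filter tuned to one character class misses the others.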

34-Attack Defense

7 categories, 34 attacks simulated: Unicode normalization, invisible character removal, homoglyph substitution (9,619 confusables), and AI rewriting. Each scored on Signal (watermark survival) + Trace (forensic evidence of attack) — proving deliberate removal even when watermarks are destroyed.
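
One of the simulated attacks, invisible-character removal, can be sketched by stripping every Unicode format-category (Cf) code point. The Signal is destroyed, but a Trace survives: the attacked text no longer matches its registered hash. A hypothetical sketch, not the benchmark's scoring code:

```python
import hashlib
import unicodedata

def strip_invisible(text):
    # Remove every format-category (Cf) code point: zero-width characters
    # such as U+200B and U+200C live in this category.
    return "".join(c for c in text if unicodedata.category(c) != "Cf")

marked = "proof\u200bof\u200cownership"                   # carries zero-width marks
registered = hashlib.sha256(marked.encode()).hexdigest()  # evidence-package hash

attacked = strip_invisible(marked)
signal_survives = any(unicodedata.category(c) == "Cf" for c in attacked)
trace = hashlib.sha256(attacked.encode()).hexdigest() != registered

print(signal_survives)  # False: the watermark layer was destroyed
print(trace)            # True: the mismatch itself evidences tampering
```

This is the Signal + Trace idea in miniature: even a successful attack leaves forensic evidence that something was removed.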

Image & Video

Images: DCT frequency-domain watermarks surviving JPEG compression and resize. Videos: keyframe watermarking with temporal propagation and majority-vote extraction. Both support pre-embed similarity detection.
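
A frequency-domain watermark of this kind can be sketched by encoding a bit in the ordering of two mid-frequency DCT coefficients of an 8x8 block. The coefficient pair and strength below are illustrative; real JPEG robustness additionally requires quantization-aware design:

```python
import math

N = 8  # standard JPEG block size

def dct2(block):
    """Orthonormal 2D DCT-II of an NxN block."""
    def a(k): return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)
    return [[a(u) * a(v) * sum(
        block[x][y]
        * math.cos((2 * x + 1) * u * math.pi / (2 * N))
        * math.cos((2 * y + 1) * v * math.pi / (2 * N))
        for x in range(N) for y in range(N))
        for v in range(N)] for u in range(N)]

def idct2(coeffs):
    """Inverse of dct2 (exact, since the transform is orthonormal)."""
    def a(k): return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)
    return [[sum(
        a(u) * a(v) * coeffs[u][v]
        * math.cos((2 * x + 1) * u * math.pi / (2 * N))
        * math.cos((2 * y + 1) * v * math.pi / (2 * N))
        for u in range(N) for v in range(N))
        for y in range(N)] for x in range(N)]

def embed_bit(block, bit, strength=25.0):
    c = dct2(block)
    # Encode the bit in which of two mid-frequency coefficients is larger.
    mid = (c[2][3] + c[3][2]) / 2
    if bit:
        c[2][3], c[3][2] = mid + strength, mid - strength
    else:
        c[2][3], c[3][2] = mid - strength, mid + strength
    return idct2(c)

def extract_bit(block):
    c = dct2(block)
    return 1 if c[2][3] > c[3][2] else 0

block = [[(x * 16 + y * 8) % 256 for y in range(N)] for x in range(N)]
marked = embed_bit(block, 1)
print(extract_bit(marked))  # 1
```

Because the mark lives in mid frequencies rather than individual pixels, it tends to survive resizing and mild compression, which mostly perturb high-frequency content.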

Who Is This For

Creators, rights holders needing legal evidence, media companies, and organizations tracking document leaks. Korean/English bilingual, open source, Gradio-based.
sergiopaniego posted an update 2 days ago
What happens when you make an LLM drive a car where the physics is real and actions can't be undone?

I ported CARLA, the autonomous driving simulator, to OpenEnv and added training support via TRL + Hugging Face Spaces.

The model interacts with the simulator through tool calls (observe, brake, change lane) and learns from a reward signal.
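
The tool-call loop can be sketched with a toy environment exposing the three tools named above and a rubric-style reward. Everything here (function names, reward values, state fields) is hypothetical, not the actual OpenEnv API:

```python
# Toy "env": the model picks among observe/brake/change_lane tool calls
# and is scored on whether it avoided hitting the pedestrian.

def make_env(pedestrian_ahead=True):
    state = {"pedestrian_ahead": pedestrian_ahead, "speed": 30.0}

    def observe():
        return dict(state)

    def brake():
        state["speed"] = max(0.0, state["speed"] - 20.0)
        return dict(state)

    def change_lane():
        state["pedestrian_ahead"] = False
        return dict(state)

    return {"observe": observe, "brake": brake, "change_lane": change_lane}, state

def reward(state):
    # Rubric-style signal: a pedestrian ahead at speed is penalized.
    return -1.0 if state["pedestrian_ahead"] and state["speed"] > 10.0 else 1.0

tools, state = make_env()
tools["observe"]()    # the model "looks" first
tools["brake"]()      # then emits a brake tool call
print(reward(state))  # 1.0: slowed below the collision threshold
```

The training loop then reinforces tool-call sequences that earn positive reward, which is how a small model can learn to swerve or brake in a few dozen steps.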

In 50 training steps, Qwen 0.6B learns to swerve and brake to avoid pedestrians in emergency situations.

The project supports text and vision (VLMs can see through a camera sensor), open-world driving with traffic, and multiple driving scenarios.

This builds on the carla-env project by sinatras, which originally placed LLMs inside CARLA for evaluation. We extended it with vision, new scenarios, rubric-based rewards, and made it trainable end-to-end.

Blog: https://huggingface.co/blog/sergiopaniego/bringing-carla-to-openenv-trl/
CARLA env in OpenEnv: https://github.com/meta-pytorch/OpenEnv/tree/main/envs/carla_env
Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla.py
YatharthS posted an update 1 day ago
Just open-sourced LavaSR v2: a model that can enhance 5,000 seconds of audio in 1 second while delivering higher quality than giant, slow 6 GB diffusion models!

It works with any sampling rate from 8 to 48 kHz and is nearly 5,000x faster than the competition while being superior on objective benchmarks.

LavaSR v2 is perfect for:
- Enhancing TTS models.
- Fixing old audio datasets.
- Restoring low-quality recordings.

You can check out the examples and run it locally or online:

Repo: https://github.com/ysharma3501/LavaSR.git
Demo: YatharthS/LavaSR
Model: YatharthS/LavaSR
OzTianlu posted an update 2 days ago
Scaling UP in Kai! 🌊
NoesisLab/Kai-3B-Instruct

Introducing NoesisLab/Kai-3B-Instruct. What happens when you force a 3B model to reason entirely in its latent space?
Meet Kai-3B, our latest industrial-grade reasoning model fine-tuned using the Adaptive Dual Search (ADS) algorithm.
GSM8K (0-shot, Direct Answer): 39.27% 🤯 (Llama-2-7B is ~14.6%)
HumanEval (Pass@1): 39.02% 💻 (Overtakes Gemma-2-2B's 30%)
MMLU (5-shot): 53.62% 📚 (Crushing the 50% barrier)
ARC-Challenge: 51.88%🎯
PIQA: 77.53%
HellaSwag: 69.53%
Kai-3B proves that reasoning density doesn't strictly require parameter bloat or verbose generation. It acts as a perfect, cold-blooded Agent action-engine—ideal for JSON routing, SWE-bench patch generation, and anywhere you need absolute structured certainty without token waste.
albertvillanova posted an update 2 days ago
🚀 TRL v0.29.0 introduces trl-training: an agent-native training skill.

This makes the TRL CLI a structured, agent-readable capability, allowing AI agents to reliably execute training workflows such as:
- Supervised Fine-Tuning (SFT)
- Direct Preference Optimization (DPO)
- Group Relative Policy Optimization (GRPO)
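
As a quick way to try the release, a training run of the first workflow can be launched from the shell via the TRL CLI. The model and dataset names below are placeholders, and the exact flag set exposed to agents by trl-training should be checked against the v0.29.0 docs; treat this as an invocation sketch:

```shell
# Install the release and launch a small SFT run via the TRL CLI.
# Model/dataset names are placeholders.
pip install "trl>=0.29.0"

trl sft \
  --model_name_or_path Qwen/Qwen2.5-0.5B \
  --dataset_name trl-lib/Capybara \
  --output_dir ./sft-output
```

The same CLI shape applies to `trl dpo` and `trl grpo` for the other two workflows.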

We’re excited to see what the community builds on top of this.

If you’re working on AI agents, alignment research, or scalable RL training infrastructure: give TRL v0.29.0 a try! 🤗

The future of ML tooling is agent-native.
🔗 https://github.com/huggingface/trl/releases/tag/v0.29.0
GVA21q2 posted an update 2 days ago
# π.Guy.AI — AI-Powered Neuropedagogy Math Lessons

Students with math anxiety, ADHD, dyslexia, or low working memory need different learning experiences — but teachers can't create individualized materials for every student.

**π.Guy.AI** generates interactive HTML math lessons adapted to 7 cognitive profiles, using a multi-agent AI pipeline:

1. **Neuro-Interpreter** — enriches prompts with profile-specific adaptations
2. **Creative Agent** — generates a 12-slide lesson with SVG visualizations
3. **Quality Control** — validates against 8 neuropedagogy principles

Each lesson is a standalone HTML file with inline CSS/JS/SVG — works offline, no dependencies.
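
The three-stage pipeline above can be sketched as three plain functions whose output is one self-contained document. The stage names mirror the post; the bodies are illustrative stubs, not the actual agents:

```python
# Hypothetical sketch of the Neuro-Interpreter -> Creative Agent ->
# Quality Control pipeline. Profile adaptations and checks are stand-ins.

def neuro_interpreter(prompt, profile):
    adaptations = {"adhd": "short steps, frequent checkpoints",
                   "math_anxiety": "low-stakes framing, no timers"}
    return f"{prompt} [adapt: {adaptations.get(profile, 'default pacing')}]"

def creative_agent(enriched_prompt, n_slides=12):
    slides = [f"<section>Slide {i + 1}: {enriched_prompt}</section>"
              for i in range(n_slides)]
    return "<html><body>" + "".join(slides) + "</body></html>"

def quality_control(lesson_html, n_slides=12):
    # Stand-in for the 8-principle validation: here, just check slide count.
    return lesson_html.count("<section>") == n_slides

enriched = neuro_interpreter("Fractions on a number line", "adhd")
lesson = creative_agent(enriched)
print(quality_control(lesson))  # True
```

Keeping each stage a pure function over text makes it easy to swap in an LLM call per stage and to re-run only the validator when a lesson fails a principle.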

## The Model

Fine-tuned **Qwen2.5-7B-Instruct** with LoRA on 313 curated Hebrew math lessons.

- Model: [GVA21q2/piguyai-lessons-v2-enhanced](https://huggingface.co/GVA21q2/piguyai-lessons-v2-enhanced)
- Dataset: [GVA21q2/pi-guy-ai-lessons](https://huggingface.co/datasets/GVA21q2/pi-guy-ai-lessons)
- Demo: [GVA21q2/pi-guy-ai-demo](GVA21q2/pi-guy-ai-demo)
- Web app: [gva21q2.github.io/pi.guy.ai](https://gva21q2.github.io/pi.guy.ai/)

7 profiles: math anxiety, ADHD, dyslexia, dysgraphia, low working memory, visual processing, weak inhibition.

Built by [Guy Assal](https://www.guyassal.education)
nyuuzyou posted an update 2 days ago
🌍 Street-Level Imagery Dataset nyuuzyou/streetview

934,191 image records index Eastern Europe and Northern Asia, and temporal links connect historical views captured at identical coordinates across nine years.

Key Stats:

- 905,940 unique images
- Coverage spanning 2016 to 2025
- Average 14.3 historical links per location

Geographic bounds span 20.49° E to 152.32° E. Urban centers show higher data density.
ajibawa-2023 posted an update about 16 hours ago
Python-Code-Large
Dataset: ajibawa-2023/Python-Code-Large

Python-Code-Large is a large-scale corpus comprising more than 2 million rows of Python source code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the Python ecosystem.

By providing a high-volume, language-specific corpus, Python-Code-Large enables systematic experimentation in Python-focused model training, domain adaptation, and downstream code understanding tasks.

Python-Code-Large addresses the need for a dedicated Python-only dataset at substantial scale, enabling focused research across data science, backend systems, automation, scientific computing, and AI-driven Python environments.
BibbyResearch posted an update 1 day ago
Announcement:

BibbyResearch/China-Egocentric-Dataset-Robotics

Bibby AI, an AI LaTeX editor for research writing, has launched the above Chinese egocentric dataset for robotics research!
projectlosangeles posted an update 1 day ago