Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
7
7
15
Catherine Arnett
catherinearnett
Follow
chrismutava's profile picture
sumyyyyy's profile picture
GorkaUrbizu's profile picture
113 followers
·
38 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
updated
a model
4 days ago
catherinearnett/afr_Latn_als_Latn_9_91_bpe_nowhitespace_16384
published
a model
4 days ago
catherinearnett/afr_Latn_als_Latn_9_91_bpe_nowhitespace_16384
updated
a model
4 days ago
catherinearnett/afr_Latn_als_Latn_50_50_bpe_nowhitespace_16384
View all activity
Organizations
catherinearnett
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
about 1 month ago
mrlbenchmarks/global-piqa-parallel
Viewer
•
Updated
12 days ago
•
13.5k
•
1.84k
•
8
liked
a dataset
3 months ago
commoncrawl/CommonLID
Viewer
•
Updated
Feb 10
•
373k
•
226
•
52
liked
4 datasets
4 months ago
aaparajit02/punjabi-asr
Viewer
•
Updated
Jul 23, 2023
•
39.2k
•
206
•
3
aznlp/azerbaijani-blogs
Viewer
•
Updated
Apr 14, 2024
•
6.93k
•
26
•
3
MWirelabs/assamese-monolingual-corpus
Viewer
•
Updated
Nov 13, 2025
•
1.61M
•
33
•
1
Atnafu/Afri-MCQA
Viewer
•
Updated
Jan 15
•
15.3k
•
538
•
17
liked
a dataset
7 months ago
mrlbenchmarks/global-piqa-nonparallel
Viewer
•
Updated
12 days ago
•
13.5k
•
11.2k
•
35
liked
a dataset
8 months ago
nlip/DIWALI
Viewer
•
Updated
28 days ago
•
8.82k
•
144
•
6
liked
4 datasets
10 months ago
classla/ParlaSpeech-PL
Viewer
•
Updated
Jul 2, 2025
•
531k
•
151
•
6
classla/ParlaSpeech-HR
Viewer
•
Updated
Jul 2, 2025
•
868k
•
3.18k
•
5
classla/ParlaSpeech-CZ
Viewer
•
Updated
Jul 2, 2025
•
711k
•
3.99k
•
5
classla/ParlaSpeech-RS
Viewer
•
Updated
Dec 1, 2025
•
278k
•
1.14k
•
4
liked
a dataset
11 months ago
filbench/UD_Tagalog-NewsCrawl
Viewer
•
Updated
Jul 23, 2025
•
15.6k
•
56
•
1
liked
a dataset
about 1 year ago
jumelet/multiblimp
Viewer
•
Updated
May 16, 2025
•
121k
•
6.37k
•
17
liked
a dataset
almost 2 years ago
ambean/lingOly
Viewer
•
Updated
Jun 11, 2024
•
90
•
6.08k
•
9