Skip to content

Pull requests: huggingface/datatrove

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add slots=True to dataclasses
#405 opened Nov 14, 2025 by Rolv-Arild Loading…
ray nits
#403 opened Nov 11, 2025 by hynky1999 Loading…
Finepdfs
#402 opened Nov 11, 2025 by hynky1999 Loading…
chore(ci): upgrade checkout to v5
#384 opened Aug 27, 2025 by zkpepe Loading…
fix bos token missing
#346 opened Feb 13, 2025 by jquesnelle Loading…
Add open-source text extraction libraries
#293 opened Sep 27, 2024 by garrethlee Loading…
Mersenne prime hashing fix.
#200 opened May 28, 2024 by Apsod Loading…
Linewise filters
#125 opened Mar 14, 2024 by guipenedo Draft
ProTip! What’s not been updated in a month: updated:<2025-10-20.