The largest public domain dataset for training LLMs.
PleIAs
company
AI & ML interests
Open Science LLMs
Organization Card
About org cards
PleIAs is a French startup training LLMs with an open science approach.
Collections
1
models
None public yet
datasets
27
PleIAs/openalex_extraction
Updated
PleIAs/openalex-free-license
Viewer
•
Updated
PleIAs/Post-OCR-Correction
Updated
•
196
•
112
PleIAs/YouTube-Commons
Viewer
•
Updated
•
769
•
271
PleIAs/US-PD-Newspapers
Viewer
•
Updated
•
46
•
33
PleIAs/German-PD
Viewer
•
Updated
•
1
•
9
PleIAs/Greek-PD
Viewer
•
Updated
PleIAs/Czech-PD
Viewer
•
Updated
•
2
PleIAs/Serbian-PD
Viewer
•
Updated
•
1
PleIAs/Chinese-PD
Viewer
•
Updated
•
1