Making automatic speech recognition work on large files with Wav2Vec2 in 🤗 Transformers Feb 1, 2022 • 2
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 51 items • Updated 1 day ago • 15
SpaceByte: Towards Deleting Tokenization from Large Language Modeling Paper • 2404.14408 • Published Apr 22 • 6