view article Article Orquestrando Small Language Models (SLM) usando JavaScript e a API de Inferência do Hugging Face By rrg92 • 6 days ago • 1
view article Article Orchestrating Small Language Models (SLM) using JavaScript and the Hugging Face Inference API By rrg92 • 6 days ago • 1
view article Article FaceChain-FACT: Open-source 10-second portrait generation, reusing massive LoRa styles, a base-model-friendly portrait application. By haoyufirst • 10 days ago • 1
view article Article Formatting Datasets for Chat Template Compatibility By nroggendorff • 8 days ago • 1
view article Article Fine-tuning LLMs with Singular Value Decomposition By fractalego • 8 days ago • 3
view article Article FiftyOne Computer Vision Datasets Come to the Hugging Face Hub By jamarks • 7 days ago • 11
view article Article Orchestration of Experts: The First-Principle Multi-Model System By alirezamsh • 11 days ago • 14
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting Paper • 2405.18424 • Published 13 days ago • 7
VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections Paper • 2405.17991 • Published 13 days ago • 9
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models Paper • 2405.18377 • Published 13 days ago • 14
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning Paper • 2405.18386 • Published 13 days ago • 16
Yuan 2.0-M32: Mixture of Experts with Attention Router Paper • 2405.17976 • Published 13 days ago • 18
view article Article ⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2 By burtenshaw • 7 days ago • 20
view article Article Introducing Transformers Agent 2.0: A Leap Forward in Intelligent Automation By Andyrasika • 13 days ago • 8
view article Article Journey With Me Into The Mind of Large Language Models: Interesting Findings in AnthropicAI's Scaling Monosemanticity paper. By Jaward • 19 days ago • 2
view article Article Synthetic dataset generation techniques: generating custom sentence similarity data By davanstrien • 18 days ago • 11
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data Paper • 2403.11207 • Published Mar 17 • 14
view article Article Enjoy the Power of Phi-3 with ONNX Runtime on your device By Emma-N • 19 days ago • 20
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated 10 days ago • 318
view article Article Exploration of Job Application Automation with Data Scraping By herooooooooo • about 1 month ago • 3
view article Article Glaze and the Effectiveness of Anti-AI Methods for Diffusion Models By parsee-mizuhashi • 26 days ago • 3
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • 26 days ago • 5
LlamaForTokenClassification Collection Fine Tuned llama variants for Token Classification • 6 items • Updated 28 days ago • 2
Terminus XL Collection v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24 • 6
view article Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • 25 days ago • 15
view article Article Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework By Yescia • May 7 • 1
view article Article Advancing Open-source Large Language Models in the Medical & Healthcare Domain By aaditya • May 10 • 4
view article Article Train custom AI models with the trainer API and adapt them to 🤗 By not-lain • 8 days ago • 24
view article Article Knowledge Distillation for Fine-Tuning a GPT-3.5 Judge: Enhancing Accuracy and Performance By Andyrasika • 28 days ago • 4
view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • May 6 • 26
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • May 7 • 6
view article Article Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner By abhishek • May 9 • 7
view article Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7 • 27
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 34
view article Article ⚗️ 🧑🏼🌾 Let's grow some Domain Specific Datasets together By burtenshaw • Apr 29 • 27
view article Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 27 days ago • 21
view article Article Fish Speech V1 - New Multilingual Open Source TTS Model By lengyue233 • May 3 • 7
view article Article Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • Apr 30 • 1