view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models 4 days ago β’ 10
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper β’ 2405.09818 β’ Published 12 days ago β’ 92
OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Paper β’ 2405.12979 β’ Published 6 days ago β’ 7
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper β’ 2405.12981 β’ Published 6 days ago β’ 22
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Paper β’ 2405.12970 β’ Published 6 days ago β’ 20
Diffusion for World Modeling: Visual Details Matter in Atari Paper β’ 2405.12399 β’ Published 7 days ago β’ 25
πGGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! β’ 663 items β’ Updated 2 days ago β’ 23
INDUS: Effective and Efficient Language Models for Scientific Applications Paper β’ 2405.10725 β’ Published 10 days ago β’ 19
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community β’ 16 items β’ Updated 10 days ago β’ 175
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 11 items β’ Updated 10 days ago β’ 101
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 14 items β’ Updated 5 days ago β’ 131
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model 14 days ago β’ 121
view article Article Hugging Face x LangChain : A new partner package in LangChain 14 days ago β’ 67
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper β’ 2405.00732 β’ Published 29 days ago β’ 114
Customizing Text-to-Image Models with a Single Image Pair Paper β’ 2405.01536 β’ Published 25 days ago β’ 17
LLM-AD: Large Language Model based Audio Description System Paper β’ 2405.00983 β’ Published 26 days ago β’ 13
FLAME: Factuality-Aware Alignment for Large Language Models Paper β’ 2405.01525 β’ Published 25 days ago β’ 21
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper β’ 2405.01481 β’ Published 25 days ago β’ 20
WildChat: 1M ChatGPT Interaction Logs in the Wild Paper β’ 2405.01470 β’ Published 25 days ago β’ 53
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper β’ 2405.01434 β’ Published 25 days ago β’ 44
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper β’ 2405.01535 β’ Published 25 days ago β’ 100
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper β’ 2404.18796 β’ Published 28 days ago β’ 63
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting Paper β’ 2404.18911 β’ Published 28 days ago β’ 26
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper β’ 2404.16994 β’ Published Apr 25 β’ 31
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs Paper β’ 2404.16873 β’ Published Apr 21 β’ 25
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper β’ 2404.16710 β’ Published Apr 25 β’ 55
What matters when building vision-language models? Paper β’ 2405.02246 β’ Published 24 days ago β’ 87
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). β’ 6 items β’ Updated 24 days ago β’ 37
view article Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face 25 days ago β’ 13
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper β’ 2404.15653 β’ Published Apr 24 β’ 24
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper β’ 2309.10400 β’ Published Sep 19, 2023 β’ 22
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published Apr 22 β’ 120
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Apr 16 β’ 11
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 20 items β’ Updated 6 days ago β’ 286
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published Apr 22 β’ 237
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram β’ Apr 24 β’ 48
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais β’ Apr 18 β’ 20
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Apr 18 β’ 541
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees β’ 11 items β’ Updated Apr 15 β’ 22
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 131
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated 21 days ago β’ 82
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper β’ 2404.03715 β’ Published Apr 4 β’ 58
C4AI Command R Plus Collection C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. β’ 3 items β’ Updated 4 days ago β’ 17