My fav models miqudev/miqu-1-70b Updated Feb 4 β’ 136k β’ 970 Qwen/Qwen1.5-72B-Chat Text Generation β’ Updated 25 days ago β’ 33.2k β’ 210 segolilylabs/Lily-Cybersecurity-7B-v0.2 Text Generation β’ Updated Jan 22 β’ 396 β’ 40 nomic-ai/nomic-embed-text-v1 Sentence Similarity β’ Updated 22 days ago β’ 130k β’ 376
My fav papers JudgeLM: Fine-tuned Large Language Models are Scalable Judges Paper β’ 2310.17631 β’ Published Oct 26, 2023 β’ 31 Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Paper β’ 2310.08491 β’ Published Oct 12, 2023 β’ 49 Chain-of-Thought Reasoning Without Prompting Paper β’ 2402.10200 β’ Published Feb 15 β’ 91 BitDelta: Your Fine-Tune May Only Be Worth One Bit Paper β’ 2402.10193 β’ Published Feb 15 β’ 17
JudgeLM: Fine-tuned Large Language Models are Scalable Judges Paper β’ 2310.17631 β’ Published Oct 26, 2023 β’ 31
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Paper β’ 2310.08491 β’ Published Oct 12, 2023 β’ 49