SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter Pretraining of LLMs • arXiv:2405.16325
VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks • arXiv:2405.15179
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections • arXiv:2405.20271
SLTrain: A Sparse Plus Low-Rank Approach for Parameter and Memory Efficient Pretraining • arXiv:2406.02214
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors • arXiv:2405.19597
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters • arXiv:2405.17604
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform • arXiv:2405.03003 • Published May 5, 2024
NOLA: Networks as Linear Combination of Low Rank Random Basis • arXiv:2310.02556 • Published Oct 4, 2023
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model • arXiv:2405.20222
CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers • arXiv:2405.13195
CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory • arXiv:2402.13449 • Published Feb 21, 2024
Self-Selected Attention Span for Accelerating Large Language Model Inference • arXiv:2404.09336 • Published Apr 14, 2024
Tackling the Unlimited Staleness in Federated Learning with Intertwined Data and Device Heterogeneities • arXiv:2309.13536 • Published Sep 24, 2023
Frustratingly Simple Memory Efficiency for Pre-trained Language Models via Dynamic Embedding Pruning • arXiv:2309.08708 • Published Sep 15, 2023
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs • arXiv:2405.15208
Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition • arXiv:2405.14259
Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding • arXiv:2404.08698 • Published Apr 10, 2024
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models • arXiv:2405.14366
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget • arXiv:2404.04793 • Published Apr 7, 2024
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference • arXiv:2405.12532
Unlimiformer: Long-Range Transformers with Unlimited Length Input • arXiv:2305.01625 • Published May 2, 2023
Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark • arXiv:2404.16563 • Published Apr 25, 2024
4-bit Shampoo for Memory-Efficient Network Training • arXiv:2405.18144
LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models • arXiv:2405.18377
Language Models Trained to do Arithmetic Predict Human Risky and Intertemporal Choice • arXiv:2405.19313
Non-parametric, Nearest-neighbor-assisted Fine-tuning for Neural Machine Translation • arXiv:2305.13648 • Published May 23, 2023
Jina CLIP: Your CLIP Model Is Also Your Text Retriever • arXiv:2405.20204