Omar Sanseviero's picture

Omar Sanseviero

osanseviero

·

https://osanseviero.github.io/hackerllama/

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Articles

Welcome Llama 3 - Meta's new open LLM

CodeGemma - an official Google release for code LLMs

🪆 Introduction to Matryoshka Embedding Models

Welcome Gemma - Google's new open LLM

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Mixture of Experts Explained

Inference for PROs

Spread Your Wings: Falcon 180B is here

Code Llama: Llama 2 learns to code

Results of the Open Source AI Game Jam

Llama 2 is here - get it on Hugging Face

The Falcon has landed in the Hugging Face ecosystem

Hugging Face Machine Learning Demos on arXiv

What's new in Diffusers? 🎨

Announcing Evaluation on the Hub

An Introduction to Deep Reinforcement Learning

Welcome spaCy to the 🤗 Hub

Sentence Transformers in the 🤗 Hub

Organizations

osanseviero's activity

New activity in Bin12345/AutoCoder about 5 hours ago

Add link to paper

#1 opened about 5 hours ago by

New activity in Bin12345/AutoCoder_S_6.7B about 5 hours ago

Add link to paper

#2 opened about 5 hours ago by

New activity in wyysf/CraftsMan about 5 hours ago

Add metadata + paper link

#1 opened about 5 hours ago by

commented a paper about 5 hours ago

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published 3 days ago • 34 •

New activity in merve/paligemma-tracking about 12 hours ago

Cache examples

#1 opened about 12 hours ago by

New activity in wyysf/CraftsMan about 12 hours ago

🚩 Report: Not working

#3 opened about 12 hours ago by

New activity in meta-llama/Meta-Llama-3-8B about 14 hours ago

Update README.md

#160 opened 3 days ago by

Update README.md

#161 opened 2 days ago by

safetensors vs pth file

#162 opened 1 day ago by

New activity in mistralai/Mistral-7B-Instruct-v0.3 5 days ago

Add minor reference to transformers

#7 opened 5 days ago by

New activity in mistralai/Mistral-7B-v0.3 5 days ago

Add minor reference to transformers

#5 opened 5 days ago by

New activity in meta-llama/Meta-Llama-3-8B 5 days ago

Request: DOI

#149 opened 5 days ago by

Request: DOI

#150 opened 5 days ago by

narendrapatelegn

New activity in eloialonso/diamond 6 days ago

Add metadata

#2 opened 6 days ago by

New activity in SpacesExamples/jupyterlab 6 days ago

specify user uid = 1000 based on dev mode docs: https://huggingface.co/dev-mode-explorers

#8 opened 6 days ago by

update dependencies for compatibility with dev mode

#7 opened 6 days ago by

New activity in big-vision/paligemma-hf 7 days ago

Space won't load

#1 opened 8 days ago by

New activity in microsoft/phi-1 8 days ago

Add link to paper

#10 opened 8 days ago by

New activity in meta-llama/Meta-Llama-3-8B 11 days ago

Rename README.md to auto

#142 opened 11 days ago by

New activity in google/paligemma-3b-pt-896 11 days ago

ImportError: cannot import name 'PaliGemmaForConditionalGeneration' from 'transformers'

#2 opened 13 days ago by

New activity in paris-ai-running-club/README 11 days ago

FOMO

#1 opened 11 days ago by

New activity in google/gemma-7b-it 11 days ago

Rename README.md to !pip install transformers datasets

#92 opened 12 days ago by

New activity in google/paligemma-3b-pt-224 13 days ago

ImportError: cannot import name 'PaliGemmaForConditionalGeneration' from 'transformers'

#2 opened 13 days ago by

New activity in KingNish/OpenGPT-4o 13 days ago

First try- very cool!

#1 opened 13 days ago by

New activity in amazon/chronos-t5-base 14 days ago

Add pipeline for forecasting

#4 opened 14 days ago by

New activity in amazon/chronos-t5-small 14 days ago

Add pipeline for time series forecasting

#3 opened 14 days ago by

New activity in amazon/chronos-t5-mini 14 days ago

Add pipeline for time series forecasting

#5 opened 14 days ago by

New activity in amazon/chronos-t5-tiny 14 days ago

Add pipeline tag

#3 opened 14 days ago by

New activity in amazon/chronos-t5-large 14 days ago

Add tag for time series forecasting

#5 opened 14 days ago by

New activity in time-series-foundation-models/Lag-Llama 14 days ago

Add pipeline tag

#3 opened 14 days ago by

New activity in AutonLab/MOMENT-1-large 14 days ago

Add a pipeline tag for time series forecasting

#1 opened 14 days ago by

New activity in google/timesfm-1.0-200m 14 days ago

Fix pipeline type

#7 opened 14 days ago by

New activity in meta-llama/Meta-Llama-3-8B 14 days ago

error

#107 opened 26 days ago by

New activity in google/timesfm-1.0-200m 14 days ago

add time-series tag

#5 opened 14 days ago by

New activity in meta-llama/Meta-Llama-Guard-2-8B 15 days ago

Change license from other to llama3

#10 opened 21 days ago by

New activity in meta-llama/Meta-Llama-3-8B-Instruct 15 days ago

Change license from other to llama3

#92 opened 21 days ago by

New activity in meta-llama/Meta-Llama-3-8B 15 days ago

Change license from other to llama3

#121 opened 21 days ago by

New activity in meta-llama/Meta-Llama-3-70B-Instruct 15 days ago

Change license from other to llama3

#47 opened 21 days ago by

New activity in meta-llama/Meta-Llama-3-70B 15 days ago

Change license from other to llama3

#13 opened 21 days ago by

New activity in meta-llama/Meta-Llama-3-8B-Instruct 18 days ago

Update README.md

#65 opened about 1 month ago by

New activity in meta-llama/Meta-Llama-3-70B-Instruct 18 days ago

Request: DOI

#43 opened 24 days ago by

NextGenDeveloper

New activity in meta-llama/Meta-Llama-3-8B 18 days ago

tokenizer doesn't work with the old API ?

#43 opened about 1 month ago by

423测试

#52 opened about 1 month ago by

i can't install at colap

#59 opened about 1 month ago by

Request: request to access llama3 please!!!

#69 opened about 1 month ago by

horse racing gamble

#78 opened about 1 month ago by

Request: DOI

#79 opened about 1 month ago by

Request: DOI

#80 opened about 1 month ago by

Request: DOI

#81 opened about 1 month ago by

luke

#92 opened 29 days ago by

Update Readme.md

#93 opened 29 days ago by

1111

#94 opened 29 days ago by

🚩 Report: Ethical issue(s)

#38 opened about 1 month ago by

myWork

#106 opened 26 days ago by

llama

#115 opened 23 days ago by

The Serverless Inference API: "The model meta-llama/Meta-Llama-3-8B is too large to be loaded automatically (16GB > 10GB)"

#31 opened about 1 month ago by

Cannot access gated repo

#123 opened 20 days ago by

I was trying to fine-tune llama3 8b but getting following error - TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'decoder_input_ids'

#117 opened 22 days ago by

Llama-3-70b tokenizer.

#116 opened 22 days ago by

New activity in google/timesfm-1.0-200m 19 days ago

Add some metadata

#1 opened 19 days ago by