Merve Noyan PRO

merve

mervenoyann

merveenoyan

AI & ML interests

VLMs, vision & co

Articles

Organizations

merve's activity

New activity in google/paligemma-3b-pt-224 3 days ago

How <seg[value]> tokens generate the masks in segmentation tasks?

#10 opened 3 days ago by

cmgzy

New activity in merve/paligemma-doc 3 days ago

Broken

#1 opened 4 days ago by

ndurner

New activity in OpenGVLab/VideoChat2-IT 5 days ago

Dataset download

#2 opened 3 months ago by

adeo

New activity in HuanjinYao/DenseConnector-v1.5-8B 5 days ago

Space GPU grant

#1 opened 5 days ago by

merve

New activity in nlpzhaof/aligngpt-7b 5 days ago

Demo on Hub & GPU grant

#2 opened 5 days ago by

merve

commented 5 papers 5 days ago

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published 8 days ago • 41 •

iVideoGPT: Interactive VideoGPTs are Scalable World Models

Paper • 2405.15223 • Published 9 days ago • 11 •

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Paper • 2405.15574 • Published 8 days ago • 45 •

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

Paper • 2404.19752 • Published Apr 30 • 19 •

Dense Connector for MLLMs

Paper • 2405.13800 • Published 10 days ago • 20 •

New activity in XiaoduoAILab/Xmodel_VLM 5 days ago

Space with GPU grant

#1 opened 5 days ago by

merve

New activity in google/paligemma-3b-pt-448 6 days ago

Flash attention ?

#3 opened 11 days ago by

edmond

Manual training

#4 opened 10 days ago by

edmond

New activity in merve/paligemma_vqav2 6 days ago

fine-tuned format

#1 opened 7 days ago by

baiall

New activity in big-vision/paligemma-hf 10 days ago

Different results between Jax Space and the HF Transformers Space

#2 opened 12 days ago by

Shalev

New activity in google/gemma-2b 10 days ago

Following blog for fine tuning gemma-2b doesn't yield same results

#60 opened 13 days ago by

chongdashu

New activity in google/paligemma-3b-mix-224 10 days ago

blog: https://huggingface.co/blog/paligemma

#2 opened 18 days ago by

NickyNicky

New activity in THUDM/cogvlm2-llama3-chat-19B 10 days ago

Demo for the model

#1 opened 12 days ago by

mervenoyan

New activity in google/paligemma-3b-mix-448 16 days ago

How to keep it grounded ?

#3 opened 17 days ago by

Jayakumark

New activity in google/paligemma-3b-ft-vqav2-448 16 days ago

training data example for this model, also what type of query it expects

#2 opened 17 days ago by

Nocte

New activity in google/paligemma-3b-pt-896 16 days ago

ImportError: cannot import name 'PaliGemmaForConditionalGeneration' from 'transformers'

#2 opened 18 days ago by

bbouldin

New activity in google/paligemma-3b-pt-896 18 days ago

how to use segment for objects?

#3 opened 18 days ago by

paul91

New activity in google/paligemma-3b-pt-224 18 days ago

Batch Decoding

#3 opened 18 days ago by

vody-am

ImportError: cannot import name 'PaliGemmaForConditionalGeneration' from 'transformers'

#2 opened 18 days ago by

mdeniz1

New activity in google/paligemma-3b-mix-448 18 days ago

Weird output

#2 opened 18 days ago by

MoonRide

New activity in BAAI/Bunny-v1_1-4B 23 days ago

Demo

#1 opened 23 days ago by

merve

New activity in merve/llava-next 23 days ago

What is the minimum Space Hardware to run this (cloned) Space?

#10 opened about 1 month ago by

KHCHEUNG-UoSHK

New activity in OmAlve/Swin-Transformer-Foods101 26 days ago

nits

#1 opened 26 days ago by

merve

New activity in huggingface/cookbook-images about 1 month ago

Dataset info picture for cookbook notebook "Fine-tune a Vision Transformer With a Custom Biomedical Dataset"

#14 opened about 1 month ago by

emre570

New activity in Rageshhf/medi-classifier about 1 month ago

Add example inputs

#1 opened about 1 month ago by

merve

New activity in emre570/google-vit-large-finetuned about 1 month ago

Nits :')

#1 opened about 1 month ago by

merve

New activity in huggingface/cookbook-images about 1 month ago

Images and GIFs for Artistic Analysis Recipe

#13 opened about 1 month ago by

jamarks

New activity in FoundationVision/groma-7b-finetune about 1 month ago

Model card and demo

#1 opened about 1 month ago by

merve

New activity in ermu2001/pllava-34b about 1 month ago

Host the demo on Hugging Face Spaces instead

#1 opened about 1 month ago by

merve

New activity in merve/BLIP2-with-transformers about 1 month ago

Fix

#3 opened about 1 month ago by

hysts

New activity in wcy1122/MGM about 1 month ago

Models in model repository

#1 opened about 1 month ago by

merve

New activity in hf-vision/course-assets about 1 month ago

Upload Faster RCNN.png

#84 opened about 1 month ago by

sitammeur

New activity in kanashi6/GiT about 1 month ago

Space to try the model

#1 opened about 1 month ago by

merve

commented a paper about 1 month ago

COCONut: Modernizing COCO Segmentation

Paper • 2404.08639 • Published Apr 12 • 25 •

New activity in MM-UPD/MM-UPD about 1 month ago

Have a benchmark leaderboard for the dataset

#1 opened about 1 month ago by

merve

New activity in HuggingFaceM4/idefics2_playground about 2 months ago

Update app_dialogue.py

#1 opened about 2 months ago by

merve

New activity in merve/llava-next 2 months ago

Error

#4 opened 2 months ago by

ShermanAI

New activity in BAAI/seggpt-vit-large 2 months ago

Added task tag

#1 opened 2 months ago by

merve

New activity in merve/UDOP 2 months ago

Most Excellent!

#1 opened 2 months ago by

awacke1

New activity in llava-hf/llava-v1.6-34b-hf 2 months ago

Update README.md

#1 opened 2 months ago by

merve

commented a paper 2 months ago

CoLLaVO: Crayon Large Language and Vision mOdel

Paper • 2402.11248 • Published Feb 17 • 17 •

New activity in kadirnar/Open-Sora 3 months ago

Demo only one model

#1 opened 3 months ago by

merve

New activity in merve/compare_clip_siglip 3 months ago

CLIP better than Siglip?

#1 opened 3 months ago by

kexul

New activity in hpcai-tech/Open-Sora 3 months ago

Some nits 🤗

#1 opened 3 months ago by

merve

New activity in opencompass/open_vlm_leaderboard 3 months ago

Great work!

#2 opened 3 months ago by

merve

New activity in google/metricx-23-xl-v2p0 3 months ago

Space to try the model

#1 opened 3 months ago by

merve

New activity in facebook/hiera_tiny_224.mae_in1k_ft_in1k 3 months ago

Create README.md

#2 opened 3 months ago by

merve

New activity in facebook/hiera_tiny_224.mae_in1k 3 months ago

Create README.md

#2 opened 3 months ago by

merve

New activity in facebook/hiera_huge_16x224.mae_k400_ft_k400 3 months ago

Create README.md

#2 opened 3 months ago by

merve

New activity in facebook/hiera_large_16x224.mae_k400 3 months ago

Create README.md

#2 opened 3 months ago by

merve

New activity in facebook/hiera_large_224.mae_in1k 3 months ago

Create README.md

#1 opened 3 months ago by

merve

New activity in facebook/hiera_tiny_224.mae_in1k_ft_in1k 3 months ago

Create README.md

#1 opened 3 months ago by

merve

New activity in facebook/hiera_huge_224.mae_in1k_ft_in1k 3 months ago

Create README.md

#1 opened 3 months ago by

merve

New activity in facebook/hiera_huge_224.mae_in1k 3 months ago

Create README.md

#1 opened 3 months ago by

merve

New activity in facebook/hiera_base_224.mae_in1k 3 months ago

Create README.md

#2 opened 3 months ago by

merve

Articles

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Vision Language Models Explained

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

Deploy MusicGen in no time with Inference Endpoints

Open-Source Text Generation & LLM Ecosystem at Hugging Face

Jupyter X Hugging Face

Using Machine Learning to Aid Survivors and Race through Time

Introducing Skops

Announcing the Hugging Face Fellowship Program

Showcase Your Projects in Spaces using Gradio

Hosting your Models and Datasets on Hugging Face Spaces using Streamlit
