Undi95 (Undi)

Posts 5

Post

891

Hey everyone,

Just wanted to shout out a massive thank you to all 2000 of you who've followed me on Hugging Face! 🎉 It's incredible to have such an awesome crew backing me up as I dive into all these LLM experiments.

Even though not all my models turn out perfect, I've found some real gems and methods along the way 💎. It's like digging for treasure – sometimes you found nothing, but sometimes you find a pearl, and sometimes you find a new method to try.

Your support and encouragement mean the world to me, and I'm really stoked to keep experimenting and learning. If you told me some years ago I would have so much people following me for what I do, I wouldn't have believed it. Here's to more discoveries and adventures ahead! 🚀

Also, big thanks once again, and a huge shoutout to @IkariDev for being there through this journey and supporting me. I'm excited for our future work together and hope we will continue to make people happy! 👏

I want to thank @Gryphe too, since my early work was heavily inspired from MythoMax and the RP/ERP vibe of it. If I'm here today it's probably because of you 😂

I was so close to forget @chargoddard and his amazing tool too! What will we do without mergekit in our life? Thank you! 🙏

See y'all at 3k!

Post

6506

Hello!
The 8B/70B OG Llama-3 models made with the Orthogonal Activation Steering script as been pushed in private.

After multiple test with an empty prompt system, I can confirm it's not uncensored enough, but I wanted to try all the GGUF before (and it take time to do lmao)

If you want to try that yourself, here is the script : https://gist.github.com/wassname/42aba7168bb83e278fcfea87e70fa3af
And here is the same script that we modified to be able to use it on multiple GPU for 70B : https://files.catbox.moe/ya4rto.ipynb

Llama3-Unholy-8B-OAS don't have the problem as it was already trained to be less censored, but the OG one was really too much censored.

I will try to redo that soon, as it seems to HAVE WORKED for some prompt (as seen on the log, for exemple) but it's not enough.

32 entry of the dataset is clearly not enough, but it's okay, I really wanted to try that as it was something new.
I could take the Unholy way and retrain the 70B before using OAS but it should work without, that's not the goal.

View all posts

Collections 8

models 259

datasets 9

Undi95/orthogonal-activation-steering-TOXIC

Viewer • Updated 10 days ago • 1 • 6

Undi95/CoupleRP

Updated Apr 1 • 2

Undi95/Capybara-ShareGPT

Viewer • Updated Mar 23 • 2

Undi95/pippa_perplexity

Viewer • Updated Feb 11 • 1

Undi95/andrijdavid_roleplay-conversation-sharegpt

Viewer • Updated Feb 8 • 3 • 5

Undi95/ConversationChronicles-sharegpt-SHARDED

Viewer • Updated Jan 16 • 6

Undi95/toxic-dpo-v0.1-sharegpt

Viewer • Updated Jan 15 • 317 • 12

Undi95/toxic-dpo-v0.1-NoWarning

Viewer • Updated Jan 10 • 78 • 10

Undi95/oasst2_toxic

Preview • Updated Dec 24, 2023 • 1

Undi PRO

AI & ML interests

Organizations

Posts 5

Collections 8

Undi95/ReMM-SLERP-L2-13B

Undi95/ReMM-v2-L2-13B

Undi95/ReMM-v2.1-L2-13B

Undi95/ReMM-v2.2-L2-13B

Undi95/Llamix2-MLewd-4x13B

Undi95/Xwin-MLewd-13B-V0.2

Undi95/MLewd-v2.4-13B

Undi95/MLewd-Chat-v2-13B

models 259

Undi95/Meta-Llama-3-70B-Instruct-hf

Undi95/Meta-Llama-3-8B-hf

Undi95/Meta-Llama-3-8B-Instruct-hf

Undi95/Meta-Llama-3-70B-Instruct-OAS-GGUF

Undi95/Meta-Llama-3-70B-Instruct-OAS

Undi95/Unholy-8B-DPO-OAS

Undi95/Unholy-8B-DPO-OAS-GGUF

Undi95/Llama3-Unholy-8B-OAS-GGUF

Undi95/Llama3-Unholy-8B-OAS

Undi95/Llama-3-LewdPlay-8B-evo-GGUF

datasets 9

Undi95/orthogonal-activation-steering-TOXIC

Undi95/CoupleRP

Undi95/Capybara-ShareGPT

Undi95/pippa_perplexity

Undi95/andrijdavid_roleplay-conversation-sharegpt

Undi95/ConversationChronicles-sharegpt-SHARDED

Undi95/toxic-dpo-v0.1-sharegpt

Undi95/toxic-dpo-v0.1-NoWarning

Undi95/oasst2_toxic

Undi PRO

AI & ML interests

Organizations

Posts 5

Collections 8

models 259 Sort: Recently updated

datasets 9 Sort: Recently updated

models 259

datasets 9