lhl PRO

leonardlin

https://randomfoo.net/

lhl

AI & ML interests

None yet

Articles

Organizations

Posts 6

Post

2161

Maybe of interest, I just finished a long writeup of my weekend project exploring Qwen 2 7B Instruct's Chinese censorship: https://huggingface.co/blog/leonardlin/chinese-llm-censorship-analysis

I also have an accompanying model and dataset (and codebase) for those curious to poke around:

* augmxnt/Qwen2-7B-Instruct-deccp

* augmxnt/deccp

Post

1899

Interesting, I've just seen my first HF spam on one of my new model uploads: shisa-ai/shisa-v1-llama3-70b - someone has an SEO spam page as an HF space attached to the model!?! Wild. Who do I report this to?

Collections 23

spaces 1

Shisa Ablations

models

None public yet

datasets

None public yet

Articles

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Not Legal Advice on AI Training Data in Japan

Evaling llm-jp-eval (evals are hard)

Collections 23

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU

Accelerating LLM Inference with Staged Speculative Decoding

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
