CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models
imgsys.org -- arena for text-guided image generation
Track, rank and evaluate open LLMs' CoT quality
View how beam search decoding works, in detail!
Jailbreak the LLM and privacy guardrails