Edoardo Federici

efederici

AI & ML interests

llms, ir, graphs & co

Organizations

Posts 1

view post
Post
1043
Finally, I can post! šŸš€

I created a Capybara-inspired Italian dataset by translating the initial instruction and running it through a pipeline to generate conversations. I used Claude Sonnet for translation and instruction generation, and Opus for generating the answers.

I hope this dataset proves useful for people working on šŸ‡®šŸ‡¹ language models.

ā› Open sourcing the dataset here: efederici/capybara-claude-15k-ita