A collection of evaluation benchmarks for the Italian language.
Simone Conia
AI & ML interests
Natural Language Processing, Multilinguality, Knowledge Graphs, Semantics, Large Language Models
Recent Activity
updated a model about 4 hours ago
principled-intelligence/scope-guard-4B-q-2601 authored a paper 13 days ago
ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering authored a paper 13 days ago
AgREE: Agentic Reasoning for Knowledge Graph Completion on Emerging Entities