Tokenizer used by the submitted model
AI & ML interests
Large language models
Datasets (26)
geniacllm/livedoor_news_corpus
Viewer • Updated • 2.77k • 11 • 1
geniacllm/wikipedia_v2
Preview • Updated • 120
geniacllm/made_by_llm_and_human
Viewer • Updated • 2.64k • 13
geniacllm/hanrei
Viewer • Updated • 2.9M • 67
geniacllm/gsm8k
Viewer • Updated • 1.03M • 68
geniacllm/aozora_bunko
Viewer • Updated • 10.2k • 9
geniacllm/kokkai_v2
Preview • Updated • 14
geniacllm/dataset_from_other_team
Viewer • Updated • 27.1k • 10
geniacllm/wiki40b
Viewer • Updated • 1.2M • 14
geniacllm/CulturaX_default_filtered_ja_10b
Preview • Updated • 26