arxiv:2604.08644

EXAONE 4.5 Technical Report

Published on Apr 9 · Submitted by taesiri on Apr 13

Authors:

Kyungmin Lee,

Kwangrok Ryoo,

Abstract

EXAONE 4.5 is an open-weight vision language model that integrates a visual encoder into EXAONE 4.0, achieving enhanced document understanding and general language capabilities through targeted data curation and extended context length.

AI-generated summary

This technical report introduces EXAONE 4.5, the first open-weight vision language model released by LG AI Research. EXAONE 4.5 is architected by integrating a dedicated visual encoder into the existing EXAONE 4.0 framework, enabling native multimodal pretraining over both visual and textual modalities. The model is trained on large-scale data with careful curation, particularly emphasizing document-centric corpora that align with LG's strategic application domains. This targeted data design enables substantial performance gains in document understanding and related tasks, while also delivering broad improvements across general language capabilities. EXAONE 4.5 extends context length up to 256K tokens, facilitating long-context reasoning and enterprise-scale use cases. Comparative evaluations demonstrate that EXAONE 4.5 achieves competitive performance in general benchmarks while outperforming state-of-the-art models of similar scale in document understanding and Korean contextual reasoning. As part of LG's ongoing effort toward practical industrial deployment, EXAONE 4.5 is designed to be continuously extended with additional domains and application scenarios to advance AI for a better life.

Community

avahal

about 2 hours ago

Interesting breakdown of this paper on arXivLens: https://arxivlens.com/PaperView/Details/exaone-4-5-technical-report-6583-450f8ed7
Covers the executive summary, detailed methodology, and practical applications.