EXAONE-Deep
HomeModel OverviewTechnical FeaturesPerformanceApplicationsDeploymentTutorials

EXAONE-Deep Model Overview

About EXAONE-Deep

EXAONE-Deep is a series of open-source large language models developed by LG AI Research, optimized for enhanced reasoning capabilities. These models excel in mathematical reasoning, scientific understanding, and code generation, making them particularly suitable for complex problem-solving tasks.

Available in three sizes (32B, 7.8B, and 2.4B parameters), EXAONE-Deep models offer a balance of performance and efficiency for different deployment scenarios, from high-performance servers to more resource-constrained environments.

What sets EXAONE-Deep apart is its focus on reasoning tasks, where it outperforms many similarly-sized open-source models across various benchmarks related to mathematics, science, and code generation.

Model Specifications

EXAONE-Deep is available in three model sizes to accommodate different computational resources and use cases:

ModelParametersLayersAttention HeadsVocabulary SizeContext Length
EXAONE-Deep-32B30.95B (excluding embeddings)64GQA with 40 Q-heads and 8 KV-heads102,40032,768 tokens
EXAONE-Deep-7.8B7.77B (excluding embeddings)32GQA with 32 Q-heads and 8 KV-heads102,40032,768 tokens
EXAONE-Deep-2.4B2.37B (excluding embeddings)24Standard attention with 16 heads102,40032,768 tokens

All models are built on a decoder-only transformer architecture and utilize a tokenizer with a 102,400 vocabulary size. The models support sequence lengths of up to 32,768 tokens, making them suitable for processing lengthy documents and complex reasoning chains.

Key Features

Mathematical Reasoning

EXAONE-Deep excels at various mathematical tasks, demonstrating strong performance on benchmarks such as:

  • MATH-500

    A benchmark of challenging mathematical problems across various domains

  • AIME 2024/2025

    American Invitational Mathematics Examination problems

  • CSAT Mathematics

    College Scholastic Ability Test mathematics problems

Scientific Reasoning

The model demonstrates strong capabilities in scientific problem-solving:

  • GPQA Diamond

    Graduate-level physics, chemistry, and biology problems

  • Multi-step reasoning

    Ability to follow complex chains of scientific reasoning

  • Domain expertise

    Specialized knowledge across physics, chemistry, biology, and other scientific fields

Coding Abilities

EXAONE-Deep performs exceptionally well in code-related tasks:

  • LiveCodeBench

    Strong performance on coding benchmarks requiring algorithmic problem-solving

  • Multiple languages

    Support for Python, JavaScript, Java, C++, and other programming languages

  • Code explanation

    Ability to explain code functionality and assist with debugging

Deployment Support

The models are designed for practical deployment:

  • Multiple model sizes

    32B, 7.8B, and 2.4B variants for different resource constraints

  • Quantization support

    AWQ and GGUF formats for optimized inference

  • Framework compatibility

    Support for various inference frameworks including Transformers, vLLM, llama.cpp, and TensorRT-LLM

Applications

EXAONE-Deep is designed for applications requiring strong reasoning capabilities:

Math Problem Solving

Solving complex mathematical problems with step-by-step reasoning, particularly useful for educational applications, tutoring systems, and research assistance.

Scientific Research Assistance

Analyzing scientific literature, generating hypotheses, and assisting researchers in exploring scientific concepts across various domains including physics, chemistry, and biology.

AI Inference Applications

Powering reasoning-focused AI applications that require logical thinking, problem-solving, and complex decision-making capabilities across various domains.

Research Background

EXAONE-Deep was developed by LG AI Research with the goal of creating large language models that excel specifically at reasoning tasks. The development focused on optimizing model architecture and training methodology to enhance performance on complex reasoning problems while maintaining efficiency.

The models leverage architectural innovations such as Grouped-Query Attention (GQA) in the larger models, which helps balance computational efficiency and model performance. The training process incorporated specialized datasets and techniques to enhance reasoning capabilities across mathematical, scientific, and programming domains.

EXAONE-Deep represents an important step in the development of specialized AI systems that can handle complex reasoning tasks that were previously challenging for language models, particularly in domains requiring structured logical thinking.

Related Resources

GitHub Repository

Access the model code, documentation, and examples

View on GitHub
Technical Documentation

Detailed technical specifications and implementation guides

View Documentation
Hugging Face Models

Download models and explore interactive demos

Explore Models