Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2307.15337

Papers from the NICS-EFFALG Team

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27 • 71
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 39
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38
DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12, 2024 • 25

💫 Glossary https://osanseviero.github.io/hackerllama/blog/posts/hitchhiker_guide/

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4, 2024 • 65
Learning To Teach Large Language Models Logical Reasoning

Paper • 2310.09158 • Published Oct 13, 2023 • 1
ChipNeMo: Domain-Adapted LLMs for Chip Design

Paper • 2311.00176 • Published Oct 31, 2023 • 9
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Paper • 2308.09583 • Published Aug 18, 2023 • 7

Papers related to prompt engineering and optimizers

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53
Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 77
Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Paper • 2308.09687 • Published Aug 18, 2023 • 7
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

Paper • 2211.12588 • Published Nov 22, 2022 • 3

Batched decoding

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Paper • 2310.13961 • Published Oct 21, 2023 • 5
Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5
AutoMix: Automatically Mixing Language Models

Paper • 2310.12963 • Published Oct 19, 2023 • 14
SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network

Paper • 2310.09049 • Published Oct 13, 2023 • 1

Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes

Paper • 2301.01751 • Published Jan 4, 2023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

Paper • 2307.11768 • Published Jul 17, 2023 • 13
Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38

Papers from the NICS-EFFALG Team

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27 • 71
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 39
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38
DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12, 2024 • 25

Batched decoding

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38

💫 Glossary https://osanseviero.github.io/hackerllama/blog/posts/hitchhiker_guide/

Understanding LLMs: A Comprehensive Overview from Training to Inference

Paper • 2401.02038 • Published Jan 4, 2024 • 65
Learning To Teach Large Language Models Logical Reasoning

Paper • 2310.09158 • Published Oct 13, 2023 • 1
ChipNeMo: Domain-Adapted LLMs for Chip Design

Paper • 2311.00176 • Published Oct 31, 2023 • 9
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

Paper • 2308.09583 • Published Aug 18, 2023 • 7

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Paper • 2310.13961 • Published Oct 21, 2023 • 5
Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5
AutoMix: Automatically Mixing Language Models

Paper • 2310.12963 • Published Oct 19, 2023 • 14
SAI: Solving AI Tasks with Systematic Artificial Intelligence in Communication Network

Paper • 2310.09049 • Published Oct 13, 2023 • 1

Papers related to prompt engineering and optimizers

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53
Large Language Models as Optimizers

Paper • 2309.03409 • Published Sep 7, 2023 • 77
Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Paper • 2308.09687 • Published Aug 18, 2023 • 7
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

Paper • 2211.12588 • Published Nov 22, 2022 • 3

Iterated Decomposition: Improving Science Q&A by Supervising Reasoning Processes

Paper • 2301.01751 • Published Jan 4, 2023
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

Paper • 2307.11768 • Published Jul 17, 2023 • 13
Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 39
Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 38

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs