# EfficientAI **Repository Path**: alibaba/EfficientAI ## Basic Information - **Project Name**: EfficientAI - **Description**: No description available - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-01-05 - **Last Updated**: 2026-05-17 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # ๐Ÿš€ EfficientAI > **Efficient Inference for LLMs & MLLMs** > An open-source research project from Alibaba Cloud dedicated to efficient large language model inference.

EfficientAI Banner

License Papers Stars Issues

--- ## ๐Ÿ“‹ Table of Contents - [โœจ Key Features](#-key-features) - [๐Ÿ”ฅ Latest Updates](#-latest-updates) - [๐Ÿ“ฆ Installation](#-installation) - [โšก Quick Start](#-quick-start) - [๐Ÿงช Benchmarks](#-benchmarks) - [๐Ÿ“š Publications](#-publications) - [๐Ÿค Contributing](#-contributing) - [๐Ÿ“„ License](#-license) - [โœ‰๏ธ Contact](#-contact) --- ## โœจ Key Features EfficientAI focuses on **inference-time optimizations** for LLMs and MLLMs: | Feature | Description | Status | |---------|-------------|--------| | ๐Ÿ”น **Activation Sparsity** | Dynamic sparsity methods for faster inference | โœ… LaRoSa (ICML 2025) | | ๐Ÿ”น **Quantization** | Post-training & quantization-aware techniques for MLLMs | โœ… MASQuant (CVPR 2026) | | ๐Ÿ”น **Agentic Reasoning** | Efficient tool-use and reasoning frameworks | โœ… D-CORE ( ICML 2026) | | ๐Ÿ”น **Reproducible Benchmarks** | Standardized eval pipelines for research & production | ๐Ÿ”„ In Progress | --- ## ๐Ÿ”ฅ Latest Updates
๐Ÿ“ฐ Changelog (Click to expand) - **[2026-05]** ๐Ÿš€ **D-CORE** accepted to **ICML 2026** โ†’ Efficient tool-use reasoning via dynamic computation routing [๐Ÿ“„ Paper](https://arxiv.org/abs/2602.02160) | [๐Ÿ’ป Code](./dcore/) | [๐ŸŽฎ Demo](#) - **[2026-03]** ๐ŸŽ‰ **MASQuant** accepted to **CVPR 2026** โ†’ Multimodal LLM PTQ algorithm with SOTA accuracy-efficiency tradeoff [๐Ÿ“„ Paper](https://arxiv.org/abs/2603.04800) | [๐Ÿ’ป Code](./masquant/) - **[2026-01]** ๐Ÿ† **LaRoSa** accepted to **ICML 2025** โ†’ Training-free activation sparsity for LLM acceleration [๐Ÿ“„ Paper](https://arxiv.org/abs/2507.01299) | [๐Ÿ’ป Code](./larosa/)
--- ## ๐Ÿ“ฆ Installation ```bash # Clone the repository git clone https://github.com/alibaba/EfficientAI.git cd EfficientAI # Install dependencies (recommended: use conda) pip install -r requirements.txt # Optional: Install with specific module support # pip install -e ".[larosa]" # for LaRoSa # pip install -e ".[masquant]" # for MASQuant