image.png

Quick Note

Core Methodology?

  1. Hybrid Attention (Compressed Sparse Attention + Heavily Compressed Attention)
  2. Manifold-Constrained Hyper-Connections (mHC)
  3. Muon Optimizer

Motivation

Strenghts

Core Architecture

Related Works

Relation to My Research

Main research Gap