moe_dse

This package provides an automated framework to enable efficient FPGA acceleration of MoE models, by finding the hardware implementation of expert computations with low latency, under resource constraints. It runs an algorithm to iteratively search for optimal or feasible solutions with an ILP formulation, successively relaxing constraints in each iteration. The output is the HLS C++ for the selected MoE implementation system for the target FPGA. The detailed steps to run this framework, and associated folders are as follows:

step1: Generate the IP candidates for DSE, and the associated characteric models. Detailed steps and scripts are in folder ./IP.
Step2: Generate expert token distribution for MoE, for DSE and validation. Detailed steps and scripts are in folder ./switch-base-8.
Step3: Conduct DSE, and generate final solution and its associated implmentations. Detailed steps and scripts are in folder ./moe_model.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
IP		IP
moe_model		moe_model
switch-base-8		switch-base-8
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

moe_dse

About

Uh oh!

Releases

Packages

Languages

License

UIUC-ChenLab/moe_dse

Folders and files

Latest commit

History

Repository files navigation

moe_dse

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages