Organizations

@TiledTensor @FractalTensor

lcy-seso/README.md

Ying 🐰

Interests: Deep Learning, Compilers (e.g., Polyhedral Compilation), HPC, etc.—basically anything related to high-level programming techniques that empower modern LLM algorithm developers. I'm interested in bridging theoretical concepts with practical implementations!

Projects:

  • 🚀 TileFusion is an experimental C++ macro kernel template library that raises the abstraction level of CUDA C for tile processing. The project aims to offer a higher-level interface that enables algorithm developers to innovate hardware-aware LLM algorithms without getting bogged down by low-level hardware details.

  • 🧩 FractalTensor is a programming framework that introduces the concept of FractalTensor—a list of statically shaped tensors arranged in nested lists, associated with advanced functional array compute operators like map, reduce, and scan, as well as array access operators.

    This project involves DSL and IR work, inspired by polyhedral-style loop program analysis. Now that the research paper is complete, I plan to resume work on FractalTensor after the TileFusion project, a side project derived from this research.

  • 🔍 VPTQ is an extreme low-bit quantization algorithm and inference library designed for large language models (LLMs). Developed by my talented friend @YangWang92, this project offers a novel approach to quantizing LLMs. I'm happy to contribute, both to explore my own research interests and to gain hands-on experience with new algorithmic ideas.

My blog posts share ideas that caught my interest in my daily work and capture the lessons I learn along the way. However, updates are infrequent. @haruhi55 is also me in disguise! 🐵✨

📧 Contact Me: [email protected] | [email protected]

Feel free to reach out to me with questions about the projects or to discuss deep learning systems, compiler optimization, or any related topics!

Pinned

  1. microsoft/TileFusion Public

    TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.

    Cuda 77 5

  2. microsoft/FractalTensor Public

    FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of lists of statically-shaped tensors, referred to as a Fractal…

    Python 24 4

  3. microsoft/VPTQ Public

    VPTQ, A Flexible and Extreme low-bit quantization algorithm

    Python 622 42

  4. lcy-seso.github.io Public

    Forked from mmistakes/so-simple-theme

    Ying's blog posts.

    SCSS

  5. DLFrameworkTest Public

    My tests and experiments with some popular DL frameworks.

    Python 12

  6. LearningNotes Public

    Ying's notes

    TeX 7