LoserCheems/README.md

Jingze Shi

News: I am looking for an engineering internship in the field of LLMs. If you have any information, don't hesitate to get in touch with me. 📧

Experience 🕐

  • 2022.9-Present Undergraduate Student

Competition Awards 🏆

Publications 📝

  • Trainable Dynamic Mask Sparse Attention [Paper]
  • Concise Reasoning, Big Gains: Pruning Long Reasoning Trace with Difficulty-Aware Prompting [Paper]

Research Direction 🔭

  • Natural Language Processing
  • Large Language Models
  • Small Language Models
  • Foundation Models
  • Deep Reinforcement Learning
  • Highly Efficient Algorithms

Skills ⚒️

  • Natural Language: Simplified Chinese, English
  • Programming Language: C++, Python
  • Typesetting Language: Markdown, LaTeX
  • Programming Framework: PyTorch, Transformers

Pinned

  1. huggingface/transformers Public

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 152k stars · 31k forks

  2. pytorch/pytorch Public

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python · 94.4k stars · 25.7k forks

  3. huggingface/open-r1 Public

    Fully open reproduction of DeepSeek-R1

    Python · 25.6k stars · 2.4k forks

  4. huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python · 16.1k stars · 2.3k forks

  5. SmallDoges/small-doge Public

    Doge Family of Small Language Models

    Python · 181 stars · 12 forks

  6. SmallDoges/flash-dmattn Public

    Trainable fast and memory-efficient sparse attention

    C++ · 429 stars · 36 forks