- Beijing
-
19:07
- 8h ahead - https://www.zhihu.com/people/who-u
-
RLLoggingBoard Public
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
-
verl Public
Forked from volcengine/verlveRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 UpdatedFeb 14, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
-
AgentLife Public
A small open source 3D agent simulator based on LLM.
-
transformers_tasks Public
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
-
rotate_image_classifier Public
识别旋转验证码(如百度)的图片旋转度数,可用于辅助机器通过旋转验证码验证。
-
-
github-readme-stats Public
Forked from anuraghazra/github-readme-stats⚡ Dynamically generated stats for your github readmes
JavaScript MIT License UpdatedJan 18, 2023 -
-
pytorch-seq2seq Public
Forked from bentrevett/pytorch-seq2seqTutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
-
-
autoPlanningAlgorithm Public
auto planning algorithm implemented by python.
-
-
-
ReinforcementLearningSystem Public
🛩 Use Deep Reinforcement Learning Algorithms in a simple scene.
-
-
-
SimpleRulesControlSimulation Public
A simple control simulation based on rule.
-
-
-
A simple Agent Learning System with gRPC.
-
-
-
-
-
NeuralNetworkForSpectrum Public
A project to help people who doesn't know Neural-Network-Programming to train their data easily.
-
MonitorSystem Public
A monitor system implemented by Python on Raspberry Pi(s).
-
awesome-dji-robomaster Public
Forked from open-ai-robot/awesome-dji-robomasterAwesome DJI Robomaster S1
UpdatedOct 24, 2019 -
-
robomaster_s1_can_hack Public
Forked from RoboMasterS1Challenge/robomaster_s1_can_hackDJI RoboMaster S1 CAN Hack
C MIT License UpdatedOct 19, 2019