Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
【三年面试五年模拟】AI算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、强化学习、具身智能、元宇宙、AGI等AI行业面试笔试经验与干货知识。
📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
✨✨Latest Advances on Multimodal Large Language Models
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…
Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
关于domain generalization,domain adaptation,causality,robutness,prompt,optimization,generative model各式各样研究的阅读笔记
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
A curated list of papers, code and resources pertaining to few-shot image generation.
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
A collection of resources on controllable generation with text-to-image diffusion models.
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation (ICCV 2023, Oral)
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"
Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)
Official codebase for the Paper “Retrieval-Augmented Diffusion Models”
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Retrieval augmented diffusion from CompVis.
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"