This is the Vision Engine Team of 360 AI Research, working in the fields of computer vision and multimodal learning.
We focus on "multimodal + cross-modal learning" and "large model + zero/few-shot learning",
conducting research in:
- 🌗 Vision-Language cross-modal learning paper, data
- 🔎 Open-world object detection paper, video, competition
- 📺 Open-vocabulary video analysis seminar
- 🎨 AIGC image & video generation paper, paper, app, code, code
- 🧙 Large multimodal model paper, code, code, code
Internship: we're hiring research interns in the fields of AIGC, LMM, and inference optimization. Check 👉 JD here