Stars
Demos of mutation testing and fuzz testing prepared for the Software Testing Course of NJU Software Institute.
A new codebase for popular Scene Graph Generation methods (2020). Visualization & Scene Graph Extraction on custom images/datasets are provided. It's also a PyTorch implementation of paper “Unbiase…
An Autonomous LLM Agent for Complex Task Solving
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World" (Accepted by ICCV 2023)
This repo contains codes and instructions for baselines in the VLUE benchmark.
MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
This is repository of our SIGIR'19 paper Triple-to-Text: Converting RDF Triples into High-Quality Natural Languages via Optimizing an Inverse KL Divergence
The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021)
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Open source annotation tool for machine learning practitioners.
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
dengdan / coco
Forked from cocodataset/cocoapiMS COCO API - http://mscoco.org/
Pytorch implementation for t-SNE with cuda to accelerate
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)
Block shuffling learning for Deepfake Detection
[IJCAI 2020] Official implementation for "DIDFuse: Deep Image Decomposition for Infrared and Visible Image Fusion"
Kernel-based Density Map Generation for Dense Object Counting
RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network
Drone-based Joint Density Map Estimation, Localization and Tracking with Space-Time Multi-Scale Attention Network
A simple yet effective crowd counting and localization network (SCALNet)
Official Code for Context-Aware Crowd Counting. CVPR 2019