Scene Graph Generation

113 papers with code • 5 benchmarks • 7 datasets

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Graph Generation

Dataset	Best Model	Compare
Visual Genome	SpeaQ (without reweighting)	See all
4D-OR	ORacle	See all
VRD	FactorizableNet	See all
3R-Scan	SceneGraphFusion	See all
MS-COCO	NeuSyRE	See all

Libraries

Use these libraries to find Scene Graph Generation models and implementations

rafa-cxg/PySGG-cxg

3 papers

suprosanna/relationformer

2 papers

shikorab/SceneGraph

2 papers

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

RLIPv2: Fast Scaling of Relational Language-Image Pre-training

jacobyuan7/rlipv2 • • ICCV 2023

In this paper, we propose RLIPv2, a fast converging model that enables the scaling of relational pre-training to large-scale pseudo-labelled scene graph data.

Paper
Code

Panoptic Video Scene Graph Generation

jingkang50/openpvsg • • CVPR 2023

PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.

Paper
Code

4D Panoptic Scene Graph Generation

Jingkang50/OpenPSG • • NeurIPS 2023

To facilitate research in this new area, we build a richly annotated PSG-4D dataset consisting of 3K RGB-D videos with a total of 1M frames, each of which is labeled with 4D panoptic segmentation masks as well as fine-grained, dynamic scene graphs.

Paper
Code

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

paulgay/VGfM • • 16 Jul 2018

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships.

Paper
Code

Relation Transformer Network

rajatkoner08/rtn • 13 Apr 2020

In this work, we propose a novel transformer formulation for scene graph generation and relation prediction.

Paper
Code

Learning Visual Commonsense for Robust Scene Graph Generation

ZhecanJamesWang/GLAT_SGG • • ECCV 2020

Scene graph generation models understand the scene through object and predicate recognition, but are prone to mistakes due to the challenges of perception in the wild.

Paper
Code

Learning and Reasoning with the Graph Structure Representation in Robotic Surgery

mobarakol/Surgical_SceneGraph_Generation • • 7 Jul 2020

Learning to infer graph representations and performing spatial reasoning in a complex surgical environment can play a vital role in surgical scene understanding in robotic surgery.

Paper
Code