Useful Links (Tools, Models, Projects, Datasets, Simulators) - feliyur/exercises GitHub Wiki

Models / Paper Code

Name / Link	Description	License
DepthAnything	Monocular depth prediction	Apache 2.0
Patchwork++	Segments the ground in Lidar measurements	GPL-3.0

Tools

Name / Link	Description	License
latexify	Compile Python code into LaTeX formulae	MIT
dacite	Instantiates a dataclass object from a dictionary, including nested dataclasses	MIT
easydict, addict	Attribute dict implementations in Python	LGPL-3.0 and MIT, respectively
Open3D	Library for visualization and manipulation of 3D data.	MIT
Compiler Explorer	Compile and run C++ code online
KDbg	Graphical debugging interface (gdb front end)	GPL-2.0
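
To make the dacite entry above concrete, here is a minimal standard-library sketch of building a nested dataclass from a dictionary. It is not dacite's implementation or API: `from_dict_sketch`, `Camera`, and `Sensor` are hypothetical names made up for illustration, and the sketch assumes field annotations are real types rather than strings.

```python
from dataclasses import dataclass, fields, is_dataclass


def from_dict_sketch(cls, data):
    """Hypothetical helper: build dataclass `cls` from `data`,
    recursing into fields whose annotated type is itself a dataclass."""
    kwargs = {}
    for f in fields(cls):
        if f.name not in data:
            continue  # leave missing keys to the dataclass defaults, if any
        value = data[f.name]
        # Assumption: f.type is an actual class, not a string annotation.
        if is_dataclass(f.type) and isinstance(value, dict):
            value = from_dict_sketch(f.type, value)
        kwargs[f.name] = value
    return cls(**kwargs)


@dataclass
class Camera:  # hypothetical example type
    width: int
    height: int


@dataclass
class Sensor:  # hypothetical example type
    name: str
    camera: Camera


sensor = from_dict_sketch(
    Sensor, {"name": "front", "camera": {"width": 640, "height": 480}}
)
```

Here `sensor.camera` ends up as a `Camera` instance rather than a plain dict. The sketch does not handle lists, optionals, unions, or type checking.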

Datasets

Name / Link	Description
CamVid	Motion-based Segmentation and Recognition Dataset
Make3D	Range image dataset
NYU Depth	Indoor Segmentation and Support Inference from RGBD Images
CityScapes	Semantic Understanding of Urban Street Scenes
KITTI	Outdoor driving datasets / vision benchmark suite. 3D Object Benchmark
Pascal3D+	Large-scale 3D object detection and pose estimation dataset
Dubrovnik6K	Location recognition in urban outdoors
DeepLoc	Large-scale urban outdoor localization dataset
CambridgeLandmarks	Localization dataset collected for PoseNet using a smartphone. Includes images, camera poses and SfM reconstructions.
Cars Dataset	Stanford Cars Dataset. 16,185 images of 196 classes of cars
ModelNet	Collection of 3D CAD models of objects, some annotated with orientation
SUN Dataset	Object detection, scene recognition. Collection of annotated images covering a large variety of environmental scenes, places and the objects within.
BigBIRD	3D Database of Object Instances. 125 objects, images, RGB-D point clouds, pose information and segmentation, reconstructed meshes.
TUM RGB-D Dataset	Kinect data. Color and depth images from a Microsoft Kinect sensor, along with the ground-truth trajectory of the sensor. Indoors.
Matterport3D	Indoor environments, RGB-D, segmentation
Active Vision Dataset	"simulation of motion for object instance recognition in real-world environments" - RGB-D images, bounding boxes.
Objectron	Object-centric video clips with 3D detections and ground truth poses.
ScanNet	RGB-D (indoor) video dataset annotated with 3D camera poses, surface reconstructions, and instance-level semantic segmentations.
SceneNet	Photorealistic synthetic indoor trajectories with ShapeNet objects.
ShapeNet	Richly-annotated, large-scale dataset of 3D shapes.

Oxford RobotCar	RGB, Lidar, Radar
Driving Video Datasets
BDD100K and BDD	Annotated driving videos from Berkeley.

Small Datasets
Alderley Day/Night Dataset	Day/night street videos for the same route, with frame correspondences.

Simulators

Name / Link	Description	Popularity	Accessed Date
AI2-THOR	Photorealistic interactive environments for AI agents: indoors, with physics. Documentation.	1	Nov 2020
VizDoom	Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information	1	2018
OpenAI Gym	A toolkit (and environments) for developing and comparing reinforcement learning algorithms	1	2018
House3D	"Based on" Princeton's SUNCG. Has depth annotation and semantic labels.	1	2018
G2D	Allows collecting images and depth along a specified track in the GTA V environment. Requires buying GTA for ~$40 (e.g. from Steam). Windows only. Similar project: DeepGTAV	1	2018
DeepMind Lab	3D environments; favors speed over realism	1	2018
AirSim	Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research, outdoor	2	2018
ESIM	Event camera simulator. Based on ROS. Input formats are very limited as is: either UnrealCV or a rendering of a scene file (.obj).	1	2018
Carla	Autonomous driving simulator. Reasonably realistic	1	Feb 2019
EuroPilot	Python interface for an autonomous driving simulator based on Euro Truck Simulator 2 (technically, it captures screen output, so it can be used with any game)	1	Feb 2019

Mapping / SLAM

Name / Link	Description	License	Note
OmniMapper	SLAM framework built on top of ROS + GTSAM	MIT-like	From Henrik Christensen's group.
COLMAP	Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline (Schönberger, Pollefeys, Frahm)	BSD (Commercial)
Hydra	Scene graph construction / semantic mapping.	MIT	From Luca Carlone's group