Useful Links (Tools, Models, Projects, Datasets, Simulators) - feliyur/exercises GitHub Wiki
Models / Paper Code
Name / Link | Description | License |
---|---|---|
DepthAnything | Monocular depth prediction | Apache 2.0 |
Patchwork++ | Segment ground in Lidar measurement | GPL-3.0 |
Tools
Name / Link | Description | License |
---|---|---|
latexify | Compile python code into latex formulae | MIT |
dacite | Instantiates a dataclass object using a dictionary (apparently, recursively) | MIT |
easydict, addict | Attribute dict implementations in python | LGPL-v3.0, MIT respectively |
Open3D | Library for visualization and manipulation of 3D data. | MIT |
Compiler Explorer | Compile and run C++ code online | |
KDbg | graphical debugging interface | GPL-2.0 |
Datasets
Name \ Link | Description |
---|---|
CamVid | Motion-based Segmentation and Recognition Dataset |
Make3D | Range image dataset |
NYU Depth | Indoor Segmentation and Support Inference from RGBD Images |
CityScapes | Semantic Understanding of Urban Street Scenes |
KITTY | Outdoor driving datasets / vision benchmark suite. 3D Object Benchmark |
Pascal3D+ | Massive 3D Object detection and pose estimation |
Dubrovnik6K | Location recognition in urban outdoors |
DeepLoc | large-scale urban outdoor localization dataset |
CambridgeLandmarks | Localization, collected for PoseNet, using smartphone. Includes images, camera poses and Sfm reconstructions. |
Cars Dataset | Stanford Cars Dataset. 16,185 images of 196 classes of cars |
ModelNet | collection of 3D CAD models for objects, some annotated with orientation |
SUN Dataset | Object detection, scene recognition. Collection of annotated images covering a large variety of environmental scenes, places and the objects within. |
BigBIRD | 3D Database of Object Instances. 125 objects, images, RGB-D point clouds, pose information and segmentation, reconstructed meshes. |
TUM RGB-D Dataset | Kinect data. Color and depth images of a Microsoft Kinect sensor along the ground-truth trajectory of the sensor. Indoors. |
Matterport3D | Indoor environments, RGB-D, segmentation |
Active Vision Dataset | "simulation of motion for object instance recognition in real-world environments" - RGB-D images, bounding boxes. |
Objectron | Object-centric video clips with 3d detections and ground truth poses. |
ScanNet | RGB-D (indoor) video dataset annotated with 3D camera poses, surface reconstructions, and instance-level semantic segmentations. |
SceneNet | Photorealistic synthethic indoor trajectories with ShapeNet objects. |
ShapeNet | Richly-annotated, large-scale dataset of 3D shapes. |
Oxford RobotCar | RGB, Lidar, Radar |
Driving Video Datasets | |
BDD100K and BDD | Annotated driving videos from Berkeley. |
Small Datasets | |
Alderley Day/Night Dataset | Day/night street videos for the same route, with frame correspondences. |
Simulators
Name \ Link | Description | Popularity | Accessed Date |
---|---|---|---|
AI2-THOR | Photorealistic Interactive Environments for AI Agents, indoors, photorealistic, interactive, physics. Documentation. | 1 | Nov 2020 |
VizDoom | Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information | 1 | 2018 |
OpenAI Gym | A toolkit (and environments) for developing and comparing reinforcement learning algorithms | 1 | 2018 |
House3D | "Based on" Princeton's SUNCG. Has depth annotation and semantic labels. | 1 | 2018 |
G2D | Allows collecting images and depth along a specified track in GTA V environment. Requires buying GTA for ~40$ (e.g. from STEAM). Windows only. Similar project DeepGTAV | 1 | 2018 |
DeepMind Lab | 3D environments, prefer speed over realism | 1 | 2018 |
AirSim | Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research, outdoor | 2 | 2018 |
ESIM | Event camera simulator. Based on ROS. Very limited in input format as is, input either from UnrealCV or rendering of scene file (.obj). | 1 | 2018 |
Carla | Autonomous driving simulator. Reasonably realistic | 1 | Feb 2019 |
EuroPilot | Python interface for autonomous driving simulator, based on Euro Truck 2 (technically, captures screent output, so can be used with any game) | 1 | Feb 2019 |
Mapping / SLAM
Name \ Link | Description | License | Note |
---|---|---|---|
OmniMapper | SLAM framework build on top of ROS + gtsam | MIT-like | From Henrik Christensen's group. |
COLMAP | Structure-from-Motion (SfM) and Multi-View Stereo (MVS) pipeline (Schönberger, Pollefeys, Frahm) | BSD (Commercial) | |
Hydra | Scene graph construction / semantic mapping. | MIT | From Luca Carlone's group |