UAV images dataset for moving object detection

Overview

PESMOD

The PESMOD (PExels Small Moving Object Detection) dataset consists of high-resolution aerial images in which moving objects are labelled manually. The aim of this work is to provide a different and challenging dataset for evaluating moving object detection methods. Each moving object is labelled in every frame, using the PASCAL VOC format in an XML file. The dataset consists of 8 sequences, detailed below.

Sequence name       Number of frames    Number of moving objects
Elliot-road         664                 3416
Miksanskiy          729                 189
Shuraev-trekking    400                 800
Welton              470                 1129
Marian              622                 2791
Grisha-snow         115                 1150
Zaborski            582                 3290
Wolfgang            525                 1069
Total               4107                13834
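
Because the labels are stored as PASCAL VOC XML, they can be read with Python's standard library alone. The snippet below is a minimal sketch, not part of the repository: the sample annotation, its file name, image size, and class names ("person", "car") are hypothetical placeholders that only follow the usual PASCAL VOC layout (an annotation element containing object elements, each with a name and a bndbox).

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the usual PASCAL VOC layout.
# File name, image size, and class names are illustrative assumptions.
SAMPLE_XML = """<annotation>
  <filename>frame0001.jpg</filename>
  <size><width>1920</width><height>1080</height><depth>3</depth></size>
  <object>
    <name>person</name>
    <bndbox><xmin>100</xmin><ymin>200</ymin><xmax>130</xmax><ymax>260</ymax></bndbox>
  </object>
  <object>
    <name>car</name>
    <bndbox><xmin>640</xmin><ymin>360</ymin><xmax>700</xmax><ymax>400</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc_boxes(xml_text):
    """Return a list of (name, xmin, ymin, xmax, ymax) tuples from VOC XML text."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        if bndbox is None:
            continue  # skip objects without a bounding box
        name = obj.findtext("name", default="")
        # Coordinates are sometimes written as floats, so convert via float first.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((name, *coords))
    return boxes


boxes = parse_voc_boxes(SAMPLE_XML)
for name, xmin, ymin, xmax, ymax in boxes:
    print(name, xmin, ymin, xmax, ymax)
```

To read a label file from disk instead of a string, `ET.parse(path).getroot()` can replace `ET.fromstring(xml_text)`.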

Download

Click here to download the dataset

Citing PESMOD Dataset

If you find this dataset or the method proposed in the paper useful in your work, please cite the paper:

Preprint paper on arXiv

Contributions

If you find any mistakes in the labels, you can report them in the issues section.

Viewing the dataset, and building and running the performance code to evaluate your own method with foreground masks

To view the dataset after downloading:

python view-dataset.py --path "/home/ibrahim/PESMOD/Pexels-Welton/"

Build the performance code with the following commands:

cd performance
mkdir build
cd build
cmake ..
make

Run with (-d for dataset main folder, -m for masks main folder, -f for sequence name, -o if you apply morphological opening):

./performance -d "/home/ibrahim/PESMOD/" -m "/home/ibrahim/SCBU-PESMOD-results/" -f "Pexels-Marian"
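
The -o option refers to morphological opening, which is an erosion followed by a dilation and is commonly used to remove small isolated false positives from a foreground mask. The snippet below is a minimal pure-Python sketch of that operation, assuming a 3x3 square structuring element and treating pixels outside the image as background. It only illustrates the idea; it is not the repository's implementation, and the kernel size and border handling used by the performance code may differ.

```python
# Offsets of a 3x3 square structuring element (an assumption for this sketch).
OFFSETS = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]


def _neighbours(mask, y, x):
    """Yield the 3x3 neighbourhood of (y, x); out-of-image pixels count as 0."""
    h, w = len(mask), len(mask[0])
    for dy, dx in OFFSETS:
        ny, nx = y + dy, x + dx
        yield mask[ny][nx] if 0 <= ny < h and 0 <= nx < w else 0


def erode(mask):
    """A pixel stays foreground only if its whole neighbourhood is foreground."""
    return [[int(all(_neighbours(mask, y, x))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def dilate(mask):
    """A pixel becomes foreground if any pixel in its neighbourhood is foreground."""
    return [[int(any(_neighbours(mask, y, x))) for x in range(len(mask[0]))]
            for y in range(len(mask))]


def opening(mask):
    """Morphological opening: erosion followed by dilation."""
    return dilate(erode(mask))


# Toy foreground mask: one 3x3 blob plus a single isolated noise pixel.
mask = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0, 0],
    [0, 1, 1, 1, 0, 1, 0],
    [0, 1, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
]

opened = opening(mask)
for row in opened:
    print(row)
```

In this toy mask the 3x3 blob survives the opening while the isolated pixel is removed. Note that opening also removes any real object smaller than the structuring element, which matters for a dataset of small moving objects.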

Dataset sample frames

Example frames from each sequence in the dataset

Issues
  • Can I get the tracked libtorch model file?

    This is a very rare dataset! May I ask a question: as indicated in [line 53](https://github.com/mribrahim/PESMOD/blob/5972406505fd0dc42af1df8b68f9a5e9fdceaac6/src/main.cpp#L53) of src/main.cpp, where can I download the *.pt model file loaded by libtorch? Do you have any plans to publish the weights of the pre-trained model?

    opened by Backlory 1
Owner
İbrahim Delibaşoğlu