Leddar PixSet Dataset

The First Full-Waveform Flash LiDAR Dataset for Autonomous Vehicle R&D

Leddar PixSet is a new, publicly available dataset for autonomous driving research and development that contains data from a full AV sensor suite (cameras, LiDARs, radar, IMU), and includes full-waveform data from the Leddar Pixell, a 3D solid-state flash LiDAR sensor. PixSet provides an opportunity for 3D computer vision to go beyond usual LiDAR point clouds with a full-waveform LiDAR dataset.

License agreement Access the dataset Request a commercial license

Key Features

Includes full-waveform data from 3D solid-state flash LiDAR
Free access provided for academic and research purposes
29k frames in 97 sequences, with more than 1.3 M 3D boxes annotated
Data from an autonomous vehicle’s comprehensive sensor suite
Various environments, weather conditions and times of day
Open-source API and dataset viewer

PixSet Dataset Overview

To reach the highest levels of autonomy, one of the main challenges faced in AV development is to leverage the data from multiple types of sensors, each of which has its own strengths and weaknesses. Sensor fusion techniques are widely used to improve the performance and robustness of computer vision algorithms. Datasets such as PixSet allow research and engineering teams to use existing sets of sensor data to test and develop AV software and to run simulations, all without the need to assemble their own sensor suites and collect their own dataset.

The PixSet dataset contains 97 sequences for a total of roughly 29k frames using the AV sensor suite. Each frame has been manually annotated with 3D bounding boxes. The sequences have been gathered in various environments and climatic conditions with an instrumented vehicle. (See picture)

Recorded in high-density Canadian urban areas, the scenes take place in urban and suburban environments as well as on the highway, in various weather (e.g., sunny, cloudy, rainy) and illumination (e.g., day, night, twilight) conditions, providing a wide variety of situations with real-world data for autonomous driving.

Leddar Car sensor fusion data capture with LiDAR Camera and Radar

Sensor Suite

What makes this new dataset unique is the use of a flash LiDAR with a field of view of 180° horizontally and 16° vertically and the inclusion of its full-waveform raw data, in addition to the usual LiDAR point cloud data.

The sensors used to collect the dataset are listed below. Mounted on a car, the cameras, LiDARs and radar are positioned in close proximity to each other at the front of the car in order to minimize the parallax effect. The GPS antennas for the inertial measurement unit (IMU) are located on the top of the vehicle.

Leddar Pixell solid-state flash LiDAR (1 x)
Mechanical scanning LiDAR, 64 lines (1 x)
Cameras with 90° optics (3 x)
Camera with ImmerVision panomorph 180° optics (1 x)
77 to 79 GHz FMCW radar (1 x)
IMU fused with dual GPS RTK antenna

PointPillars was implemented on PixSet and the results are available here with common metrics.

Dataset White Paper

To learn more about the Leddar PixSet dataset, download the white paper.

Download the White Paper

When citing or referencing this document, please include the following information:
@misc{déziel2021pixset,
title={PixSet : An Opportunity for 3D Computer Vision to Go Beyond Point Clouds With a Full-Waveform LiDAR Dataset},
author={Jean-Luc Déziel and Pierre Merriaux and Francis Tremblay and Dave Lessard and Dominique Plourde and Julien Stanguennec and Pierre Goulet and Pierre Olivier},
year={2021},
eprint={2102.12010},
archivePrefix={arXiv},
primaryClass={cs.RO}

PixSet Sample Data

Images below provide an overview of the dataset’s variety of scenes and environmental conditions, represented with samples from the cameras (left) and the 64-line LiDAR (right) with 3D boxes.

Below is a sample image taken from the dataset which displays the camera views, the solid-state LiDAR data and the object detection boxes with annotations.

About the Dataset

The dataset contains 97 sequences and cumulates more than 29k frames recorded in different lighting and weather conditions.

The dataset contains 97 sequences, and cumulates more than 29k frames recorded in different lighting and weather conditions.

Particular attention was paid to the synchronization and triggering of the different sensors. This allows the sampling time for the various sensors and portions of the scene to be consistent, minimizing inconsistencies with dynamic objects.

The coordinates for the annotated 3D boxes are provided in the Pixell referential but can be easily re-projected into any other sensor’s referential with the supplied calibration matrices and API.

Object Classes and Labels

22 classes of objects are defined in the dataset (click on the arrow to view detailed description).

Each annotated object has a unique ID which is maintained across frames, allowing development and benchmarking of tracking algorithms. Furthermore, for each object additional attributes are provided, as listed below.

All of the object boxes have a constant size for the duration of the sequence, except for pedestrians, who form a special case since the shape of the pedestrian can vary from frame to frame. Pedestrian members’ position (arms and legs) affects the size of the bounding box. Hence, making the size of the bounding box variable solves this problem and can provide better accuracy for the training and inference.

Pedestrian

Cyclist

Bicycle

Bicycle rack

Truck

Trailer

Train

Motorcyclist

Motorcycle

Construction vehicle

Unclassified vehicle

Stop sign

Traffic light

Traffic sign

Traffic cone

Fire hydrant

Animal

Barrier

Obstacle

TEXT

Furthermore, for each object additional attributes are provided as follows (click on the arrow to view detailed description).

All of the object boxes have a constant size for the duration of the sequence, except for pedestrians, who form a special case since the shape of the pedestrian can vary from frame to frame. Pedestrian members’ position (arms and legs) affects the size of the bounding box. This problem is solved by making the size of the bounding box variable, which can provide better accuracy for the training and inference.

Tracking ID

Occlusion

Truncation

Human activity

Vehicle activity

On the road

Number of points

Details

Open-Source DAS API and Viewer

The open-source API provides easy access to datasets. Many usual methods of algorithm development are provided: sensor synchronization or interpolation, LiDAR ego-motion compensation, data projection in specific referential (any sensor or world), Leddar Pixell point cloud or quad cloud projection, waveforms alignment, annotation management (2-3D boxes and segmentation) and more.

You can install the API through pip install pioneer-das-api or clone the project and contribute.

Based on this API, we also provide an open-source dataset viewer: pip install pioneer-das-view.

Terms of Use

Please read the Public Dataset License Agreement. The datasets are provided for non-commercial purposes, which means that they can be used for research, teaching, scientific publication and personal experimentation. For commercial use of the datasets, which means for a purpose primarily intended for or directed towards commercial advantage or monetary compensation, please contact a LeddarTech representative.

Leddar PixSet