Experimentation with Video Queries on Driving Datasets

EXPERIMENTATION WITH VIDEO QUERIES ON DRIVING DATASETS (2021)

Developed a configurable implementation for pre-training neural networks using FasterRCNN and similar object detection architectures.
Utilized the comprehensive Berkeley Deep Drive dataset to target specific object classes including vehicles, traffic signs, and pedestrians. The system is designed with scalability in mind, allowing for easy expansion to additional object categories through minimal configuration changes. The demonstration below showcases a sample query execution.

  Sample every 3 seconds and get all timestamps with more than 2 cars, 1 sign and 1 pedestrian.

The Berkeley Deep Drive dataset’s comprehensive nature makes it particularly valuable for autonomous driving research. Future work will explore advanced applications including lane segmentation, steering prediction, and other critical autonomous vehicle technologies. The complete implementation is available in the code repository.