English | 简体中文

PaddleDetection applied for specific scenarios¶

We provide some models implemented by PaddlePaddle to detect objects in specific scenarios, users can download the models and use them in these scenarios.

Task	Algorithm	Box AP	Download	Configs
Vehicle Detection	YOLOv3	54.5	model	config
Pedestrian Detection	YOLOv3	51.8	model	config

Vehicle Detection¶

One of major applications of vehichle detection is traffic monitoring. In this scenary, vehicles to be detected are mostly captured by the cameras mounted on top of traffic light columns.

1. Network¶

The network for detecting vehicles is YOLOv3, the backbone of which is Dacknet53.

2. Configuration for training¶

PaddleDetection provides users with a configuration file yolov3_darknet.yml to train YOLOv3 on the COCO dataset, compared with this file, we modify some parameters as followed to conduct the training for vehicle detection:

max_iters: 120000
num_classes: 6
anchors: [[8, 9], [10, 23], [19, 15], [23, 33], [40, 25], [54, 50], [101, 80], [139, 145], [253, 224]]
label_smooth: false
nms/nms_top_k: 400
nms/score_threshold: 0.005
milestones: [60000, 80000]
dataset_dir: dataset/vehicle

3. Accuracy¶

The accuracy of the model trained and evaluated on our private data is shown as followed:

AP at IoU=.50:.05:.95 is 0.545.

AP at IoU=.50 is 0.764.

4. Inference¶

Users can employ the model to conduct the inference:

export CUDA_VISIBLE_DEVICES=0
python -u tools/infer.py -c contrib/VehicleDetection/vehicle_yolov3_darknet.yml \
                         -o weights=https://paddlemodels.bj.bcebos.com/object_detection/vehicle_yolov3_darknet.tar \
                         --infer_dir contrib/VehicleDetection/demo \
                         --draw_threshold 0.2 \
                         --output_dir contrib/VehicleDetection/demo/output

Some inference results are visualized below:

../_images/VehicleDetection_001.jpeg

../_images/VehicleDetection_005.png

Pedestrian Detection¶

The main applications of pedetestrian detection include intelligent monitoring. In this scenary, photos of pedetestrians are taken by surveillance cameras in public areas, then pedestrian detection are conducted on these photos.

1. Network¶

The network for detecting vehicles is YOLOv3, the backbone of which is Dacknet53.

2. Configuration for training¶

PaddleDetection provides users with a configuration file yolov3_darknet.yml to train YOLOv3 on the COCO dataset, compared with this file, we modify some parameters as followed to conduct the training for pedestrian detection:

max_iters: 200000
num_classes: 1
snapshot_iter: 5000
milestones: [150000, 180000]
dataset_dir: dataset/pedestrian

3. Accuracy¶

The accuracy of the model trained and evaluted on our private data is shown as followed:

AP at IoU=.50:.05:.95 is 0.518.

AP at IoU=.50 is 0.792.

4. Inference¶

Users can employ the model to conduct the inference:

export CUDA_VISIBLE_DEVICES=0
python -u tools/infer.py -c contrib/PedestrianDetection/pedestrian_yolov3_darknet.yml \
                         -o weights=https://paddlemodels.bj.bcebos.com/object_detection/pedestrian_yolov3_darknet.tar \
                         --infer_dir contrib/PedestrianDetection/demo \
                         --draw_threshold 0.3 \
                         --output_dir contrib/PedestrianDetection/demo/output

Some inference results are visualized below:

../_images/PedestrianDetection_001.png

../_images/PedestrianDetection_004.png