Skip to the content.

The IPN Hand Dataset

“A new benchmark video dataset with sufficient size, variation, and real-world elements able to train and evaluate deep neural networks for continuous Hand Gesture Recognition (HGR)”

The IPN Hand dataset contains more than 4,000 gesture instances and 800,000 frames from 50 subjects. We design 13 static and dynamic gestures for interaction with touchless screens. Compared to other publicly available hand gesture datasets, IPN Hand includes the largest number of continuous gestures per video, and the largest speed of intra-class variation.

The data collection was designed considering real-world issues of continuous HGR, including continuous gestures performed without transitional states, natural movements as non-gesture segments, scenes including clutter backgrounds, extreme illumination conditions, as well as static and dynamic environments. Example of continuous gestures without transitional states:

sample

An introduction video of the dataset can be found here. More details in our ICPR2020 paper.

Details

The subjects from the dataset were asked to record gestures using their own PC keeping the defined resolution and frame rate. Thus, only RGB videos were captured, and the distance between the camera and each subject varies. All videos were recorded in the resolution of 640x480 at 30 fps.

Each subject continuously performed 21 gestures with three random breaks in a single video. We defined 13 gestures to control the pointer and actions focused on the interaction with touchless screens.

Description and statics of each gesture are shown in the next table. Duration is measured in the number of frames (30 frames = 1 s).

id Label Gesture Instances Mean duration (std)  
1 D0X Non-gesture 1431 147 (133)  
2 B0A Pointing with one finger 1010 219 (67)  
3 B0B Pointing with two fingers 1007 224 (69)  
4 G01 Click with one finger 200 56 (29)  
5 G02 Click with two fingers 200 60 (43)  
6 G03 Throw up 200 62 (25)  
7 G04 Throw down 201 65 (28)  
8 G05 Throw left 200 66 (27)  
9 G06 Throw right 200 64 (28)  
10 G07 Open twice 200 76 (31)  
11 G08 Double click with one finger 200 68 (28)  
12 G09 Double click with two fingers 200 70 (30)  
13 G10 Zoom in 200 65 (29)  
14 G11 Zoom out 200 64 (28)  
      All non-gestures: 1431 147 (133)
      All gestures: 4218 140 (94)
      Total: 5649 142 (105)

Video examples of all classes (.GIF) here

Baseline results

Baseline results for isolated and continuous hand gesture recognition of the IPN Hand dataset can be found here.

Downloadable files

Apart from the RGB frames, real-time optical flow and hand segmentation results are also available. The methods used to calculate them are described in our ICPR2020 paper. Examples of the data included in IPN Hand:

rgb of seg

Description of each downloadable file:

Citation

If you find useful the IPN Hand dataset for your research, please cite the paper:

@inproceedings{bega2020IPNhand,
  title={IPN Hand: A Video Dataset and Benchmark for Real-Time Continuous Hand Gesture Recognition},
  author={Benitez-Garcia, Gibran and Olivares-Mercado, Jesus and Sanchez-Perez, Gabriel and Yanai, Keiji},
  booktitle={25th International Conference on Pattern Recognition, {ICPR 2020}, Milan, Italy, Jan 10--15, 2021},
  pages={4340--4347},
  year={2021},
  organization={IEEE}
}

License

The data and annotations in the IPN Hand dataset are licensed under a Creative Commons Attribution 4.0 License.