Published March 16, 2023 | Version 1
Dataset Open

EOAD (Egocentric Outdoor Activity Dataset)

  • 1. Dr.
  • 2. Assoc. Prof. Dr.
  • 3. Prof. Dr.

Description

EOAD is a collection of videos captured by wearable cameras, mostly of sports activities. It contains both visual and audio modalities.

It was initiated by the HUJI and FPVSum egocentric activity datasets. However, the number of samples and diversity of activities for HUJI and FPVSum were insufficient. Therefore, we combined these datasets and populated them with new YouTube videos.

The selection of videos was based on the following criteria:

  • The videos should not include text overlays.
  • The videos should contain natural sound (no external music)
  • The actions in videos should be continuous (no cutting the scene or jumping in time)

Video samples were trimmed depending on scene changes for long videos (such as driving, scuba diving, and cycling). As a result, a video may have several clips depicting egocentric actions. Hence, video clips were extracted from carefully defined time intervals within videos. The final dataset includes video clips with a single action and natural audio information.

Statistics for EOAD:

  • 30 activities
  • 303 distinct videos
  • 1392 video clips
  • 2243 minutes labeled videos clips

The detailed statistics for the selected datasets and the crawled videos clips from YouTube are given below:

  • HUJI: 49 distinct videos - 148 video clips for 9 activities (driving, biking, motorcycle, walking, boxing, horse riding, running, skiing, stair climbing)
  • FPVSum: 39 distinct videos - 124 video segments for 8 activities (biking, horse riding, skiing, longboarding, rock climbing, scuba, skateboarding, surfing)
  • YouTube: 216 distinct videos - 1120 video clips for 27 activities (american football, basketball, bungee jumping, driving, go-kart, horse riding, ice hockey, jet ski, kayaking, kitesurfing, longboarding, motorcycle, paintball, paragliding, rafting, rock climbing, rowing, running, sailing, scuba diving, skateboarding, soccer, stair climbing, surfing, tennis, volleyball, walking)

The video clips used for training, validation and test sets for each activity are listed in Table 1. Multiple video clips may belong to a single video because of trimming it for some reasons (i.e., scene cut, temporary overlayed text on videos, or video parts unrelated to activities).

While splitting the dataset, the minimum number of videos for each activity was selected as 8. Additionally, the video samples were divided as 50%, 25%, and 25% for training (minimum four videos), validation (minimum two videos), and testing (minimum two videos), respectively. On the other hand, videos were split according to the raw video footage to prevent the mixing of similar video clips (having the same actors and scenes) into training, validation, and test sets. Therefore, we ensured that the video clips trimmed from the same videos were split together into training, validation, or test sets to satisfy a fair comparison.

Some activities have continuity throughout the video, such as scuba, longboarding, or riding horse, which also have an equal number of video segments with the number of videos. However, some activities, such as skating, occurred in a short time, making the number of video segments higher than the others. As a result, the number of video clips for training, validation, and test sets was highly imbalanced for the selected activities (i.e., jet ski and rafting have 4; however, soccer has 99 video clips for training).

                                                                       Table 1 - Dataset splitting for EOAD

   

Train

 

Validation

 

Test

 
 

Action Label

#Clips

Total Duration

#Clips

Total Duration

#Clips

Total Duration

 

AmericanFootball

34

00:06:09

36

00:05:03

9

00:01:20

 

Basketball

43

01:13:22

19

00:08:13

10

00:28:46

 

Biking

9

01:58:01

6

00:32:22

11

00:36:16

 

Boxing

7

00:24:54

11

00:14:14

5

00:17:30

 

BungeeJumping

7

00:02:22

4

00:01:36

4

00:01:31

 

Driving

19

00:37:23

9

00:24:46

9

00:29:23

 

GoKart

5

00:40:00

3

00:11:46

3

00:19:46

 

Horseback

5

01:15:14

5

01:02:26

2

00:20:38

 

IceHockey

52

00:19:22

46

00:20:34

10

00:36:59

 

Jetski

4

00:23:35

5

00:18:42

6

00:02:43

 

Kayaking

28

00:43:11

22

00:14:23

4

00:11:05

 

Kitesurfing

30

00:21:51

17

00:05:38

6

00:01:32

 

Longboarding

5

00:15:40

4

00:18:03

4

00:09:11

 

Motorcycle

20

00:49:38

21

00:13:53

8

00:20:30

 

Paintball

7

00:33:52

4

00:12:08

4

00:08:52

 

Paragliding

11

00:28:42

4

00:10:16

4

00:19:50

 

Rafting

4

00:15:41

3

00:07:27

3

00:06:13

 

RockClimbing

6

00:49:38

2

00:21:59

2

00:18:50

 

Rowing

5

00:47:05

3

00:13:21

3

00:03:26

 

Running

21

01:21:56

19

00:46:29

11

00:42:59

 

Sailing

7

00:39:30

4

00:14:39

6

00:15:43

 

Scuba

5

00:35:02

3

00:23:43

2

00:18:52

 

Skate

91

00:15:53

30

00:07:01

10

00:02:03

 

Ski

14

01:48:15

17

01:01:59

7

00:39:15

 

Soccer

102

00:48:39

52

00:13:17

16

00:06:54

 

StairClimbing

6

01:05:32

6

00:17:18

5

00:20:22

 

Surfing

23

00:12:51

17

00:06:52

10

00:07:04

 

Tennis

34

00:27:04

9

00:06:03

9

00:03:14

 

Volleyball

87

00:19:14

35

00:07:46

7

00:18:58

 

Walking

49

00:43:02

36

00:38:25

10

00:10:23

Total

30

740

20:22:37

452

09:20:23

200

08:00:08

 

EOAD Code Repository

Scripts for downloading raw videos and trim them in to video clips are provided in this GitHub repository.

Regarding the questions, please contact mali.arabaci@gmail.com.

Files

eoad_layout.png

Files (2.2 MB)

Name Size Download all
md5:aebe3fa92a141d5f3e59778616e826e7
1.9 MB Preview Download
md5:feedc15265dced77c27a08853b754039
230.9 kB Preview Download