Iterative Contrast-Classify for Semi-Supervised Temporal Action Segmentation

Dipika Singhania; Rahul Rahaman; Angela Yao

Iterative Contrast-Classify for Semi-Supervised Temporal Action Segmentation

Dipika Singhania, Rahul Rahaman, Angela Yao

[AAAI-22] Main Track

Keywords
Poster Session 1 @ Red 1, Poster Session 11 @ Red 1, Poster Session 1, Poster Session 11

Download Paper

Enter the Virtual Venue

Abstract: Temporal action segmentation classifies the action of each frame in (long) video sequences. Due to the high cost of frame-wise labeling, we propose the first semi-supervised method for temporal action segmentation. Our method hinges on unsupervised representation learning, which, for temporal action segmentation, poses unique challenges. Actions in untrimmed videos vary in length and have unknown labels and start/end times. Ordering of actions across videos may also vary. We propose a novel way to learn frame-wise representations from temporal convolutional networks (TCNs) by clustering input features with added time-proximity condition and multi-resolution similarity. By merging representation learning with conventional supervised learning, we develop an ``Iterative-Contrast-Classify (ICC)'' semi-supervised learning scheme. With more labelled data, ICC progressively improves in performance; ICC semi-supervised learning, with 40% labelled videos, performs similar to fully-supervised counterparts. Our ICC improves MoF by {+1.8, +5.6, +2.5}% on Breakfast, 50Salads and GTEA respectively for 100% labelled videos.

Introduction Video

Sessions where this paper appears

Timezone

Poster Session 1

Thu, February 24 4:45 PM - 6:30 PM (+00:00)

Red 1

Add to Calendar
Apple
Google
iCal File
Microsoft 365
Outlook.com
Yahoo

Poster Session 1
Poster Session 11

Mon, February 28 12:45 AM - 2:30 AM (+00:00)

Red 1

Add to Calendar
Apple
Google
iCal File
Microsoft 365
Outlook.com
Yahoo

Poster Session 11