6 July 2021 13:00 - 14:00 PM

Abstract

Deniz Jafari will be presenting “VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text” by Akbari et al.

Biography

Deniz Jafari will be presenting “VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text” by Akbari et al. You can find the paper here: https://arxiv.org/pdf/2104.11178.pdf.

Join Zoom Meeting

https://utoronto.zoom.us/j/82817308412

Meeting ID: 828 1730 8412
One tap mobile
+16465189805,,82817308412# US (New York)
+16465588656,,82817308412# US (New York)

Dial by your location

Meeting ID: 828 1730 8412

Find your local number: https://utoronto.zoom.us/u/kYqCIBAgM

Milos Popovic

Jarrett Churchill

Michael Grace-Dacosta

Computer Vision Journal Club - VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text

Abstract

Biography

Knowledge . Innovation . Talent . Everywhere

Knowledge . Innovation . Talent . Everywhere

Next Event at KITE

Find Us

Media contacts

Who we are

Affiliated with