Enabling the Next Generation of Smart Devices with Interactive AI - Roland Memisevic, September 2020 - Edge AI and Vision Alliance
© 2020 twentybn
Our mission at TwentyBN: Moving from smart speakers...

- Smart speakers are confined to the simplest of tasks: "how's the weather?"
- Existing "service robots" are essentially chatbots with bodies
End-to-end learning is taking over the world

- 2010: Audio -> Neural network -> Text
- 2012: Pixels -> Neural network -> Objects
- 2014: English -> Neural network -> French
- ...
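The recurring pattern on this slide (raw signal in, a single network, task output out, with no hand-engineered features in between) can be sketched in a few lines. The model below is a made-up toy for illustration, not TwentyBN's architecture:

```python
import numpy as np

rng = np.random.default_rng(0)

def tiny_end_to_end(pixels, n_classes=10):
    """Map flattened raw pixels directly to object-class scores
    through one hidden layer -- the end-to-end idea in miniature."""
    w1 = rng.standard_normal((pixels.size, 64)) * 0.01
    w2 = rng.standard_normal((64, n_classes)) * 0.01
    hidden = np.maximum(pixels.reshape(1, -1) @ w1, 0.0)  # ReLU
    return hidden @ w2                                     # class scores

image = rng.random((32, 32, 3))   # a toy RGB "frame"
scores = tiny_end_to_end(image)
print(scores.shape)               # (1, 10)
```

In a real system the weights would be learned from labeled data rather than drawn at random; the point is only that the mapping from pixels to task output is a single differentiable function.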
Humanoid assistants push end-to-end learning to its limits!

Audio-visual input -> Neural network ->
- Objects
- Actions
- Attributes
- Speech input
- Speech output
- Body control
- ...

The network must:
- Understand the scene
- Understand objects and actions
- Understand human behaviour
- Reason about past and present events
- Understand spoken language
- Generate spoken language
- Link visual concepts to words ("grounding")
- Control the assistant's face
- Control the assistant's body
- Etc.
How do we get the data for this!?

- We built a worldwide "movie studio" that cranks out up to 45k training videos a day
- To catch every imaginable edge case, videos are recorded and annotated by crowdworkers ("crowd acting")
- Videos are automatically recorded, re-coded, reviewed, ...
- The data collection is constantly evolving according to the AI system's current needs: researchers identify which labels the network needs to learn next, and crowdworkers record, verify, and annotate the videos
- ... resulting in >3.5 million videos across thousands of classes and tasks to date
- The network has already learned objects, actions, temporal events, counting, captioning, "behaviours", ...
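The feedback loop described above (researchers pick what the network needs next, crowdworkers supply it) can be sketched as a selection step over per-class performance. The function name, accuracy numbers, and labels below are invented for illustration:

```python
# Hedged sketch of the data-collection loop: pick the classes the
# network currently handles worst, and request new videos for them.
def next_labels_to_collect(per_class_accuracy, k=3):
    """Return the k classes with the lowest validation accuracy."""
    return sorted(per_class_accuracy, key=per_class_accuracy.get)[:k]

# Invented accuracies standing in for a real evaluation run.
accuracy = {"waving": 0.92, "pouring": 0.61, "counting": 0.55, "folding": 0.78}
print(next_labels_to_collect(accuracy, k=2))  # ['counting', 'pouring']
```

Each pass through the loop would retrain on the newly collected videos and re-measure, so the "weakest classes" list keeps changing as the system improves.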
Temporal segmentation?

We let the network figure out what you're doing, and when!
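One way to picture the "what and when" output, assuming the network emits an action label per frame (an assumption, not a description of TwentyBN's actual model): contiguous runs of equal labels become temporal segments. The frame labels below are invented:

```python
# Collapse per-frame action labels into (action, first_frame, last_frame)
# segments -- the "what you're doing, and when" of temporal segmentation.
def segments(frame_labels):
    out, start = [], 0
    for i in range(1, len(frame_labels) + 1):
        if i == len(frame_labels) or frame_labels[i] != frame_labels[start]:
            out.append((frame_labels[start], start, i - 1))
            start = i
    return out

labels = ["idle", "idle", "squat", "squat", "squat", "idle"]
print(segments(labels))
# [('idle', 0, 1), ('squat', 2, 4), ('idle', 5, 5)]
```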
End-to-end learning makes data sourcing hard, but keeps networks simple

Apple devices:
- A11: iPhone 8 and 8 Plus (2017); iPhone X (2017)
- A12: iPhone XS, XS Max, and XR (2018); iPad Air, Mini and Pro (2019)
- A13: iPhone 11 and 11 Pro (2019); iPhone SE (2020)

Android devices (Snapdragon):
- 835: Samsung Galaxy S8 and Note 8 (2017); Google Pixel 2, Xiaomi Mi 6 (2017)
- 845: Samsung Galaxy S9 (2018); Sony Xperia XZ2, Xiaomi Mi 8 (2018)
- 865: Samsung Galaxy S20 (2019); Sony Xperia 1 II, Xiaomi Mi 10 (2019)

All inference can run at 8.5B MAC/sec -> 0.017 TOPS
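The throughput figure checks out if each multiply-accumulate is counted as two operations (one multiply, one add), which is the usual convention when converting MACs to OPS:

```python
# 8.5 billion multiply-accumulates per second, with 1 MAC = 2 ops,
# gives the 0.017 TOPS quoted on the slide.
macs_per_sec = 8.5e9
ops_per_sec = macs_per_sec * 2   # one multiply plus one add per MAC
tops = ops_per_sec / 1e12
print(tops)  # 0.017
```

For scale, that is well under the multi-TOPS budgets of current dedicated NPUs, which is the slide's point: the networks are kept simple enough to run on several-year-old phone chipsets.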
Conclusions

- Personal companions and camera-enabled assistants are moving from science fiction to reality
- They're a gigantic commercial opportunity because they are personalized and sticky: the more you use them, the better they get
- They're also a gigantic research opportunity because they stretch the limits of AI via audio-visual dialogue and grounding
- On-screen companions can capture much of that value, since they allow us to bring end-to-end learning to bear for most of the way
- For more info and access to our developer platform, visit: 20bn.com
Resources

Developer platform
- 20bn.com

Fitness Ally app
- www.fitnessallyapp.com

Embodied AI newsletter
- www.embodiedai.co

The "something something" database for learning common sense features
- arxiv.org/abs/1706.04261

Datasets
- https://20bn.com/products/datasets
Questions?