Leveraging Facial And Speech Recognition To Create Deeper Interaction On The Web
Audio: English. Subtitles: English, Spanish, French (+3) [Machine translation]
5 Hours 10 Mins Course


You can make the connections you create with your users more meaningful and impactful by letting them use their face and voice, rather than just their mouse and keyboard, on your website. By letting users communicate in ways that come more naturally, you’re not only creating a more authentic and memorable experience, but you’re also pulling the user deeper into your narrative.

In this course, we’ll explore what is possible with facial and speech recognition technology on the web. We’ll talk philosophically about when and why you might leverage this type of interaction style, and then dive deep into the tools and techniques that are available. We’ll walk step by step through a fully functional speech, face, and emotion recognition example and break down exactly what’s happening every step of the way.

We’ll cover the broad concepts behind the tools that make this possible and the role each plays in the overall system. Then we’ll examine different strategies for accomplishing the type of analysis we want, along with the pros and cons of each. We’ll review tools like TensorFlow.js, Face++, Watson, Google Speech, and more.

At the end of this class you’ll not only have a thorough understanding of the landscape of web technology for this type of interaction, but you’ll also be able to create your own experiences from start to finish.

This course is for all levels. Beginners, intermediate students, and professionals will be able to follow the step-by-step structure of the course. Parts of the course will involve writing code, so an understanding of web technology fundamentals will help.

The subtitles are automatically generated, so the quality of the captions may vary.
Course Table of contents
What will you learn?

5 sections • 16 lectures • 5 hours 10 mins

  • Introduction
    2 lectures 13:09 Mins
    • Who am I
      05:06 Mins
    • What we’ll be learning
      08:03 Mins
  • The landscape of AI on the web
    3 lectures 26:44 Mins
    • Client-side and Server-side
      08:51 Mins
    • Basics of TensorFlow
      06:15 Mins
    • Overview of available services
      11:38 Mins
  • Setting up the experience
    6 lectures 2 Hours 07 Mins
    • Overview of the demo project
      11:43 Mins
    • Setting Up Structure & Services
      26:08 Mins
    • Building Phases 1 & 2
      15:35 Mins
    • Building & Styling the Gameplay
      25:20 Mins
    • Adding Visual Feedback for the User
      26:51 Mins
    • Creating the "Game Over" State
      21:36 Mins
  • Tapping into AI
    4 lectures 2 Hours 13 Mins
    • Connecting position data with TensorFlow
      24:48 Mins
    • Leveraging Google Speech for Transcription
      33:00 Mins
    • Matching Transcripts to Phrases
      22:56 Mins
    • Reading & Displaying Emotion with Face++
      52:28 Mins
  • Conclusion
    1 lecture 10:46 Mins
    • Key Concepts & Next Steps
      10:46 Mins

CTHDRL has been a partner to since day one. They are the only team I trust to take my studio's creative vision beyond a traditional digital output and deliver truly immersive experiences. Their knowledge base and drive to push the limits of what is possible in digital storytelling is what makes them the best at what they do. If you are interested in leveraging facial and speech recognition as a means to tell compelling digital stories you should definitely dig into this course.
Cam Diamond Founder / Creative Director at
Course details

By cthdrl

$25 (75% off $100, Creative Days offer)

  • English
  • English, Spanish, French, Japanese, Italian, Portuguese (Machine translation)
  • Beginners, Intermediate & Professional
  • Access on mobile and desktop
  • Full time access
  • Certificate of completion
About the Speaker
Learn from the best
  • cthdrl


    CTHDRL Partner & Head of Technology


    John Robson is a Creative Technologist with a passion for creating compelling experiences. Throughout his career, he has led the technology departments of several prestigious creative agencies, building digital platforms for clients like Reebok, Mailchimp, Etsy, Amazon, and many others. John has the unusual combination of a keen eye for nuanced design matched with the ability to translate business objectives into robust architecture and scalable technologies. He is a natural community leader in the technology space, and has a proven ability to spot talent and build teams to create compelling, modern, and beautiful digital creations.
