Initiatives & Trends

You are here

Inside Signal Processing Newsletter Home Page

10 years of news and resources for members of the IEEE Signal Processing Society

Initiatives & Trends

In the mid-1940s, a few brilliant people drew up the basic blueprints of the computer age. They conceived a general-purpose machine based on a processing unit made up of specialized subunits and registers, which operated on stored instructions and data. Later inventions—transistors, integrated circuits, solid-state memory—would supercharge this concept into the greatest tool ever created by humankind. So here we are, with machines that can churn through tens of quadrillions of operations per second. We have voice-recognition enabled assistants in our phones and homes.

The Amazon Lex service is now generally available, announced at the Amazon Web Services (AWS) 2017 San Francisco Summit. Amazon Lex is a fully managed AI service that enables developers to build conversational interfaces into any application using voice and text. It is powered by the same deep learning technologies used in Amazon Alexa.

For our June 2017 issue, we cover recent patents dealing with audio coding.

OpenAI recently announced their latest bot, OpenAI Five, which was able to beat amateur human teams in 5v5 Dota 2 games. This happended in less than a year since last August, when OpenAI bot defeated the world’s best Dota 2 player in a 1v1 game.

Patent no. 9,934,430 presents methods, systems, and computer-readable media related to a technique for providing handwriting input functionality on a user device. A handwriting recognition module is trained to have a repertoire comprising multiple non-overlapping scripts and capable of recognizing tens of thousands of characters using a single handwriting recognition model.

Researchers from The Chinese University of Hong Kong, Tencent, and Johns Hopkins University have developed a tool that automatically adds, removes, or modifies facial features of a person in an image. 

Patent no. 9,959,455 presents a system for facial recognition comprising at least one processor; at least one input operatively connected to the at least one processor; a database configured to store three-dimensional facial image data comprising facial feature coordinates in a predetermined common plane;

Can you segment moving objects from image frames captured by car cameras? The 2018 CVPR workshop on autonomous driving (WAD) is hosting a challenge to enable autonomously driven vehicles to label instances of moving objects such as vehicles and pedestrians.

In patent no 9,852,740 a high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal.

Researchers from UC Berkeley have developed a deep learning model that can automatically transfer your font style. The model is named multi-content generative adversarial network (MC-GAN). 

Pages

SPS on Facebook

SPS on Twitter

  • Not sure of machine learning’s impact on your day-to-day life? This should convince you otherwise:… https://t.co/YC4em95Xon
  • For AI to evolve in B2B sales, data accuracy is critical. https://t.co/JtPkrxgH2a
  • Still not convinced of AI and machine learning’s impact on the world? Here are 27 incredible examples of them in pr… https://t.co/yTDSYBnuyo

SPS Videos


Signal Processing in Home Assistants

 


Multimedia Forensics


Careers in Signal Processing             

 


Under the Radar