Skip to main content

SPS Blog Article

Deep CNN-Based Channel Estimation Using 3D Channel Correlation

Millimeter wave (mmWave) communications provide a promising solution to meet the proliferating demand for high data rate because of large bandwidth. The current “boomingly” deployed fifth generation communication system (5G) has not actually touched the dominant frequency band of mmWave and thus can hardly enjoy its merit on dramatically boosting transmission rate, which motivates us to conduct research on the ultimate implementation of mmWave communications.

Read more

Coarse-to-Fine CNN for Image Super-Resolution

A coarse-to-fine SR CNN (CFSRCNN) consisting of a stack of feature extraction blocks (FEBs), an enhancement block (EB), a construction block (CB) and, a feature refinement block (FRB) is proposed to learn a robust SR model.

Read more

Deep-learning-based audio-visual speech enhancement

We all experienced the discomfort of communicating with our friends at a cocktail party or in a pub with loud background music. When difficult acoustic scenarios like these occur, we tend to rely on several visual cues, such as lips and mouth movement of the speaker, in order to understand the speech of interest.

Read more

PANNs: Large-scale Pretrained Audio Neural Networks for Audio Pattern Recognition

Audio pattern recognition is an important research topic in the machine learning area, and includes several tasks such as audio tagging, acoustic scene classification, music classification, speech emotion classification and sound event detection. In this blog, we introduce pretrained audio neural networks (PANNs) trained on the large-scale AudioSet dataset. These PANNs are transferred to other audio related tasks. We investigate the performance and computational complexity of PANNs modeled by a variety of convolutional neural networks. We propose an architecture called Wavegram-Logmel-CNN using both log-mel spectrogram and waveform as input feature.

Read more

Frontal-Centers Guided Face: Boosting Face Recognition by Learning Pose-Invariant Features

Recent years, face recognition has made a remarkable breakthrough due to the emergence of deep learning. However, compared with frontal face recognition, many deep face recognition models still suffer serious performance degradation when handling profile faces. To address this issue, we propose a novel Frontal-Centers Guided Loss (FCGFace) to obtain highly discriminative features for face recognition. Most existing discriminative feature learning approaches project features from the same class into a separated latent subspace.

Read more