Skip to main content

Research Engineer in Speech Technology

The Speech Technology Group of Toshiba Research Europe in Cambridge is looking for exceptional candidates to join our team of researchers, working in automatic speech recognition or statistical dialogue systems. We are looking for candidates with background in signal processing, machine learning, acoustic modelling or expertise in building state-of-the-art systems for ASR or Dialogue. 

PhD student in Wireless Communication

We have an open position for PhD student in area of PHY/MAC layer targeting low latency IoT applications. The research will be carried out in the Communication Systems and Network group at Mid Sweden University.

The applicants should meet following criteria:

- Hold M.Sc. degree in Electrical engineering, Signal Processing, Engineering Mathematics, Computer science, or similar domain

- Excellent written and communication skills

- Strong mathematical background

- Background in wireless communications, signal processing, and optimization is welcome

Competition for preliminary selection of postdoctoral applications

Institute of Electronics and Computer Science (EDI) announces the opening of the competition for preliminary selection of postdoctoral applications for submission to the State Education Development Agency (SEDA) under the Activity 1.1.1.2 “Post-doctoral Research Aid” of the Specific Aid Objective 1.1.1 “To increase the research and innovative capacity of scientific institutions of Latvia and the ability to attract external financing, investing in human resources and infrastructure” of the Operational Programme “Growth and Employment”.

NOMA for Hybrid mmWave Communication Systems With Beamwidth Control

In this paper, we propose a novel non-orthogonal multiple access (NOMA) scheme with beamwidth control for hybrid millimeter wave communication systems and study the resource allocation design to maximize the system energy efficiency. In particular, NOMA transmission allows more than one user to share a single radio frequency chain, which is beneficial to enhance the system energy efficiency. More importantly, the proposed beamwidth control can increase the number of served NOMA groups by widening the beamwidth that can further exploit the energy efficiency gain brought by NOMA.

Delay Guarantee and Effective Capacity of Downlink NOMA Fading Channels

Nonorthogonal multiple access (NOMA) is promising for increasing connectivity and capacity. But there has been little consideration on the quality of service of NOMA; let alone that in generic fading channels. This paper establishes closed-form upper bounds for the delay violation probability of downlink Nakagami- mand Rician NOMA channels, by exploiting stochastic network calculus (SNC).

Delay-Minimization Nonorthogonal Multiple Access Enabled Multi-User Mobile Edge Computation Offloading

The significant advances of cellular systems and mobile Internet services have yielded a variety of computation intensive applications, resulting in great challenge to mobile terminals (MTs) with limited computation resources. Mobile edge computing, which enables MTs to offload their computation tasks to edge servers located at cellular base stations (BSs), has provided a promising approach to address this challenging issue.

A General Framework for Temporal Fair User Scheduling in NOMA Systems

Non-orthogonal multiple access (NOMA) is one of the promising radio access techniques for next generation wireless networks. Opportunistic multi-user scheduling is necessary to fully exploit multiplexing gains in NOMA systems, but compared with traditional scheduling, inter-relations between users’ throughputs induced by multi-user interference poses new challenges in the design of NOMA schedulers. 

Polyphonic Sound Event Detection by Using Capsule Neural Networks

Given the recent surge in developments of deep learning, this paper provides a review of the state-of-the-art deep learning techniques for audio signal processing. Speech, music, and environmental sound processing are considered side-by-side, in order to point out similarities and differences between the domains, highlighting general methods, problems, key references, and potential for cross fertilization between areas.

Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets

We address voice activity detection in acoustic environments of transients and stationary noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech and non-speech audio frames by independently learning their underlying geometric structure. This process is done through a deep encoder-decoder-based neural network architecture.

Deep Learning for Audio Signal Processing

Given the recent surge in developments of deep learning, this paper provides a review of the state-of-the-art deep learning techniques for audio signal processing. Speech, music, and environmental sound processing are considered side-by-side, in order to point out similarities and differences between the domains, highlighting general methods, problems, key references, and potential for cross fertilization between areas.