Skip to main content

Variational Single Image Dehazing for Enhanced Visualization

In this paper, we investigate the challenging task of removing haze from a single natural image. The analysis on the haze formation model shows that the atmospheric veil has much less relevance to chrominance than luminance, which motivates us to neglect the haze in the chrominance channel and concentrate on the luminance channel in the dehazing process. Besides, the experimental study illustrates that the YUV color space is most suitable for image dehazing.

Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks

Video summarization is an important technique to browse, manage and retrieve a large amount of videos efficiently. The main objective of video summarization is to minimize the information loss when selecting a subset of video frames from the original video, hence the summary video can faithfully represent the overall story of the original video. Recently developed unsupervised video summarization approaches are free of requiring tedious annotation on important frames to train a video summarization model and thus are practically attractive.

Convolutional Networks With Channel and STIPs Attention Model for Action Recognition in Videos

With the help of convolutional neural networks (CNNs), video-based human action recognition has made significant progress. CNN features that are spatial and channelwise can provide rich information for powerful image description. However, CNNs lack the ability to process the long-term temporal dependency of an entire video and further cannot well focus on the informative motion regions of actions.

Deep Multi-Scale Context Aware Feature Aggregation for Curved Scene Text Detection

Scene text plays a significant role in image and video understanding, which has made great progress in recent years. Most existing models on text detection in the wild have the assumption that all the texts are surrounded by a rotated rectangle or quadrangle. While there also exist lots of curved texts in the wild, which would not be bounded by a regular bounding box. 

Prediction and Sampling With Local Graph Transforms for Quasi-Lossless Light Field Compression

Graph-based transforms have been shown to be powerful tools in terms of image energy compaction. However, when the size of the support increases to best capture signal dependencies, the computation of the basis functions becomes rapidly untractable. This problem is in particular compelling for high dimensional imaging data such as light fields. The use of local transforms with limited supports is a way to cope with this computational difficulty.

A Two-Stage Approach to Few-Shot Learning for Image Recognition

This paper proposes a multi-layer neural network structure for few-shot image recognition of novel categories. The proposed multi-layer neural network architecture encodes transferable knowledge extracted from a large annotated dataset of base categories. This architecture is then applied to novel categories containing only a few samples.

Explicit Optimization of min max Steganographic Game

This article proposes an algorithm which allows Alice to simulate the game played between her and Eve. Under the condition that the set of detectors that Alice assumes Eve to have is sufficiently rich (e.g. CNNs), and that she has an algorithm enabling to avoid detection by a single classifier (e.g adversarial embedding, gibbs sampler, dynamic STCs), the proposed algorithm converges to an efficient steganographic algorithm.

Estimation of Moisture Content Distribution in Porous Foam Using Microwave Tomography With Neural Networks

The use of microwave tomography (MWT) in an industrial drying process is demonstrated in this feasibility study with synthetic measurement data. The studied imaging modality is applied to estimate the moisture content distribution in a polymer foam during the microwave drying process. Such moisture information is crucial in developing control strategies for controlling the microwave power for selective heating.

Contrast-Medium Anisotropy-Aware Tensor Total Variation Model for Robust Cerebral Perfusion CT Reconstruction With Low-Dose Scans

Perfusion computed tomography (PCT) is critical in detecting cerebral ischemic lesions. PCT examination with lowdose scans can effectively reduce radiation exposure to patients at the cost of degraded images with severe noise, and artifacts. Tensor total variation (TTV) models are powerful tools that can encode the regional continuous structures underlying a PCT object.