Deep Learning for Visual Computing

A Geometry-Sensitive Approach for Photographic Style Classification

Abstract: Photographs are characterized by different compositional attributes such as the Rule of Thirds, depth of field, and vanishing lines. The presence or absence of one or more of these attributes contributes ...
Aesthetic Image Captioning from Weakly-Labelled Photographs

Abstract: Aesthetic image captioning (AIC) refers to the multi-modal task of generating critical textual feedback for photographs. While in natural image captioning (NIC), deep models are trained in an end-to-end ...
AlphaGAN: Generative adversarial networks for natural image matting

Abstract: We present the first generative adversarial network (GAN) for natural image matting. Our novel generator network is trained to predict visually appealing alphas with the addition of the adversarial ...
ColorNet: Estimating colorfulness in natural images

This page contains related materials for learning-based colorfulness estimation. Abstract: Measuring the colorfulness of a natural or virtual scene is critical for many applications in the image processing field, ranging from ...
Deep Convolutional Neural Networks for estimating lens distortion parameters

Abstract: In this paper, we present a convolutional neural network (CNN) to predict multiple lens distortion parameters from a single input image. Unlike other methods, our network is suitable to ...
Deep Normal Estimation for Automatic Shading of Hand-Drawn Characters

In this paper we present a new fully automatic pipeline for generating shading effects on hand-drawn characters. Our method takes as input a single digitized sketch of any resolution and ...
Deep Tone Mapping Operator for High Dynamic Range Images

Abstract: A computationally fast tone mapping operator (TMO) that can quickly adapt to a wide spectrum of high dynamic range (HDR) content is essential for visualization on varied low dynamic ...
DeepStereoBrush: Interactive Depth Map Creation

In this paper, we introduce a novel interactive depth map creation approach for image sequences which uses depth scribbles as input at user-defined keyframes. These scribbled depth values are then ...
DublinCity Dataset

Please click the image or here DublinCity Dataset to access the dataset! ...
Egocentric Gesture Recognition for Head-Mounted AR devices

Natural interaction with virtual objects in AR/VR environments makes for a smooth user experience. Gestures are a natural extension from the real world to augmented space to achieve these interactions. Finding ...
SalNet360: Saliency Maps for omni-directional images with CNN

Abstract: The prediction of visual attention data from any kind of media is valuable to content creators and can be used to drive encoding algorithms efficiently. With the current trend ...
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition

Ego hand gestures can be used as an interface in AR and VR environments. While the context of an image is important for tasks like scene understanding, object recognition, image ...
STaDA: Style Transfer as Data Augmentation

Abstract: The success of training deep Convolutional Neural Networks (CNNs) heavily depends on a significant amount of labelled data. Recent research has found that neural style transfer algorithms can apply ...
Sub-pixel Back-projection Network for Fast Single Image Super-resolution

Abstract, results, citation, and downloads will be available soon. Acknowledgement: This publication has emanated from research conducted with ...
Super-resolution of Omnidirectional Images Using Adversarial Learning

Abstract: An omnidirectional image (ODI) enables viewers to look in every direction from a fixed point through a head-mounted display, providing an immersive experience compared to that of a standard ...
Towards generating ambisonics using audio-visual cue for virtual reality

Abstract: Ambisonics, i.e., full-sphere surround sound, is essential alongside 360-degree visual content to provide a realistic virtual reality (VR) experience. While 360-degree visual content capture gained a tremendous boost ...
Using LSTM for Automatic Classification of Human Motion Capture Data

Creative studios tend to produce an overwhelming amount of content every day, and being able to manage these data and reuse them in new productions represents a way of reducing costs ...