﻿<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>EURASIP Journal on Image and Video Processing</title><link>http://www.hindawi.com</link><description>The latest articles from Hindawi Publishing Corporation</description><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright><item><title>Robust Real-Time 3D Object Tracking with  Interfering Background Visual Projections</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/638073</link><description>This paper presents a robust real-time object tracking system for human computer interaction in mediated environments with interfering visual projection in the background. Two major
contributions are made in our research to achieve robust object tracking. A reliable outlier rejection algorithm is developed using the epipolar and homography constraints to remove false candidates caused by interfering background projections and mismatches between cameras. To reliably integrate
multiple estimates of the 3D object positions, an efficient fusion algorithm based on mean shift is used. This fusion algorithm can also reduce tracking errors caused by partial occlusion of the object in some of the camera views. Experimental results obtained in real life scenarios demonstrate that the proposed system is able to achieve decent 3D object tracking performance in the presence of interfering background
visual projection.</description><Author>Huan Jin and Gang Qian</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/246309</link><description>Simultaneous tracking of multiple persons in real-world environments is an active research field and several approaches have been proposed, based on a variety of features and algorithms. Recently, there has been a growing interest in organizing systematic evaluations to compare the various techniques. Unfortunately, the lack of common metrics for measuring the performance of multiple object trackers still makes it hard to compare their results. In this work, we introduce two intuitive and general metrics to allow for objective comparison of tracker characteristics, focusing on their precision in estimating object locations, their accuracy in recognizing object configurations and their ability to consistently label objects over time. These metrics have been extensively used in two large-scale international evaluations, the 2006 and 2007 CLEAR evaluations, to measure and compare the performance of multiple object trackers for a wide variety of tracking tasks. Selected performance results are presented and the advantages and drawbacks of the presented metrics are discussed based on the experience gained during the evaluations.</description><Author>Keni Bernardin and Rainer Stiefelhagen</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Track and Cut: Simultaneous Tracking and Segmentation of Multiple Objects with Graph Cuts</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/317278</link><description>This paper presents a new method to both track and segment multiple objects in videos
using min-cut/max-flow optimizations. We introduce objective functions that combine low-level
pixel wise measures (color, motion), high-level observations obtained via an independent
detection module, motion prediction, and contrast-sensitive contextual regularization.
One novelty is that external observations are used without adding any association step. The
observations are image regions (pixel sets) that can be provided by any kind of detector. The
minimization of appropriate cost functions simultaneously allows &amp;#8220;detection-before-track&amp;#8221;
tracking (track-to-observation assignment and automatic initialization of new tracks) and
segmentation of tracked objects. When several tracked objects get mixed up by the detection
module (e.g., a single foreground detection mask is obtained for several objects close
to each other), a second stage of minimization allows the proper tracking and segmentation
of these individual entities despite the confusion of the external detection module.</description><Author>Aur&amp;#233;lie Bugeau and Patrick P&amp;#233;rez</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Monocular 3D Tracking of Articulated Human Motion in Silhouette and Pose Manifolds</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/326896</link><description>This paper presents a robust computational framework for monocular 3D tracking of human movement. The main innovation of the proposed framework is to explore the underlying data structures of the body silhouette and pose spaces by constructing low-dimensional silhouettes and poses manifolds, establishing intermanifold mappings, and performing tracking in such manifolds using a particle filter. In addition, a novel vectorized silhouette descriptor is introduced to achieve low-dimensional, noise-resilient silhouette representation. The proposed articulated motion tracker is view-independent, self-initializing, and capable of maintaining multiple kinematic trajectories. By using the learned mapping from the silhouette manifold to the pose manifold, particle sampling is informed by the current image observation, resulting in improved sample efficiency. Decent tracking results have been obtained using synthetic and real videos.</description><Author>Feng Guo and Gang Qian</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Anthropocentric Video Analysis: Tools and Applications</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/676094</link><description /><Author>Nikos Nikolaidis, Maja Pantic, and Ioannis Pitas</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Optimal Features Subset Selection and Classification for Iris Recognition</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/743103</link><description>The selection of the optimal features subset and the classification have become an important issue in the field of iris recognition. We propose a feature selection scheme based on the multiobjectives genetic algorithm (MOGA) to improve the recognition accuracy and asymmetrical support vector machine for the classification of iris patterns. We also suggest a segmentation scheme based on the collarette area localization. The deterministic feature sequence is extracted from the iris images using the 1D log-Gabor wavelet technique, and the extracted feature sequence is used to train the support vector machine (SVM). The MOGA is applied to optimize the features sequence and to increase the overall performance based on the matching accuracy of the SVM. The parameters of SVM are optimized to improve the overall generalization performance, and the traditional SVM is modified to an asymmetrical SVM to treat the false accept and false reject cases differently and to handle the unbalanced data of a specific class with respect to the other classes. Our experimental results indicate that the performance of SVM as a classifier is better than the performance of the classifiers based on the feedforward neural network, the k-nearest neighbor, and the Hamming and the Mahalanobis distances. The proposed technique is computationally effective with recognition rates of 99.81&amp;#37; and 96.43&amp;#37; on CASIA and ICE datasets, respectively.</description><Author>Kaushik Roy and Prabir Bhattacharya</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Optimization-Based Image Segmentation by Genetic Algorithms</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/842029</link><description>Many works in the literature focus on the definition of evaluation metrics and criteria that enable to quantify the performance of an image processing algorithm. These evaluation criteria can be used to define new image processing algorithms by optimizing them. In this paper, we propose a general scheme to segment images by a genetic algorithm. The developed method uses an evaluation criterion which quantifies the quality of an image segmentation result. The proposed segmentation method can integrate a local ground truth when it is available in order to set the desired level of precision of the final result. A genetic algorithm is then used in order to determine the best combination of information extracted by the selected criterion. Then, we show that this approach can either be applied for gray-levels or multicomponents images in a supervised context or in an unsupervised one. Last, we show the efficiency of the proposed method through some experimental results on several gray-levels and multicomponents images.</description><Author>S. Chabrier, C. Rosenberger, B. Emile, and H. Laurent</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Color in Image and Video Processing</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/356781</link><description /><Author>Alain Tr&amp;#233;meau, Shoji Tominaga, and Konstantinos N. Plataniotis</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Color in Image and Video Processing: Most Recent Trends and Future Research Directions</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/581371</link><description>The motivation of this paper is to provide an overview of the most recent trends and of the future research directions in color image and video processing. Rather than covering all aspects of the domain this survey covers issues related to the most active research areas in the last two years. It presents the most recent trends as well as the state-of-the-art, with a broad survey of the relevant literature, in the main active research areas in color imaging. It also focuses on the most promising research areas in color imaging science. This survey gives an overview about the issues, controversies, and problems of color image science. It focuses on human color vision, perception, and interpretation. It focuses also on acquisition systems, consumer imaging applications, and medical imaging applications. Next it gives a brief overview about the solutions, recommendations, most recent trends, and future trends of color image science. It focuses on color space, appearance models, color difference metrics, and color saliency. It focuses also on color features, color-based object tracking, scene illuminant estimation and color constancy, quality assessment and fidelity assessment, color characterization and calibration of a display device. It focuses on quantization, filtering and enhancement, segmentation, coding and compression, watermarking, and lastly on multispectral color image processing. Lastly, it addresses the research areas which still need addressing and which are the next and future perspectives of color in image and video processing.</description><Author>Alain Tr&amp;#233;meau, Shoji Tominaga, and Konstantinos N. Plataniotis</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Three-Dimensional SPIHT Coding of Volume Images with Random Access and Resolution Scalability</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/248905</link><description>End users of large volume image datasets are often interested only in certain features that can be identified as quickly as possible. For hyperspectral data, these features could reside only in certain ranges of spectral bands and certain spatial areas of the target. The same holds true for volume medical images for a certain volume region of the subject&amp;#39;s anatomy. High spatial resolution may be the ultimate requirement, but in many cases a lower resolution would suffice, especially when rapid acquisition and browsing are essential. This paper presents a major extension of the 3D-SPIHT (set partitioning in hierarchical trees) image compression algorithm that enables random access decoding of any specified region of the image volume at a given spatial resolution and given bit rate from a single codestream. Final spatial and spectral (or axial) resolutions are chosen independently. Because the image wavelet transform is encoded in tree blocks and the bit rates of these tree blocks are minimized through a rate-distortion optimization procedure, the various resolutions and qualities of the images can be extracted while reading a minimum amount of bits from the coded data. The attributes and efficiency of this 3D-SPIHT extension are demonstrated for several medical and hyperspectral images in comparison to the JPEG2000 Multicomponent algorithm.</description><Author>Emmanuel Christophe and William A. Pearlman</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Fuzzy Mode Enhancement and Detection for Color  Image Segmentation</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/542378</link><description>This work lies within the scope of color image segmentation by pixel classification. The classes of pixels are constructed by detecting the modes of the spatial-color compactness function, which characterizes the image by taking into account both the distribution of colors in the color space and their spatial location in the image plane. A fuzzy transformation of this function is performed, based on fuzzy morphological operators specifically designed for mode detection. Experimental segmentation results, using several synthetic and benchmark images, show the interest of the proposed method.</description><Author>Olivier Losson, Claudine Botte-Lecocq, and Ludovic Macaire</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Image and Video Processing for Disability</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2007/54743</link><description /><Author>Alice Caplier, Thierry Pun, and Dimitrios Tzovaras</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>A Color Topographic Map Based on the Dichromatic Reflectance Model</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/824195</link><description>Topographic maps are an interesting alternative to edge-based techniques common in computer vision applications. Indeed, unlike edges, level lines are closed and less sensitive to external parameters. They provide a compact geometrical representation of images and they are, to some extent, robust to contrast changes. The aim of this paper is to propose a novel and vectorial representation of color topographic maps. In contrast with existing color topographic maps, it does not require any color conversion. For this purpose, our technique refers to the dichromatic reflectance model, which explains the distribution of colors as the mixture of two reflectance components, related either to the body or to the specular reflection. Thus, instead of defining the topographic map along the sole luminance direction in the RGB space, we propose to design color lines along each dominant color vector, from the body reflection. Experimental results show that this approach provides a better tradeoff between the compactness and the quality of a topographic map.</description><Author>Mich&amp;#232;le Gouiff&amp;#232;s and Bertrand Zavidovique</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Integrated Detection, Tracking, and Recognition of Faces with Omnivideo Array in Intelligent Environments</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/374528</link><description>We present a multilevel system architecture for intelligent environments equipped with omnivideo arrays.  In order to gain unobtrusive human awareness, real-time 3D human tracking as well as robust video-based face detection and tracking and face recognition algorithms are needed.  We first propose a multiprimitive face detection and tracking loop to crop face videos as the front end of our face recognition algorithm.  Both skin-tone and elliptical detections are used for robust face searching, and view-based face classification is applied to the candidates before updating the Kalman filters for face tracking.  For video-based face recognition, we propose three decision rules on the facial video segments.  The majority rule and discrete HMM (DHMM) rule accumulate single-frame face recognition results, while continuous density HMM (CDHMM) works directly with the PCA facial features of the video segment for accumulated maximum likelihood (ML) decision.  The experiments demonstrate the robustness of the proposed face detection and tracking scheme and the three streaming face recognition schemes with 99&amp;#x25; accuracy of the CDHMM rule.  We then experiment on the system interactions with single person and group people by the integrated layers of activity awareness.  We also discuss the speech-aided incremental learning of new faces.</description><Author>Kohsia S. Huang and Mohan M. Trivedi</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Image and Video for Hearing Impaired People</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2007/45641</link><description>We present a global overview of image- and video-processing-based methods to help the communication of hearing impaired people. Two directions of communication have to be considered: from a hearing person to a hearing impaired person and vice versa. In this paper, firstly, we describe sign language (SL) and the cued speech (CS) language which are two different languages used by the deaf community. Secondly, we present existing tools which employ SL and CS video processing and recognition for the automatic communication between deaf people and hearing people. Thirdly, we present the existing tools for reverse communication, from hearing people to deaf people that involve SL and CS video synthesis.</description><Author>Alice Caplier, S&amp;#233;bastien Stillittano, Oya Aran, Lale Akarun, G&amp;#233;rard Bailly, Denis Beautemps, Nouredine Aboutabit, and Thomas Burger</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Comparative Study of Contour Detection Evaluation Criteria Based on Dissimilarity Measures</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/693053</link><description>We present in this article a comparative study
of well-known supervised evaluation criteria that enable the
quantification of the quality of contour detection algorithms.
The tested criteria are often used or combined in the literature
to create new ones. Though these criteria are classical ones,
none comparison has been made, on a large amount of data,
to understand their relative behaviors. The objective of this
article is to overcome this lack using large test databases both
in a synthetic
and a real context allowing a comparison in
various situations and application fields and consequently to start
a general comparison which could be extended by any person
interested in this topic.
After a review of the most common criteria used for the
quantification of the quality of contour detection algorithms,
their respective performances are presented using synthetic
segmentation results in order to show their performance relevance
face to undersegmentation, oversegmentation, or situations
combining these two perturbations. These criteria are then tested
on natural images in order to process the diversity of the possible
encountered situations. The used databases and the following
study can constitute the ground works for any researcher who
wants to confront a new criterion face to well-known ones.</description><Author>S&amp;#233;bastien Chabrier, H&amp;#233;l&amp;#232;ne Laurent, Christophe Rosenberger, and Bruno Emile</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>A Fuzzy Color-Based Approach for Understanding Animated Movies Content in the Indexing Task</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/849625</link><description>This paper proposes a method for detecting and analyzing the color techniques used in the animated
movies. Each animated movie uses a specific color palette which makes its color distribution one
major feature in analyzing the movie content. The color palette is specially tuned by the author
in order to convey certain feelings or to express artistic concepts. Deriving semantic or symbolic
information from the color concepts or the visual impression induced by the movie should be an
ideal way of accessing its content in a content-based retrieval system. The proposed approach
is carried out in two steps. The first processing step is the low-level analysis. The movie color
content gets represented with several global statistical parameters computed from the movie global
weighted color histogram. The second step is the symbolic representation of the movie content.
The numerical parameters obtained from the first step are converted into meaningful linguistic
concepts through a fuzzy system. They concern mainly the predominant hues of the movie, some of Itten&amp;#x2019;s color contrasts and harmony schemes, color relationships and color richness. We use the
proposed linguistic concepts to link to given animated movies according to their color techniques.
In order to make the retrieval task easier, we also propose to represent color properties in a graphical
manner which is similar to the color gamut representation. Several tests have been conducted on
an animated movie database.</description><Author>Bogdan Ionescu, Didier Coquin, Patrick Lambert, and Vasile Buzuloiu</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Activity Representation Using 3D Shape Models</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/347050</link><description>We present a method for characterizing human activities using 3D deformable shape models. The motion trajectories of points extracted from objects involved in the activity are used to build models for each activity, and these models are used for classification and detection of unusual activities. The deformable models are learnt using the factorization theorem for nonrigid 3D models. We present a theory for characterizing the degree of deformation in the 3D models from a sequence of tracked observations. This degree, termed as deformation index (DI), is used as an input to the 3D model estimation process. We study the special case of ground plane activities in detail because of its importance in video surveillance applications. We present results of our activity modeling approach using videos of  both high-resolution single individual activities and  ground plane surveillance activities.</description><Author>Mohamed F. Abdelkader, Amit K. Roy-Chowdhury, Rama Chellappa, and Umut Akdemir</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Human Posture Tracking and Classification through Stereo Vision and 3D Model Matching</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/476151</link><description>The ability of detecting human postures is particularly 
    important in several fields like ambient intelligence, 
    surveillance, elderly care, and 
human-machine interaction. This problem has been studied in recent 
years in the computer vision community, but the proposed solutions 
still suffer from some limitations due to the difficulty of 
dealing with complex scenes (e.g., occlusions, different view 
points, etc.). In this article, we present a system for posture 
tracking and classification based on a stereo vision sensor. The 
system provides both a robust way to segment and track people in 
the scene and 3D information about tracked people. The proposed 
method is based on matching 3D data with a 3D human body model. 
Relevant points in the model are then tracked over time with 
temporal filters and a classification method based on hidden 
Markov models is used to recognize principal postures. 
Experimental results show the effectiveness of the system in 
determining human postures with different orientations of the 
people with respect to the stereo sensor, in presence of partial 
occlusions and under different environmental conditions.</description><Author>Stefano Pellegrini and Luca Iocchi</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Image and Video Processing for Visually Handicapped People</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2007/25214</link><description>This paper reviews the state of the art in the field of assistive devices for sight-handicapped people. It concentrates in particular on systems that use image and video processing for converting visual data into an alternate rendering modality that will be appropriate for a blind user. Such alternate modalities can be auditory, haptic, or a combination of both. There is thus the need for modality conversion, from the visual modality to another one; this is where image and video processing plays a crucial role. The possible alternate sensory channels are examined with the purpose of using them to present visual information to totally blind persons. Aids that are either already existing or still under development are then presented, where a distinction is made according to the final output channel. Haptic encoding is the most often used by means of either tactile or combined tactile/kinesthetic encoding of the visual data. Auditory encoding may lead to low-cost devices, but there is need to handle high information loss incurred when transforming visual data to auditory one. Despite a higher technical complexity, audio/haptic encoding has the advantage of making use of all available user&amp;#39;s sensory channels.</description><Author>Thierry Pun, Patrick Roth, Guido Bologna, Konstantinos Moustakas, and Dimitrios Tzovaras</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>3D Shape-Encoded Particle Filter for Object Tracking and Its Application to Human Body Tracking</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/596989</link><description>We present a nonlinear state estimation approach using particle filters, for tracking objects whose approximate 3D shapes are known. The unnormalized conditional density for the solution to the nonlinear filtering problem leads to the Zakai equation, and is realized by the weights of the particles. The weight of a particle represents its geometric and temporal fit, which is computed bottom-up from the raw image using a shape-encoded filter. The main contribution of the paper is the design of smoothing filters for feature extraction combined with the adoption of unnormalized conditional density weights. The &amp;#8220;shape filter&amp;#8221; has the overall form of the predicted 2D projection of the 3D model, while the cross-section of the filter is designed to collect the gradient responses along the shape. The 3D-model-based representation is designed to emphasize the changes in 2D object shape due to motion, while de-emphasizing the variations due to lighting and other imaging conditions. We have found that the set of sparse measurements using a relatively small number of particles is able to approximate the high-dimensional state distribution very effectively. As a measures to stabilize the tracking, the amount of random diffusion is effectively adjusted using a Kalman updating of the covariance matrix. For a complex problem of human body tracking, we have successfully employed constraints derived from joint angles and walking motion.</description><Author>H. Moon and R. Chellappa</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Are the Wavelet Transforms the Best Filter Banks for Image Compression?</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/287197</link><description>Maximum regular wavelet filter banks have received much attention in the literature, and
it is a general conception that they enjoy some type of optimality for image coding purposes.
To investigate this claim, this article focuses on one particular biorthogonal wavelet filter
bank, namely, the 2-channel 9/7. As a comparison, we generate all possible 9/7 filter banks
with perfect reconstruction and linear phase while having a different number of zeros at 
z=&amp;#x2212;1
for both analysis and synthesis lowpass filters. The best performance is obtained when the
filter bank has 2/2 zeros at z=&amp;#x2212;1 for the analysis and synthesis lowpass filters, respectively. The competing wavelet 9/7 filter bank, which has 4/4 zeros at z=&amp;#x2212;1, is thus judged inferior both in terms of objective error measurements and informal visual inspections. It is further shown that the 9/7 wavelet filter bank can be obtained using gain-optimized 9/7 filter bank.</description><Author>Ilangko Balasingham and Tor A. Ramstad</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Improving the Quality of Color Colonoscopy Videos</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/139429</link><description>Colonoscopy is currently one of the best methods to detect colorectal cancer. Nowadays, one of the widely
used colonoscopes has a monochrome chipset recording successively at 60&amp;#x02009;Hz R,G, and B components
merged into one color video stream. Misalignments of the channels occur each time the camera moves, and
this artefact impedes both online visual inspection by doctors and offline computer analysis of the image
data. We propose to restore this artefact by first equalizing the color channels and then performing a robust
camera motion estimation and compensation.</description><Author>Rozenn Dahyot, Fernando Vilari&amp;#241;o, and Gerard Lacey</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Robust Color Image Superresolution: An Adaptive M-Estimation Framework</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/763254</link><description>This paper introduces a new color image superresolution algorithm in an adaptive, robust M-estimation framework. Using a robust error norm in the objective function, and adapting the estimation process to each of the low-resolution frames, the proposed method effectively suppresses the outliers due to violations of the assumed observation model, and results in color superresolution estimates with crisp details and no color artifacts, without the use of regularization. Experiments on both synthetic and real sequences demonstrate the superior performance over using the L2 and L1 error norms in the objective function.</description><Author>Noha A. El-Yamany and Panos E. Papamichalis</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>New Structured Illumination Technique for the Inspection of High-Reflective Surfaces: Application for the Detection of Structural Defects without any Calibration Procedures</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/237459</link><description>We present a novel solution for
automatic surface inspection of metallic tubes by applying
a structured illumination. The strength of the proposed
approach is that both structural and textural surface
defects can be visually enhanced, detected, and well
separated from acceptable surfaces.
We propose a machine vision approach and we demonstrate
that this technique is applicable in an industrial
setting. We show that recording artefacts drastically increases
the complexity of the inspection task.
The algorithm implemented in the industrial application
and which permits the segmentation and classification
of surface defects is briefly described. The suggested
method uses &amp;#x201C;perturbations from the stripe illumination&amp;#x201D;
to detect, segment, and classify any defects.
We emphasize the robustness of the algorithm against
recording artefacts. Furthermore, this method is applied
in 24&amp;#x2009;h/7 day real-time industrial surface inspection system.</description><Author>Yannick Caulier, Klaus Spinnler, Salah Bourennane, and Thomas Wittenberg</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>A Motion-Adaptive Deinterlacer via Hybrid Motion Detection and Edge-Pattern Recognition</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/741290</link><description>A novel motion-adaptive deinterlacing algorithm with edge-pattern recognition and hybrid motion detection is introduced. The great variety of video contents makes the processing of assorted motion, edges, textures, and the combination of them very difficult with a single algorithm. The edge-pattern recognition algorithm introduced in this paper exhibits the flexibility in processing both textures and edges which need to be separately accomplished by line average and edge-based line average before. Moreover, predicting the neighboring pixels for pattern analysis and interpolation further enhances the adaptability of the edge-pattern recognition unit when motion detection is incorporated. Our hybrid motion detection features accurate detection of fast and slow motion in interlaced video and also the motion with edges. Using only three fields for detection also renders higher temporal correlation for interpolation. The better performance of our deinterlacing algorithm with higher content-adaptability and less memory cost than the state-of-the-art 4-field motion detection algorithms can be seen from the subjective and objective experimental results of the CIF and PAL video sequences.</description><Author>Gwo Giun Lee, Ming-Jiun Wang, Hsin-Te Li, and He-Yuan Lin</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Comparison of Image Transform-Based Features for Visual Speech Recognition in Clean and Corrupted Videos
</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/810362</link><description>We present results of a study into the performance of a variety of different image transform-based feature types for speaker-independent visual speech recognition of isolated digits. This includes the first reported use of features extracted using a discrete curvelet transform. The study will show a comparison of some methods for selecting features of each feature type and show the relative benefits of both static
and dynamic visual features. The performance of the features will be tested on both clean video data
and also video data corrupted in a variety of ways to assess each feature type&amp;#x27;s robustness to potential
real-world conditions. One of the test conditions involves a novel form of video corruption we call jitter
which simulates camera and/or head movement during recording.</description><Author>Rowan Seymour, Darryl Stewart, and Ji Ming</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Combination of Accumulated Motion and Color Segmentation for Human Activity Analysis</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/735141</link><description>The automated analysis of activity in digital multimedia, and especially video, is gaining more and more importance due to the evolution of higher-level video processing systems and the development of relevant applications such as surveillance and sports. This paper presents a novel algorithm for the recognition and classification of human activities, which employs motion and color characteristics in a complementary manner, so as to extract the most information from both sources, and overcome their individual limitations. The proposed method accumulates the flow estimates in a video, and extracts &amp;#x201C;regions of activity&amp;#x201D; by processing their higher-order statistics. The shape of these activity areas can be used for the classification of the human activities and events taking place in a video and the subsequent extraction of higher-level semantics. Color segmentation of the active and static areas of each video frame is performed to complement this information. The color layers in the activity and background areas are compared using the earth mover&amp;#x27;s distance, in order to achieve accurate object segmentation. Thus, unlike much existing work on human activity analysis, the proposed approach is based on general color and motion processing methods, and not on specific models of the human body and its kinematics. The
combined use of color and motion information increases the method robustness to illumination variations and measurement noise. Consequently, the proposed approach can lead to higher-level information about human activities, but its applicability is not limited to specific human actions. We present experiments with various real video sequences, from sports and surveillance domains, to demonstrate the effectiveness of our approach.</description><Author>Alexia Briassouli, Vasileios Mezaris, and Ioannis Kompatsiaris</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Color Image Coding by Colorization Approach</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/158273</link><description>This paper proposes a new color image coding scheme called &amp;#8220;colorization image coding.&amp;#8221; The scheme is based on the colorization technique which can colorize a monochrome image by giving a small number of color pixels. We develop algorithms useful for color image coding. First, the luminance component is separated from an input color image. Then, a small number of color seeds are selected as chrominance information. The luminance image component is coded by a lossy coding technique and the chrominance image component is stored as color seeds. The decoding is performed by the colorization algorithm. It is shown that this colorization technique is effective to image coding, especially for high compression rate, through the experiments using different types of images.</description><Author>Takahiko Horiuchi and Shoji Tominaga</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item><item><title>Anthropocentric Video Segmentation for Lecture Webcasts</title><link>http://www.hindawi.com/GetArticle.aspx?doi=10.1155/2008/195743</link><description>Many lecture recording and presentation systems
transmit slides or chalkboard content along with
a small video of the instructor. As a result, two
areas of the screen are competing for the viewer&amp;#x27;s
attention, causing the widely known split-attention
effect. Face and body gestures, such as pointing,
do not appear in the context of the slides or the
board. To eliminate this problem, this article proposes
to extract the lecturer from the video stream
and paste his or her image onto the board or slide
image. As a result, the lecturer acting in front of
the board or slides becomes the center of attention.
The  entire lecture presentation becomes
more human-centered. This article presents both
an analysis of the underlying psychological problems
and an explanation of signal processing techniques
that are applied in a concrete system. The
presented algorithm is able to extract and overlay
the lecturer online and in real time at full video resolution.</description><Author>Gerald Friedland and Raul Rojas</Author><copyright>&amp;#169; 2008, Hindawi Publishing Corporation. All rights reserved.</copyright></item></channel></rss>