﻿<?xml version="1.0" encoding="utf-8"?><rss version="2.0"><channel><title>Advances in Multimedia</title><link>http://www.hindawi.com</link><description>The latest articles from Hindawi Publishing Corporation</description><copyright>&amp;#169; 2012, Hindawi Publishing Corporation. All rights reserved.</copyright><item><title>A Survey of Open Source Products for Building a SIP Communication Platform</title><link>http://www.hindawi.com/journals/am/2011/372591/</link><description>The Session Initiation Protocol (SIP) is a multimedia signalling protocol that has evolved into a widely adopted communication standard. The integration of SIP into existing IP networks has fostered IP networks becoming a convergence platform for both real-time and non-real-time multimedia communications. This converged platform integrates data, voice, video, presence, messaging, and conference services into a single network that offers new communication experiences for users. The open source community has contributed to SIP adoption through the development of open source software for both SIP clients and servers. In this paper, we provide a survey on open SIP systems that can be built using publically available software. We identify SIP features for service development and programming, services and applications of a SIP-converged platform, and the most important technologies supporting SIP functionalities. We propose an advanced converged IP communication platform that uses SIP for service delivery. The platform supports audio and video calls, along with media services such as audio conferences, voicemail, presence, and instant messaging. Using SIP Application Programming Interfaces (APIs), the platform allows the deployment of advanced integrated services. The platform is implemented with open source software. Architecture components run on standardized hardware with no need for special purpose investments.</description><Author>Pavel Segec and Tatiana Kovacikova</Author><copyright>Copyright &amp;#xa9; 2011 Pavel Segec and Tatiana Kovacikova. All rights reserved.</copyright></item><item><title>Mobile Music Distribution: A Multichannel Approach</title><link>http://www.hindawi.com/journals/am/2011/706164/</link><description>In contrast to what is happening in the Internet-based scenario, the music market in
the mobile scenario is far from being considered a large success. Several studies state
that excessive downloading time and high cost are the main burdens. Motivated by
the growth of social and mobile applications, in this paper we propose an approach
that aims at reducing both the downloading time and the cost to get digital music
when acquired in the mobile scenario. The proposed architecture exploits the usage of
personal communication technologies embedded in cellphones (e.g., Bluetooth and
Wi-Fi) to couple the current distribution model (mainly based on 3G networks), so as
to provide a multichannel distribution model where users are free to redistribute
digital music. The architecture includes a license-based security mechanism that
prevents unauthorized usage of digital music, and makes use of an incentive
mechanism to stimulate and reward the music distribution among customers. By
analyzing pros and cons of the music distribution chain, results show that the
proposed architecture might help in reducing both the downloading time and the cost
to get digital music when acquired in the mobile scenario. Therefore, it might be
helpful to the success of the mobile music scenario.</description><Author>Marco Furini</Author><copyright>Copyright &amp;#xa9; 2011 Marco Furini. All rights reserved.</copyright></item><item><title>Real-Time Adaptive Content-Based Synchronization of Multimedia Streams</title><link>http://www.hindawi.com/journals/am/2011/914062/</link><description>Traditional synchronization schemes of multimedia applications are based on temporal relationships between inter- and intrastreams. These schemes do not provide good synchronization in the presence of random delay. As a solution, this paper proposes an adaptive content-based synchronization scheme that synchronizes multimedia streams by accounting for content in addition to time. This approach to synchronization is based on the fact that having two streams sampled close in time does not always imply that these streams are close in content. The proposed scheme primary contribution is the synchronization of audio and video streams based on content. The secondary contribution is adapting the frame rate based on content decisions. Testing adaptive content-based and adaptive time-based synchronization algorithms remotely between the American University of Beirut and Michigan State University showed that the proposed method outperforms the traditional synchronization method. Objective and subjective assessment of the received video and audio quality demonstrated that the content-based scheme provides better synchronization and overall quality of multimedia streams. Although demonstrated using a video conference application, the method can be applied to any multimedia streams including nontraditional ones referred to as supermedia like control signals, haptic, and other sensory measurements. In addition, the method can be applied to synchronize more than two streams simultaneously.</description><Author>Imad H. Elhajj, Nadine Bou Dargham, Ning Xi, and Yunyi Jia</Author><copyright>Copyright &amp;#xa9; 2011 Imad H. Elhajj et al. All rights reserved.</copyright></item><item><title>Representing Images&amp;#39; Meanings by Associative Values with Given Lexicons Considering the Semantic Tolerance Relation</title><link>http://www.hindawi.com/journals/am/2011/786427/</link><description>An approach of representing meanings of images based on associative values with lexicons is proposed. For this, the semantic tolerance relation model (STRM) that reflects the tolerance degree between defined lexicons is generated, and two factors of semantic relevance (SR) and visual similarity (VS) are involved in generating associative values. Furthermore, the algorithm of calculating associative values using pixel-based bidirectional associative memories (BAMs) in combination with the STRM, which is easy in implementation, is depicted. The experiment results of multilexicons-based retrieval by individuals show the effectiveness and efficiency of our proposed method in finding the expected images and the improvement in retrieving accuracy because of incorporating SR with VS in representing meanings of images.</description><Author>Ying Dai</Author><copyright>Copyright &amp;#xa9; 2011 Ying Dai. All rights reserved.</copyright></item><item><title>A Reliable Event-Driven Strategy for Real-Time Multiple Object Tracking Using Static Cameras</title><link>http://www.hindawi.com/journals/am/2011/976463/</link><description>Recently, because of 
                  its importance in computer vision and 
                  surveillance systems, object tracking has 
                  progressed rapidly over the last two decades. 
                  Researches on such systems still face several 
                  theoretical and technical problems that badly 
                  impact not only the accuracy of position 
                  measurements but also the continuity of 
                  tracking. In this paper, a novel strategy for 
                  tracking multiple objects using static cameras 
                  is introduced, which can be used to grant a 
                  cheap, easy installation and robust tracking 
                  system. The proposed tracking strategy is based 
                  on scenes captured by a number of static video 
                  cameras. Each camera is attached to a 
                  workstation that analyzes its stream. All 
                  workstations are connected directly to the 
                  tracking server, which harmonizes the system, 
                  collects the data, and creates the output 
                  spatial-tempo database. Our contribution comes 
                  in two issues. The first is to present a new 
                  methodology for transforming the image 
                  coordinates of an object to its real 
                  coordinates. The second is to offer a flexible 
                  event-based object tracking strategy. The 
                  proposed tracking strategy has been tested over 
                  a CAD of soccer game environment. Preliminary 
                  experimental results show the robust performance 
                  of the proposed tracking strategy.</description><Author>Amany M. Sarhan, Ahmed I. Saleh, and Ramy K. Elsadek</Author><copyright>Copyright &amp;#xa9; 2011 Amany M. Sarhan et al. All rights reserved.</copyright></item><item><title>Fast Retrieval Algorithm for Earth Mover's Distance Using EMD Lower Bounds and a Skipping Algorithm</title><link>http://www.hindawi.com/journals/am/2011/421820/</link><description>The earth mover's distance (EMD) is a measure of the distance between two distributions, and it
has been widely used in multimedia information retrieval systems, in particular, in content-based
image retrieval systems. When the EMD is applied to image problems based on color or texture,
the EMD reflects the human perceptual similarities. However, its computations are too expensive
to use in large-scale databases. In order to achieve efficient computation of the EMD during query
processing, we have developed &amp;#8220;fastEMD,&amp;#8221; a library for high-speed feature-based similarity retrievals
in large databases. This paper introduces techniques that are used in the implementation
of the fastEMD and performs extensive experiments to demonstrate its efficiency.</description><Author>Masami Shishibori, Daichi Koizumi, and Kenji Kita</Author><copyright>Copyright &amp;#xa9; 2011 Masami Shishibori et al. All rights reserved.</copyright></item><item><title>Utilizing Implicit User Feedback to Improve Interactive Video Retrieval</title><link>http://www.hindawi.com/journals/am/2011/310762/</link><description>This paper describes an approach to exploit the implicit user feedback gathered during interactive video retrieval tasks. We propose a framework, where the video is first indexed according to temporal, textual, and visual features and then implicit user feedback analysis is realized using a graph-based methodology. The generated graph encodes the semantic relations between video segments based on past user interaction and is subsequently used to generate recommendations. Moreover, we combine the visual features and implicit feedback information by training a support vector machine classifier with examples generated from the aforementioned graph in order to optimize the query by visual example search. The proposed framework is evaluated by conducting real-user experiments. The results demonstrate that significant improvement in terms of precision and recall is reported after the exploitation of implicit user feedback, while an improved ranking is presented in most of the evaluated queries by visual example.</description><Author>Stefanos Vrochidis, Ioannis Kompatsiaris, and Ioannis Patras</Author><copyright>Copyright &amp;#xa9; 2011 Stefanos Vrochidis et al. All rights reserved.</copyright></item><item><title>From Community Detection to Mentor Selection in  Rating-Free Collaborative Filtering</title><link>http://www.hindawi.com/journals/am/2011/852518/</link><description>The number of items that users can now access when navigating on the Web is so huge that these might feel lost. Recommender systems are a way to cope with this profusion of data by suggesting items that fit the users needs. One of the most popular techniques for recommender systems is the collaborative filtering approach that relies on the preferences of items expressed by users, usually under the form of ratings. In the absence of ratings, classical collaborative filtering techniques cannot be applied. Fortunately, the behavior of users, such as their consultations, can be collected. In this paper, we present a new approach to perform
collaborative filtering when no rating is available but when user consultations are known. We propose to take inspiration from local
community detection algorithms to form communities of users and deduce the set of mentors of a given user. We adapt one
state-of-the-art algorithm so as to fit the characteristics of collaborative filtering. Experiments conducted show that the
precision achieved is higher then the baseline that does not perform any mentor selection. In addition, our model almost
offsets the absence of ratings by exploiting a reduced set of mentors.</description><Author>Armelle Brun, Sylvain Castagnos, and Anne Boyer</Author><copyright>Copyright &amp;#xa9; 2011 Armelle Brun et al. All rights reserved.</copyright></item><item><title>Open Profiling of Quality: A Mixed Method Approach to Understanding Multimodal Quality Perception</title><link>http://www.hindawi.com/journals/am/2010/658980/</link><description>To quantify the excellence of multimedia quality, subjective evaluation experiments are conducted. In these experiments, the tradition of quantitative assessment is the most dominating, but it disregards the understanding of participants&amp;#39; interpretations, descriptions, and the evaluation criteria of quality. The goal of this paper is to present a new multimedia quality evaluation method called Open Profiling of Quality (OPQ) as a tool for building a deeper understanding on subjective quality. OPQ is a mixed method combining a conventional quantitative psychoperceptual evaluation and qualitative descriptive quality evaluation based on the individual&amp;#39;s own vocabulary. OPQ is targeted for na&amp;#239;ve participants applicable to experiments with heterogeneous and multimodal stimulus material. The paper presents the theoretical basis of the development of OPQ and overviews the methods for audiovisual quality research. We present three extensive quality evaluation studies where OPQ has been used with 120 participants. Finally, we conclude further recommendations of use of the method in quality evaluation research.</description><Author>D. Strohmeier, S. Jumisko-Pyykk&amp;#246;, and K. Kunze</Author><copyright>Copyright &amp;#xa9; 2010 D. Strohmeier et al. All rights reserved.</copyright></item><item><title>MMSA: Metamodel Multimedia Software Architecture</title><link>http://www.hindawi.com/journals/am/2010/386035/</link><description>Interoperability explains how two or more systems or components exchange and process information. The heterogeneity communication mechanisms of the components (GPRS, WIFI, Bluetooth, ZigBee, etc.), transmission speed, as well as the variety of the media (sound, video, text, and image) they manage have a strong influence on the interoperability. That requires the management of the adaptation to an abstract level in order to avoid ad hoc nonreusable, and/or generalizable solutions. In this paper we propose a metamodel for architectures with heterogeneous multimedia components. It enables the description of the software architectures as a collection of components manipulating various types and formats of data, and interacting between them via specific adaptation connectors.</description><Author>Makhlouf Derdour, Philippe Roose, Marc Dalmau, Nac&amp;#233;ra Ghoualmi Zine, and Adel Alti</Author><copyright>Copyright &amp;#xa9; 2010 Makhlouf Derdour et al. All rights reserved.</copyright></item><item><title>UEP Concepts in Modulation and Coding</title><link>http://www.hindawi.com/journals/am/2010/416797/</link><description>First unequal error protection (UEP) proposals date back to the 1960&amp;#39;s (Masnick and Wolf; 1967), but now with the introduction of scalable video, UEP develops to a key concept for the transport of multimedia data. The paper presents an overview of some new approaches realizing UEP properties in physical transport, especially multicarrier modulation, or with LDPC and Turbo codes. For multicarrier modulation, UEP bit-loading together with hierarchical modulation is described allowing for an arbitrary number of classes, arbitrary SNR margins between the classes, and arbitrary number of bits per class. In Turbo coding, pruning, as a counterpart of puncturing is presented for flexible bit-rate adaptations, including tables with optimized pruning patterns. Bit- and/or check-irregular LDPC codes may be designed to provide UEP to its code bits. However, irregular degree distributions alone do not ensure UEP, and other necessary properties of the parity-check matrix for providing UEP are also pointed out. Pruning is also the means for constructing variable-rate LDPC codes for UEP, especially controlling the check-node profile.</description><Author>Werner Henkel, Khaled Hassan, Neele von Deetzen, Sara Sandberg, Lucile Sassatelli, and David Declercq</Author><copyright>Copyright &amp;#xa9; 2010 Werner Henkel et al. All rights reserved.</copyright></item><item><title>Sketch-Based Spatial Queries for the Retrieval of Human Locomotion Patterns in Smart Environments</title><link>http://www.hindawi.com/journals/am/2009/486465/</link><description>A system for retrieving video sequences created by tracking humans in a smart environment, by using spatial queries, is presented. Sketches made with a pointing device on the floor layout of the environment are used to form queries corresponding to locomotion patterns. The sketches are analyzed to identify the type of the query. Directional search algorithms based on the minimum distance between points are applied for finding the best matches to the sketch. The results are ranked according to the similarity and presented to the user. The system was developed in two stages. An initial version of the system was implemented and evaluated by conducting a user study. Modifications were made where appropriate, according to the results and the feedback, to make the system more accurate and usable. We present the details of the initial system, the user study and the results, and the modifications thus made. The overall accuracy of retrieval for the initial system was approximately 93&amp;#37;, when tested on a collection of data from a real-life experiment. This is improved to approximately 97&amp;#37; after the modifications. The user interaction strategy and the search algorithms are usable in any environment for automated retrieval of locomotion patterns. The subjects who evaluated the system found it easy to learn and use. Their comments included several prospective applications for the user interaction strategy, providing valuable insight for future directions.</description><Author>Gamhewage C. de Silva, Toshihiko Yamasaki, and Kiyoharu Aizawa</Author><copyright>Copyright &amp;#x00A9; 2009 Gamhewage C. de Silva et al. All rights reserved.</copyright></item><item><title>Adaptive Packet-Level Interleaved FEC for Wireless  Priority-Encoded Video Streaming</title><link>http://www.hindawi.com/journals/am/2009/982867/</link><description>Packet-level Forward Error Control (FEC) for video streaming over a wireless network has received comparatively limited investigation, because of the delay introduced by the need to assemble a group of packets. However, packet-level interleaving when combined with FEC presents a remedy to time-correlated error bursts, though it can further increase delay if this issue is not addressed. This paper proposes adapting the overall degree of interleaved packet-level FEC according to the display deadlines of packets, transmit buffer occupation, and estimated video input to the wireless channel, all of which address the issue of delay. To guard against estimation error, the scheme applies a conservative adaptation policy, which accounts for picture type importance to ensure that display deadlines are met, thus avoiding this defect of interleaving. The paper additionally introduces a greedy algorithm that effectively groups packet-level FEC protection according to packet priority. Priority encoding adds extra protection during deep fades. As feedback is not required, the interleaving scheme is suitable for all forms of video broadcast. A Bluetooth piconet demonstrates the packet-level FEC interleaving scheme, which provides higher quality delivered video compared to the industry-standard Pro-MPEG Cop#3r2 interleaving scheme.</description><Author>Rouzbeh Razavi, Martin Fleury, and Mohammed Ghanbari</Author><copyright>Copyright &amp;#x00A9; 2009 Rouzbeh Razavi et al. All rights reserved.</copyright></item><item><title>A Survey of Visual Sensor Networks</title><link>http://www.hindawi.com/journals/am/2009/640386/</link><description>Visual sensor networks have emerged as an important
class of sensor-based distributed intelligent systems,
with unique performance, complexity, and quality of service
challenges. Consisting of a large number of low-power camera nodes,
visual sensor networks support a great number of novel
vision-based applications. The camera nodes provide information
from a monitored site, performing distributed and collaborative
processing of their collected data. Using multiple cameras in the
network provides different views of the scene, which enhances
the reliability of the captured events. However, the large amount
of image data produced by the cameras combined with the
network&amp;#39;s resource constraints require exploring new means
for data processing, communication, and sensor management.
Meeting these challenges of visual sensor networks requires
interdisciplinary approaches, utilizing vision processing, communications
and networking, and embedded processing. In this
paper, we provide an overview of the current state-of-the-art in
the field of visual sensor networks, by exploring several relevant
research directions. Our goal is to provide a better understanding
of current research problems in the different research fields of
visual sensor networks, and to show how these different research
fields should interact to solve the many challenges of visual sensor
networks.</description><Author>Stanislava Soro and Wendi Heinzelman</Author><copyright>Copyright &amp;#x00A9; 2009 Stanislava Soro and Wendi Heinzelman. All rights reserved.</copyright></item><item><title>In-Network Adaptation of Video Streams Using Network Processors</title><link>http://www.hindawi.com/journals/am/2009/905890/</link><description>The increasing variety of networks and end systems, especially wireless devices, pose new challenges in communication support for, particularly, multicast-based collaborative applications. In traditional multicasting, the sender transmits video at the same rate and resolution to all receivers independent of their network characteristics, end system equipment, and users&amp;#39; preferences about video quality and significance. Such an approach results in resources being wasted and may also result in some receivers having their quality expectations unsatisfied. This
problem can be addressed, near the network edge, by applying dynamic, in-network adaptation (e.g., transcoding) of video streams to meet available connection bandwidth, machine characteristics, and client preferences. In this paper, we extrapolate from earlier work of Shorfuzzaman et al. 2006 in which we implemented and assessed an MPEG-1 transcoding system on the Intel IXP1200 network processor to consider the feasibility of in-network transcoding for other video formats and network processor architectures. The use of &amp;#8220;on-the-fly&amp;#8221; video adaptation near the edge of the network offers the promise of simpler support for a wide range of end devices with different display, and so forth, characteristics that can be used in different types of environments.</description><Author>Mohammad Shorfuzzaman, Rasit Eskicioglu, and Peter Graham</Author><copyright>Copyright &amp;#x00A9; 2009 Mohammad Shorfuzzaman et al. All rights reserved.</copyright></item><item><title>Traffic and Quality Characterization of the H.264/AVC Scalable Video Coding Extension</title><link>http://www.hindawi.com/journals/am/2008/164027/</link><description>The recent scalable video coding (SVC) extension to the H.264/AVC video coding standard has unprecedented compression efficiency while supporting a wide range of scalability modes, including temporal, spatial, and quality (SNR) scalability, as well as combined spatiotemporal SNR scalability. The traffic characteristics, especially the bit rate variabilities, of the individual layer streams critically affect their network transport. We study the SVC traffic statistics, including the bit rate distortion and bit rate variability distortion, with long CIF resolution video sequences and compare them with the corresponding MPEG-4 Part 2 traffic statistics. We consider (i) temporal scalability with three temporal layers, (ii) spatial scalability with a QCIF base layer and a CIF enhancement layer, as well as (iii) quality scalability modes FGS and MGS. We find that the significant improvement in RD efficiency of SVC is accompanied by substantially higher traffic variabilities as compared to the equivalent MPEG-4 Part 2 streams.  We find that separately analyzing the traffic of temporal-scalability only encodings gives reasonable estimates of the traffic statistics of the temporal layers embedded in combined spatiotemporal encodings and in the base layer of combined FGS-temporal encodings. Overall, we find that SVC achieves significantly higher compression ratios than MPEG-4 Part 2, but produces unprecedented levels of traffic variability, thus presenting new challenges for the network transport of scalable video.</description><Author>Geert Van der Auwera, Prasanth T. David, Martin Reisslein, and Lina J. Karam</Author><copyright>Copyright &amp;#x00A9; 2008 Geert Van der Auwera et al. All rights reserved.</copyright></item><item><title>A Lightweight Buyer-Seller Watermarking Protocol</title><link>http://www.hindawi.com/journals/am/2008/905065/</link><description>The buyer-seller watermarking protocol enables a seller to successfully identify a traitor from a pirated copy, while preventing the seller from framing an innocent buyer. Based on finite field theory and the homomorphic property of public key cryptosystems such as RSA, several buyer-seller watermarking protocols (N. Memon and P. W. Wong (2001) and C.-L. Lei et al. (2004)) have been proposed previously. However, those protocols require not only large computational power but also substantial network bandwidth. In this paper, we introduce a new buyer-seller protocol that overcomes those weaknesses by managing the watermarks. Compared with the earlier protocols, ours is n times faster in terms of computation, where n is the number of watermark elements, while incurring only O(1/lN) times communication overhead given the finite field parameter lN. In addition, the quality of the watermarked image generated with our method is better, using the same watermark strength.</description><Author>Yongdong Wu and Hweehua Pang</Author><copyright>Copyright &amp;#x00A9; 2008 Yongdong Wu and Hweehua Pang. All rights reserved.</copyright></item><item><title>Scalable Video Streaming for Single-Hop Wireless Networks Using a Contention-Based Access MAC Protocol</title><link>http://www.hindawi.com/journals/am/2008/928521/</link><description>Limited bandwidth and high packet loss rate pose a serious challenge for video streaming applications over
wireless networks. Even when packet loss is not present, the bandwidth fluctuation, as a result of an arbitrary number of active flows in an IEEE 802.11 network, can significantly degrade the video quality. This paper aims to enhance the quality of video streaming applications in wireless home networks via a joint optimization of video layer-allocation technique, admission control algorithm, and medium access control (MAC) protocol. Using an Aloha-like MAC protocol, we propose a novel admission control framework, which can be viewed as an optimization problem that maximizes the average quality of admitted videos, given a specified minimum video quality for each flow. We present some hardness results for the optimization problem under various conditions and propose some heuristic algorithms for finding a good solution. In particular, we show that a simple greedy layer-allocation algorithm can perform reasonably well, although it is typically not optimal. Consequently, we present a more expensive heuristic algorithm that guarantees to approximate the optimal solution within a constant factor. Simulation results demonstrate that our
proposed framework can improve the video quality up to 26&amp;#37; as compared to those of the existing approaches.</description><Author>Monchai Lertsutthiwong, Thinh Nguyen, and Alan Fern</Author><copyright>Copyright &amp;#x00A9; 2008 Monchai Lertsutthiwong et al. All rights reserved.</copyright></item><item><title>Morphable 3D-Mosaics: A Hybrid Framework for Photorealistic Walkthroughs of Large Natural Environments</title><link>http://www.hindawi.com/journals/am/2008/697454/</link><description>This paper presents a hybrid (geometry- &amp;amp; image-based) framework suitable for providing photorealistic walkthroughs of large, complex outdoor scenes at interactive frame rates. To this end, based just on a sparse
set of real stereoscopic views from the scene, a set of morphable 3D-mosaics is automatically constructed first, and then, during rendering, a continuous morphing between those 3D-mosaics that are nearby to the current viewpoint is taking place. The morphing is both photometric, as well as geometric, while we also ensure that it proceeds in a physically valid manner, thus remaining transparent to the user. The effectiveness of our framework has been demonstrated in the 3D visual reconstruction of the Samaria Gorge in Crete,
which is one of the largest and most beautiful gorges in Europe.</description><Author>Nikos Komodakis and Georgios Tziritas</Author><copyright>Copyright &amp;#x00A9; 2008 Nikos Komodakis and Georgios Tziritas. All rights reserved.</copyright></item><item><title>On the Design and Exploitation of User&amp;#39;s Personal and Public Information for Semantic Personal Digital Photograph Annotation</title><link>http://www.hindawi.com/journals/am/2008/592690/</link><description>Automating the process of semantic annotation of digital personal photographs is a crucial step towards efficient and effective management of this increasingly high volume of content. However, this is still a highly challenging task for the research community. This paper proposes a novel solution. Our solution integrates all contextual information available to and from the users, such as their daily emails, schedules, chat archives, web browsing histories, documents, online news, Wikipedia data, and so forth. We then analyze this information and extract important semantic terms, using them as semantic keyword suggestions for their photos. Those keywords are in the form of named entities, such as names of people, organizations, locations, and date/time as well as high frequency terms. Experiments conducted with 10 subjects and a total of 313 photos proved that our proposed approach can significantly help users with the annotation process. We achieved a 33&amp;#37; gain in annotation time as compared to manual annotation. We also obtained very positive results in the accuracy rate of our suggested keywords.</description><Author>Supheakmungkol Sarin, Toshinori Nagahashi, Tadashi Miyosawa, and Wataru Kameyama</Author><copyright>Copyright &amp;#xa9; 2008 Supheakmungkol Sarin et al. All rights reserved.</copyright></item><item><title>An Effective Multimedia Item Shell Design for Individualized Education: The Crome Project</title><link>http://www.hindawi.com/journals/am/2008/825671/</link><description>There are several advantages to creating multimedia item types and applying computer-based adaptive testing in education. First is the capability to motivate learning by making the learners feel more engaged and in an interactive environment. Second is a better concept representation, which is not possible in conventional multiple-choice tests. Third is the advantage of individualized curriculum design, rather than a curriculum designed for an average student. Fourth is a good choice of the next question, associated with the appropriate difficulty level based on a student&amp;#39;s response to the current question. However, many issues need to be addressed when achieving these goals, including: (a) the large number of item types required to represent the current multiple-choice questions in multimedia formats, (b) the criterion used to determine the difficulty level of a multimedia question item, and (c) the methodology applied to the question selection process for individual students. In this paper, we propose a multimedia item shell design that not only reduces the number of item types required, but also computes difficulty level of an item automatically. The concept of question seed is introduced to make content creation more cost-effective. The proposed item shell framework facilitates efficient communication between user responses at the client, and the scoring agents integrated with a student ability assessor at the server. We also describe approaches for automatically estimating difficulty level of questions, and discuss preliminary evaluation of multimedia item types by students.</description><Author>Irene Cheng and Anup Basu</Author><copyright>Copyright &amp;#x00A9; 2008 Irene Cheng and Anup Basu. All rights reserved.</copyright></item><item><title>Network Provisioning Using Multimedia Aggregates</title><link>http://www.hindawi.com/journals/am/2007/030893/abs/</link><description>Multimedia traffic makes network provisioning a key issue. Optimal provisioning of network resources is crucial for reducing the service cost of multimedia transmission. Multimedia traffic requires not only provisioning bandwidth and buffer resources in the network but also guaranteeing a given maximum end-to-end delay. In this paper we present methods and tools for the optimal dimensioning of networks based on multimedia aggregates. The proposed method minimises the network resources reservations of traffic aggregates providing a bounded delay. The paper also introduces several methods to generate multimedia traffic aggregation using real video traces. The method is evaluated using a network topology based on the European G&amp;#201;ANT network. The results of these simulations allow us to discover the relationship between a required delay and the necessary bandwidth reservation (or the achievable utilisation limit). An interesting conclusion of these scenarios is that, following several recommendations, the network utilisation can reach values of around 80&amp;#x0025; or higher.</description><Author>Enrique Hern&amp;#225;ndez-Orallo and Joan Vila-Carb&amp;#243;</Author><copyright>Copyright &amp;#x00A9; 2007 Enrique  Hern&amp;#225;ndez-Orallo and Joan  Vila-Carb&amp;#243;. All rights reserved.</copyright></item><item><title>Web Content Search and Adaptation for IDTV: One Step Forward in the Mediamorphosis  Process toward Personal-TV</title><link>http://www.hindawi.com/journals/am/2007/016296/abs/</link><description>We are on the threshold of a mediamorphosis that will revolutionize the way we interact with our TV sets. The combination between interactive digital TV (IDTV) and the Web fosters the development of new interactive multimedia services enjoyable even through a TV screen and a remote control. Yet, several design constraints complicate the deployment of this new pattern of services. Prominent unresolved issues involve macro-problems such as collecting information on the Web based on users&amp;#39; preferences and appropriately presenting retrieved Web contents on the TV screen. To this aim, we propose a system able to dynamically convey contents from the Web to IDTV systems. Our system presents solutions both for personalized Web content search and automatic TV-format adaptation of retrieved documents. As we demonstrate through two case study applications, our system merges the best of IDTV and Web domains spinning the TV mediamorphosis toward the creation of the personal-TV concept.</description><Author>Stefano Ferretti, Marco Roccetti, and Claudio E. Palazzi</Author><copyright>Copyright &amp;#x00A9; 2007 Stefano  Ferretti et al. All rights reserved.</copyright></item><item><title>MOCA: A Low-Power, Low-Cost Motion Capture System Based on Integrated Accelerometers</title><link>http://www.hindawi.com/journals/am/2007/082638/abs/</link><description>Human-computer interaction (HCI) and virtual reality applications pose the challenge of enabling real-time interfaces for natural interaction. Gesture recognition based on body-mounted accelerometers has been proposed as a viable solution to translate patterns of movements that are associated with user commands, thus substituting point-and-click methods or other cumbersome input devices. On the other hand, cost and power constraints make the implementation of a natural and efficient interface suitable for consumer applications a critical task. Even though several gesture recognition solutions exist, their use in HCI context has been poorly characterized. For this reason, in this paper, we consider a low-cost/low-power wearable motion tracking system based on integrated accelerometers called motion capture with accelerometers (MOCA) that we evaluated for navigation in virtual spaces. Recognition is based on a geometric algorithm that enables efficient and robust detection of rotational movements. Our objective is to demonstrate that such a low-cost and a low-power implementation is suitable for HCI applications. To this purpose, we characterized the system from both a quantitative point of view and a qualitative point of view. First, we performed static and dynamic assessment of movement recognition accuracy. Second, we evaluated the effectiveness of user experience using a 3D game application as a test bed.</description><Author>Elisabetta Farella, Luca Benini, Bruno Ricc&amp;#242;, and Andrea Acquaviva</Author><copyright>Copyright &amp;#x00A9; 2007 Elisabetta  Farella et al. All rights reserved.</copyright></item></channel></rss>
