Multimedia Quality Modeling

Publishing date

01 Jan 2022

Status

Closed

Submission deadline

03 Sep 2021

Lead Editor

Zhendong Mu¹

Guest Editors

Chunguo Li² | Kang Song³ | Michel Kadoch⁴

¹Jiangxi University of Technology, Jiangxi, China

²Southeast University, Nanjing, China

³Qingdao University, Qingdao, China

⁴University of Quebec, Quebec, Canada

This issue is now closed for submissions.

Multimedia Quality Modeling

This issue is now closed for submissions.

The volume of multimedia data we handle on a daily basis is growing exponentially due to the availability of ubiquitous and cheap sensors, sharing platforms, and new social trends. Artificial intelligence techniques have proven useful for interpreting this data. In the last few decades, many quality models have been proposed that mimic the process of humans perceiving multimedia data. Such perceptual quality models can provide benefits for a rich variety of multimedia applications. For example, an effective photo aesthetics prediction module can help photographers crop an aesthetically pleasing sub-region from an original poorly framed photo. In addition, a successful photo management system can rank videos based on human perception of video quality (i.e., frame aesthetics, stability, and coherence), thereby the users can conveniently select their favorite pictures into albums. Lastly, different criteria have been developed to select visual or acoustic features for various multimedia applications, e.g., multimodal event detection, real-time speech recognition, and cross-media retrieval.

Extensive research efforts have been dedicated to designing perceptual quality models, but effective tools to manipulate quality prediction are still in their infancy. As far as we know, the key technical challenges include: the deemphasized role of semantic content that may be more important than low-level features in determining media quality; the difficulty to optimally utilize cross-feature information for media quality analysis; and the instability of the biologically/psychologically-inspired features in reflecting human perception, and the lack of a benchmark platform to evaluate the performance of these features.

This Special Issue will focus on the most recent technical progress on computational models for image, video, and audio quality prediction, such as photo/video aesthetic quality ranking and photo cropping/retargeting. We also aim to discover new types of visual/acoustic cues in computational quality models. The primary objective of this Special Issue is to promote the latest research progress in this interesting area. We solicit original research and review articles that address the challenges facing computational models for visual/acoustic quality prediction. This Special Issue targets researchers and practitioners from both industry and academia.

Potential topics include but are not limited to the following: