Abstract

This paper explores the objective of the present video quality analysis (VQA) and measures the full reference metrics keeping in view the quality degradation. During the research work, we conduct experiments on different social clouds (SCs) and low-quality videos. Selected videos are uploaded to SC to assess differences in video service and quality. WeChat shows that the average of all videos (Avg = 100), peak signal-to-noise ratio (PSNR), has no impact on other indicators. Therefore, we believe that WeChat provides the best video quality and multimedia services to their users to meet Quality of Service (QoS)/Quality of Experience (QoE).

1. Introduction

At present, the number of social clouds (SCs) has exceeded the number of multimedia services, and the number of users joining the SC has also increased dramatically. This is because video access from the cloud such as video on demand (VOD), online video gaming, streaming multimedia, Internet protocol television (IPTV), video conferencing, and social networking has been widely used all over the world. These multimedia applications and services are now commonly implemented in various fields, including scientific and educational sectors (e.g., multimedia presentations) [1, 2].

Multimedia service providers (MSPs) devise various approaches and techniques to provide a better QoE that is diversely improved by the user experiences. The users can easily access the QoS from the commercial or noncommercial cloud servers. Therefore, the quality of multimedia services is more critical to the designs and deployed by any networks and services [3].

Users have always high expectations and demands for qualities of video. During the video transmission and online streaming, a single video may experience jitter or additional noise by uploading/downloading of videos on the cloud because the SC compresses the original video and reduces the storage size that automatically affects the video quality [4, 5].

The unnecessary noise increases the problem for MSP and reduces the provision of high definition (HD), high-quality video services agreeing to the user demands. However, QoE and QoS are two modalities in multimedia societies and are highly interconnected to ensure the reliability of quality assessment (QA) that promises to provide QoS and improve the user QoE. Generally, QoE refers to the user factors, i.e., enjoyment, feelings, and satisfaction, while QoS refers to the performance of multimedia applications and networks, e.g., performance, responsiveness, and availability. Quality of Experience is further subcategorized into subjective and objective QoE. Subjective QoE is carried out by surveys (e.g., scale rate, interviews, and questionnaires). Objective QoE, on the other hand, carries out human physiological tests (e.g., MRI and EEG) and measures QoS data or technical parameters (e.g., cost, resolution, frame, and sampling rates) [46].

Concerning the issues and differences in videos compatibility, MSP and network operators believe that the video quality is a more important task in the multimedia processing systems and introduce the video QA methods. QA measures video signals, compression level, and video quality degradation. Moreover, video QA methods can be measured both objectively using mathematical models and subjectively involving user perceptions.

In this paper, our focus is on the object video quality (OVQ) model reflecting the fact that quality models can be measured objectively but do not work together because OVQ is free from human interpretations, and it is considering a part in subjective quality assessment techniques. The OVQ is a dependent variable of mathematical models in which several independent variables (e.g., video coding parameters) are well worked against the subjective assessment using statistical models and regression techniques. The OVQ models can be defined through the video information such as the parameters available about the original video signal and received video or processed video signals. The three technical models, i.e., full reference (FR), reduced reference (RR), and no-reference (NR) models, are mostly used for video quality assessments, PSNR and video quality measurement (VQM) [7].

Generally, the reference model can measure the quality differences between the original video and the received video/processed video. The NR model makes efforts to work for distorted videos in the absence of original reference video, but the NR model approach is a less accurate method as compared to the FR model approach. The FR model considers the degradation of video quality and evaluates it more accurately as compared to human-perceived assessments. The reference method is more complicated and most reliable for video quality assessment that requires the availability of the original video sequence to compute a video quality before transmitting the video coding. Therefore, in most of the situations, the OVQ and FR model requires more computational efforts [79].

However, Internet service providers (ISPs) face several multimedia services issues while transmitting HD videos. Guaranteed QoS and improved quality (video) of user experience is a challenge for service providers because HD videos contain high data size and occupy more storage at cloud site, and during uploading, it requires high network bandwidth to transfer these files on the cloud server. This problem leads to a critical issue of service delivery, and it is essential to the analysis of video quality. SC is the stand-alone application where a variety of multimedia services is offered to the users such as video sharing and video posting. These services are generally used by end-users to share their recorded videos on social media [2]. End users share and upload their HD video on different SC, but the original video compressed by SC does not form the original video because the social cloud stores the video by using predefined compression techniques of different video codecs [35].

Our main contributions are to present the real video streaming analysis and compare results of video quality with reference model metrics. The major objectives include the following:(i)Evaluation of three different video resolutions (144p, 240p, and 360p) and their comparison with SC and original videos(ii)Impact of PSNR and VQM in SC videos(iii)Evaluation of mean square error (MSE), MSAD, and Delta and data distortions in SC(iv)Impact of SC videos and different levels of video degradations

The rest of this paper is organized as follows. Section 2 is related to work on existing objective video quality. Section 3 proposes techniques and preliminary experiments. Section 4 provides detailed discussion and results. The paper is finally concluded in Section 5.

2. Literature Review

Researchers have introduced their work in the cloud services [10] and video quality metric [11] and have proposed various video quality models [7, 8, 12] in different environments. The authors in [10] introduced cloud computing technology and proposed an efficient ESVP, Cloud Architecture, to demonstrate the performance of the mobile video stream. A cloud-assisted adaptive and efficient social video sharing method is also discussed. The proposed solution can store and share videos in the cloud because the authors only focused on improving the response time of live videos [13]. The cloud-based video live streaming cloud (VLSC) services proposed a scheduling method based on live video streaming to facilitate the video transcoding process and provide cost-efficient manners and satisfied the viewer QoS demands. In order to correlate with objective video quality, Appina and Channappayya [14] presented bit-stream based features to measure the full reference video quality metric (PSNR, PVQM, VQM, and VIF).

The video 560 bit‐stream features are tested by high-efficiency video coding (HEVC) and used the monotonic prediction of the number of objective quality metrics. The authors in [15] presented the work on full reference 3D video quality assessment and proposed FR stereoscopic VQA metric based on depth and motion subband coefficients of video frames. The coherence of the covariance matrix was performed, and the coherence scores were computed by applying off-the-shelf 2D FR image quality assessment metric on video frames.

Soundararajan and Bovik [16] discussed the objective quality assessment techniques for video conferencing coding and proposed the Gaussian mixture model-based peak signal-to-noise ratio (GMM-PSNR). The work evaluated the quality of video conferencing coding. Multimedia file format and codec are widely used in video communication systems based on the related work of wireless network and proposed algorithms for a multimedia file format (MPEG4, FLV, and 3GP). The authors used the ratio of signal power to the noise power (SNR) and peak signal-to-noise ratio (PSNR) metrics to calculate the performance and differences of each file format. Results show that the different file formats vary in different qualities [16].

A comparison of objective video quality metrics was identified in mean opinion score (MOS) or DMOS by analyzing the correlation between subjective QoE, which were obtained from user test and perceived score by objective metrics. Results showed the insufficiency of subjective data and recommended to use different subjective procedures that revealed more comprehensiveness [17]. The quality assessment algorithm and a reduced reference VQ assessment is presented. A hybrid approach was used in statistical models to utilize the spatial and temporal differences by using a Gaussian scale mixture model. The proposed algorithm showed better performance PSNR for predicting the video quality. Other sides developed QoE predictive model for Internet videos that measured the user engagement parameters and relationships and showed delivery infrastructure that achieves 20% improvement in user engagement as compared to the other approaches [18].

However, online video streaming traffic and accuracy are estimated at 57% to 69%, while playing video and user perceived video streaming services and providing user experience. The work measured QoE for online video streaming services. QoE metrics are used to optimize quality challenges but not in case of uploaded/downloaded videos from clouds [19].Video quality is estimated based on artificial neural network (ANN), and it adopted a neuro-fuzzy interface that predicted the MOS score. However, video quality is obtained from PSNR to MOS but did not perform subjective assessments [20]. The comparison of related work table is illustrated in Table 1.

The quality of multimedia content under different video file formats and video resolutions is compared and analyzed. Subjective evaluation method was used to analyze the impact of video format on quality. The results show that MP4 video obtained the highest influences, but the author does not provide an objective video quality evaluation [26]. The subjective assessment method was used to analyze the impact of video formats on QoE. Results showed MOS that MP4 video obtained the highest rating for 720p and 2160p, FLV for 240p, and WebM for 360p, but the authors did not provide objective video quality assessments [26].

Literature review reveals that few studies [26, 28] are conducted on subjective QoE video quality on social clouds, but the measurement of objective video quality with reference metrics (PSNR, VQM, MSE, MSAD, and Delta) in social clouds is yet not researched. To the best of authors’ knowledge, this is the first study that deals with the problem of video quality and presents the outcomes of video quality using online video streaming and social media videos.

3. Proposed Technique

To examine the performance of video quality in SC, we preferred to use the three most popular social media clouds Facebook, VK, and WeChat in our experiments. Original videos were selected and taken from the YouTube site [2931] and downloaded using source site [32]. The original video contains three different video resolutions (144p, 240p, and 360p) and has the same (.mp4) file format uploaded on social clouds.

The technical parameters of the original video were recorded before uploading the original video on SCs because the social clouds automatically compress the original video parameters according to their predefined compression algorithms and reduce the storage size. In addition, the original and resulting videos were compared and examined to know the full reference metric by using VQMT simulation tool. The proposed design is given in Figure 1 and the relevant technical parameters are given in Table 2.

The peak signal-to-noise ratio (PSNR) metric is widely used in the field of signal processing because it is a maximum ratio of power between signals to noise. PSNR average (Avg) value is commonly used in the practice of video processing, where video quality and loss compression codecs are considered for testing of video quality or improving the video codec. The higher and lower value of PSNR indicates the reconstructed video quality. If the value of PSNR in video compression is between 30 dB to 50 dB and provides 8 bits of depth, then 50 dB PSNR is better. This metric is also useful to measure video codec standards (MPEG-4, H.263, and H.264). Therefore, this metric is only valid when it applies to compare the results between the same codec, the same codec type, and the same corresponding video contents. Generally, PSNR is equivalent to MSE, but it is easily defined as compared to MSE because of the scale of the technique. It can be calculated by the equation given in [33].

Calculate the average PSNR for sequence video and evaluate MSE average for all frames using the ordinary equation of PSNR that can be presented as

VQM is generally used for objective video quality measurements. This metric evaluates the video streams between original video to processed video and includes features with spatial-temporal alignments, valid region-estimations, and gain-level offset calculations. VQM calculation can extract perception-based features to compute the video quality parameters and generate more accurate predictions of human-perceived video quality [34].

The computed MSE metric is the simplest metric to compute differences between two videos and compare video signals. It can be calculated as

Delta and mean absolute difference (MSAD) metrics are also the simplest way to calculate the mean differences of the color (R, G, and B) components in the corresponding values of the video, and this metric is mostly used for filters and testing codecs. Therefore, the VQM supports all features of MSE, Delta, and MSAD. MSE and Delta can be calculated bywhere corresponds to red color and corresponds to the green color in Delta metrics. MSAD metric is calculated by

4. Results and Discussion

This section presents the detailed results and analysis of video quality; furthermore, it analyzes three SCs (Facebook, VK, and WeChat) and low-quality videos (144p, 240p, and 360p). Selected videos are uploaded to SC to assess differences in video services and quality.

4.1. Original Video Parameters

To collect the objective video quality analysis and results, in the first step, we measured the original video parameters by using VLC (Video LAN Client) media player (version 2.2.1) [34]. The VLC extracted the detailed information of each video file in technical parameters (e.g., frame width/height, data rate, total bitrate, frame rate, audio bitrate, storage size, playing duration, and video file format). The screenshots of original videos and technical parameters are given in Figure 2 and Table 3.

4.2. Social Cloud Video Quality with Reference Metrics

In this section, the interpretation with reference metrics and results are explained. Hence, the five reference metrics (PSNR, MSE, MSAD, Delta, and VQM) are considered. The Moscow State University (MSU) video quality measurement tool (VQMT) software (version 11.0) is preferred for analysis. This software is specially used for testing video codecs and to perform a comparative analysis of objective video quality using reference and no-reference metrics. For interpretation of each parameter, it is necessary to know the values of the metrics, i.e., either social videos provide better quality or not to the end-users. Each frame can measure PSNR in video sequence, which corresponds to the higher/lower values of an average (Avg). The higher value (Avg = 100) is better for the same frames in PSNR. Furthermore, lower values (Avg = 0) for VQM, MSE, and MSAD are better for equal frames, and lower Delta value (Avg = 0) indicates equal frames, i.e., both are signed (+, −) values in mean deviation, which means lower absolute values are better. The parameters of the reference metric are listed in Table 4.

We uploaded 144p video on the Facebook social cloud, and analysis of Facebook Video 144p is presented in Figure 3. Results reveal that Facebook provides low quality and shows that Avg (38.155602) PSNR is less than 100 PSNR quality, VQM metrics (Avg = 0.770144 < 0), MSE metric value (Avg = 9.943065), MSAD (Avg = 1.666160), and Delta metrics (Avg = −0.138500). Facebook Video 240p is not processed by VQMT tool because video resolution scale is decreased to (400 × 224) and is comparatively different from the default original video.

Facebook video 360p results are shown in Figure 4; PSNR value Avg = 40.156540, with original 5.0 kbps bitrate as compared to 4.1 kbps Facebook bitrate is observed. Furthermore, VQM metric was noticed, Avg = 0.648435, lower than zero and MSE value, i.e., Avg = 6.272291. The MSAD value, i.e., Avg = 1.291984, and Delta value were observed to be negative, i.e., Avg = −0.051095.

Figure 5 shows the results of VK video 144p. We observed low resolution video quality that VK decreases the PSNR (Avg = 48.9 < 100), VQM metric (Avg = 0.250), MSE (Avg = 0.847), MSAD (Avg = 0.520), and Delta (Avg = −0.00659) with 1.7 kbps bitrate. Furthermore, VK video 240p is also not processed by VQMT tool because the resolution of video is decreased (320 × 180) and different from the default values.

The next observation of VK Video 360p is shown in Figure 6. The results show that PSNR value is Avg = 46.4 < 100, which is 5.0kbps lower than the original 6.7kbps. Furthermore, it is shown that all metrics are greater than zero such as VQM (Avg = 0.368), MSE (Avg = 1.49), MSAD (Avg = 0.718), and Delta (Avg = −0.00178).

We repeat the same for the next social cloud WeChat for video 144p. WeChat provided the best quality metrics; we observed that PSNR value was Avg = 100, with bitrate 1.7 kbps as compared to original bitrate 1.7 kbps, and the rest of the metric value was (Avg = 0.000). WeChat (144p) results are shown in Figure 7.

The second video of WeChat 240p analyzed the same results as compared to WeChat 144p. The PSNR value was (Avg = 100) with bitrate 2.6 bps as compared to original bitrate of 2.6 bps, and the rest of the metric value was (Avg = 0.000). WeChat 240p results are shown in Figure 8.

Further, we evaluate WeChat 360p video. The results show that PSNR value was (Avg = 100), and VQM, MSE, MSAD, and Delta metric values were (Avg = 0.000) with bitrate 5.0 kbps as compared to original bitrate of 5.0 kbps, and the rest of the metric value was (Avg = 0.000). The extracted WeChat 360p metrics results are given in Figure 9.

The conclusion that we can make from the consolidated comparison results is presented in Table 5. The results show that in video 144 P, Facebook video decreases as PSNR (Avg = 61.84%) and VK (Avg = 56.36%). In addition, Facebook video 240 P with decreased video parameters cannot extract data and mismatch video attributes, and Facebook video 360 P reduces PSNR (Avg =59.84%) and VK (Avg =57.95%). As compared to all WeChat videos, PSNR value was (Avg = 100). It is evident that WeChat provides the best quality of videos to the end-users. Furthermore, the comparison of video quality 144p and 360p of three social cloud results is shown in Figures 10 and 11.

5. Conclusion

We used Intel® core™i7-4790 CPU @3.60GH, RAM 16 GB, and 64 bit OS type. In this paper, we explore an objective video quality assessment technique which is based on reference metrics (PSNR, VQM, MSE, MSAD, and Delta) and is analyzed and compared with the social cloud hosting and video services. While many researchers worked on subjective QoE, objective QoE video quality of SC is never considered by anyone in past research. Our focus is on analyzing video distortion and added noise in videos while uploading on social clouds to assess which video parameters have higher impact on objective QoE. This paper conducted experiments on free social cloud videos. The low-resolution video (144p, 240p, and 360p) was selected from YouTube and was uploaded on Facebook, VK, and WeChat for measuring the social cloud video quality and providing QoS/QoE user satisfaction level. These results provided an assessment of video quality, in which social clouds suffered due to a high compression level. Results show that decrease of full reference metrics has a higher impact on video quality, i.e., the results show that for video 144p, the average PSNR of Facebook decreased by 61.84%, while the average PSNR of VK decreased by 56.36%. Video 360p showed that the average PSNR of Facebook was affected (Avg = 59.84%), and the average PSNR of VK was affected (Avg = 57.95%). The other indicators (VQM, MSE, MSAD, and Delta) were also affected, compressing video at 144p, 240p, and 360p because the average PSNR value was inefficient. WeChat shows that the average peak signal-to-noise ratio of all videos is (Avg = 100) PSNR, which has no impact on other indicators and is the best video quality. We prove in results that Facebook and VK compressed more videos and provided low quality of video, but increased user QoE. Therefore, we can conclude that WeChat provides a high quality of services that ensure user satisfaction. This research will be helpful for service providers to pay concentration and improve multimedia services in the future.

Data Availability

The data used to support the findings of the this study are included within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Authors’ Contributions

Sajida Karim and Hui He conducted research to write the paper and A. R. Junejo and Mariyam Sattar reviewed the paper to set context and contributed as experts of the field.

Acknowledgments

This study was supported by the National Key R&D Program of China under grant no. 2017YFB803304 and the National Science Foundation of China (NSFC) under grant nos. 61872110 and 61672186.