Advanced Pattern Recognition Systems for Multimedia DataView this Special Issue
Analysis on the Application of Multimedia-Assisted Music Teaching Based on AI Technology
In order to improve the effect of modern music teaching, this paper combines AI technology to construct a multimedia-assisted music teaching system, combines music teaching data processing requirements to improve the algorithm, proposes appropriate music data filtering algorithms, and performs appropriate data compression processing. Moreover, the functional structure analysis of the intelligent music teaching system is carried out with the support of the improved algorithm, and the three-tier framework technology that is currently more widely used is used in the music multimedia teaching system. Finally, in order to realize the complex functions of the system, the system adopts a layered approach. From the experimental research results, it can be seen that the multimedia-assisted music teaching system based on AI technology proposed in this paper can effectively improve the effect of modern music teaching.
The main function of constructing a music teaching multimedia system is to realize resource sharing. In the system, a variety of teaching courseware can be provided for students to appreciate and help improve students’ level. Music is a hands-on course that requires in-depth discussions between students and teachers, so as to quickly improve the level of singing or performance and personal performance. The combination of traditional teaching methods and informatization methods can help in learning from each other’s strengths and can improve students’ interest in learning through some video and audio . Specifically, the significance of constructing a multimedia system for music teaching is mainly manifested in the following aspects. The multimedia system is a rich resource library, which can realize the fair distribution of educational resources . Music majors can view the music resources shared in the multimedia system. There is no difference between good and bad conditions, so that every student has equal educational opportunities, which helps to improve students’ music literacy and teaching quality .
It can provide a brand-new communication method and interactive platform between students and teachers. Under the traditional teaching mode, the communication between teachers and students is limited to the classroom or the phone. When students encounter problems in the music process, they often have trouble to be resolved in time. For this purpose, this topic provides an intelligent question-and-answer module, which supports automatic answering of students’ questions, so that students’ learning is no longer restricted by time and can ask questions at any time. If you encounter a question that the system cannot answer, you can leave it behind and solve it together when the teacher is online or in class, which can improve efficiency. Each student’s learning style is different and is not the same, and the traditional teaching mode cannot meet the learning needs of different students, and under the information mode, students can develop a learning plan that meets their personal characteristics according to their needs, so that they can fully explore intrinsic potential of the individual, improving students’ ability to learn independently. The music teaching system provides a wealth of music teaching resources, including multimedia courseware and teaching audiovisual. Each student can choose teaching resources that match the progress according to their needs to learn, which can give full play to the students’ learning initiative, which is the teacher’s teaching and student learning. The auxiliary tools also help to improve the teaching quality of music majors in a university.
Artificial intelligence is becoming an important tool for automatic and personalized learning. The application of AI makes music education and creation easier than ever. For example, using the artificial intelligence music creation Amper Music, students can create their own music albums, and SmartMusic can also synthesize and create music. These new forms of music creation greatly stimulate students’ innovation and creativity in music. The application of artificial intelligence technology can complete the extraction, processing, and storage of MIDI cloud data. The teaching quality evaluation is carried out through the music classroom teaching wisdom evaluation platform. Evaluate the knowledge points of music appreciation and investigation: singing or performance form; timbre discrimination (instrumental music, vocal); musical structure, mode discrimination, and music genre; and the characteristics of musical elements and musical styles. Turning subjective evaluation into objective evaluation to a large extent changes the existing music education system.
This article applies AI technology to music teaching and combines multimedia technology to construct a multimedia-assisted music teaching system based on AI technology to improve the scientificity and effectiveness of music teaching.
2. Related Work
The application of metadata to describe the organization and network of educational resources to improve the existing educational resource utilization structure is the GEM project. The core principle of the development of the specific element data element of GEM is the core of the project . GEM reuses the 15 elements of the Dublin Core Metadata Element Set (DC) as the core element and is expanding functionally based educational resources. GEM Cat is a coding software for users’ educational resource metadata. With GEM Cat users, corresponding metadata records of educational resources are generated, and metadata records are used to describe and locate network resources . The design principles of the American Educational Resource System emphasize that GEM does not collect actual resources, only storage resources for metadata information. The collection of all the records assembled on the GEM constitutes the basic resource of the entire education. The acquisition module collects metadata records in order to create a simple HTML page in a record gateway abroad, and teaching resource management system has been used for many years . GEM Cat is a coding software for users’ educational resource metadata. With GEM Cat users, corresponding metadata records of educational resources are generated, and metadata records are used to describe and locate network resources. For example, in the card catalog library, metadata are formed in two forms, and other metadata records in which the resource is embedded can be saved as a separate file . The acquisition plan is to collect all the metadata resources about a specific website to form a local site index and a catalog list of portal websites editors to add resources . The design principles of the American Educational Resource System emphasize that GEM does not collect actual resources, only storage resources for metadata information. The collection of all the records assembled on the GEM constitutes the basic resource of the entire education. The acquisition module collects metadata records in order to create a simple HTML page which is the relevant educational resource in a record gateway system . BorwseBu11der is an online educational resource search engine of the project, which integrates a wide range of public and nonpublic information suppliers, educational institutions at all levels, and educational management and information resources. Users can search for the educational resources they need, or browse the web by keywords or topics .
GEM Cat, Harvest, and Browse Builder editing and organizing tools of GEM project metadata specification are developed to build a huge educational resource description metadata record database on this basis. The educational resource management system enables teachers and students to effectively and accurately obtain information about these organizations . The goal of the project is to achieve cooperation between different education departments and many training institutions in different countries. From the development of EdNA, the system develops local information resources and metadata standards. Government and nongovernmental organizations, schools, vocational education and training, adult education, and higher education (EdNA) provide a rich resource for services . EdNA provides two main services: education and training and information catalog and online education resource library. Metadata development and editing tools, EndA Toolset development, created the Australian education portal EdNA online . It is developed and managed by a nonprofit organization-education Co., Ltd. It is subordinate to the Australian Ministry of Education and Training and supervises its use to ensure the quality of resources. Metadata standards follow the electronic resource specifications to ensure that education departments and training institutions discover and manage network resources to achieve the development of interconnection. EdNA online creates a complete index of course resources. It contains two structures: all education and training catalog levels .
3. Multimedia Music Teaching Music Processing Based on AI Technology
As per the regularity of wavelet basis, the vanishing moment order of wavelet function and the support of have a greater influence on music compression; we will introduce the classification of wavelet basis and the relationship between wavelet basis and data compression.
There are many types of wavelet bases, including orthogonal, biorthogonal, and some that are approximately completely reconstructed. Below, we focus on several commonly used wavelet bases.
is a multiresolution analysis of , and is the scaling function of the multiresolution analysis, which satisfies the two-scale equation :
The Fourier transform of h (n) is
Under the regular conditions, the filter or h (n) satisfies the following two equations:or
From the point of view of the filter, we need to find a conjugate filter or h (n) :
The scaling function is constructed by iterative filtering, and then the orthogonal wavelet basis is constructed from the scaling function :or
Generally, we hope that the filter is real; that is, the coefficients of are real, so the polynomial Q should also have real coefficients. According to the structure of , the filter H has N zeros at z = −1 on the unit circle, and the square of the filter’s modulus is . Since Q is a polynomial with real coefficients,
Therefore, can be expressed as a polynomial of cos (kw). If and , then the conditional expression of the conjugate filter can be rewritten as 
Here, P (y) ≥ 0, y∈[0, 1]. Solving the conjoint filter h (n) is transformed into finding a polynomial P (y) that satisfies the above formula.
In fact, the practical application of conjoint filters often requires a finite length, the so-called compact support, which requires the filter h (n) to be nonzero only in a limited area: . The scale function and wavelet function p (x) constructed by the conjoint filter of compact support also have compact support characteristics. On the other hand, since the filter A (n) has only a finite number of nonzero points, the polynomials Q and P also contain only finite terms.
We can summarize the process of constructing a conjoint filter into the following five steps:(1)The algorithm sets a N − 1 order polynomial PN (y) that satisfies the following conditions: (a) P (y) 20; (b) y″ P (1 − y) + (1 − y)″ P (y) = 1; (c) Sup Py < 2 1 :(2)The algorithm sets and finds all the zeros of the polynomial :(3)On the unit circle (), set the algorithm(4)The algorithm writes a polynomial :(5)The algorithm compares the coefficients on both sides of the following equation to obtain the various parameters of the filter A (n):
For signal filtering, we all want the filter to have a linear phase, which requires the filter to have a symmetrical structure. Except for Harr bases, all available orthogonal conjugate filters with compact support characteristics and their wavelet bases do not have a symmetric structure.
In order to find a tightly supported filter bank with a symmetrical structure to realize the wavelet analysis of the signal, one approach is to appropriately relax the requirements for the orthogonality of the filter and use the biorthogonal filter. From the perspective of multiresolution analysis, we will use a pair of dual but unsystematic scale functions and wavelet functions during analysis and synthesis. represents analysis scale and comprehensive scale function and analysis wavelet and comprehensive wavelet function respectively. They respectively satisfy the two-scale expansion and contraction equations:
The biorthogonal relationship between them is represented by the following formula :
The regularity requires to be a continuous function that the corresponding filter is iteratively filtered and converged. If the filter is a finite-length filter with a symmetric structure, the corresponding scale function and wavelet function are also compactly supported and have a symmetric structure. Therefore, the construction of the biorthogonal wavelet base is the same as the construction of the orthogonal wavelet base, and it also comes down to the construction of the filter.
Here, the constructed four filters are required to have a symmetrical structure and enable the wavelet analysis synthesis system to meet the conditions of complete reconstruction. After reasoning, the relationship between them can be drawn:
For the complete reconstruction analysis of the real coefficients, the comprehensive filter bank has the following orthogonality relationship:
The above formula shows that the low-pass filter of the analysis filter bank is orthogonal to the high-pass filter of the synthesis filter bank, and the high-pass filter of the analysis filter bank is orthogonal to the low-pass filter of the synthesis filter bank (biorthogonal).
For linear phase filters, Vetterli proved that they must be one of the following three structures under the condition of complete reconstruction:(1)The two filters H0 (Z) and Hi (Z) both have a symmetrical structure and have odd lengths. The difference between the lengths of the two filters is an odd multiple of 2.(2)Both filters are of even length, one filter is symmetric and the other is antisymmetric, and the difference between the lengths of the two filters is an even multiple of 2.(3)One filter has an even length, and the other filter has an odd length. The zero points of the filter all fall on the unit circle. The two filters are either symmetrical, or one is symmetrical and the other is antisymmetric. In fact, since the zero point can only be on the unit circle, this filter structure has no use value.
The so-called regularity is a description of the smoothness of a function and also a measure of the energy concentration of a function in the frequency domain. The regularity of function is generally described by the Holder definition:
That is, 0 < α < 1, if for any t, β∈R has 
Then, the Holder regularity order of (t) is called α, where C is a constant independent of t and β. If the N-order derivative of p (t) satisfies the above formula and r = N + a, then the regularity order of Holder of w (t) is called r.
Obviously, if r > N (N is a positive integer), then o (t) has an N-order continuous derivative, and the greater the regularity order r. It means that the smoother p (t), the more concentrated the energy in the frequency domain from the relationship between the frequency domain and the time domain of the signal.
From the relationship between the wavelet function and the filter Ho), Go), the regularity of pt) can be controlled by H (o). If H (o) e c°, there must be ( = c″, a) ec, where q is an integer, and C° represents the q-th consecutive differentiable function class on the set of real numbers R. If the regularity order r of p (t) is larger, the regularity of the time domain form of H (a) and G (a) is also stronger. Generally speaking, the effect of reconstruction after data compression is also better.
In the process of wavelet transform, when a smooth part of the input signal passes through filters H (o), G () with poor regularity, its output will quickly show discontinuity as the number of transformation stages increases. This nonsignal discontinuity will also be reflected in the transform coefficients of the discrete wavelet transform (DWT). When compressing music data, these coefficients must be quantized before inverse transformation, resulting in errors in these coefficients, and the higher the compression ratio, the greater the error. The errors of these coefficients appear as distortions of the reconstructed music after the reconstruction filter. The magnitude of the distortion is proportional to the error of the transform coefficient.
For the case where is an FIR filter, the length of h is L. For the case of length L ≤ 10, the effect of music compression and reconstruction will improve faster with the increase of r. When L > 10, the effect of increasing r and L on the peak signal-to-noise ratio (PSNR) after music compression and reconstruction is not obvious. At the same time, the greater the length L of the filter h, the greater the calculation complexity and the longer the calculation time, which is not conducive to real-time processing. Therefore, looking for a filter with the largest regularity order r designed from a smaller L will be beneficial to the processing of music data and improvement of the quality of music processing.
At the same time, it should be noted that the higher the regularity order is, the better the music compression effect is only for most cases. In some cases, there is also a wavelet basis with a small regularity order that is better than a wavelet with a high regularity order on music compression. This is mainly because there are many factors that affect the effect of music compression. In addition to regularity, wavelet basis functions and specially coded music may affect the quality of reconstructed music.
Definition. The function pt is said to have a vanishing moment of order m (me N), ifWe can easily introduce it according to the above formula:That is, is the m-th order zero of . If the wavelet function has m-th order vanishing moments, according to the two-scale relationship and , we can getHere,It can be deduced that the high-pass decomposition filter and the low-pass reconstruction filter have the same order zeros at and respectively, but their order is determined by the vanishing moment of the dual wavelet of .
Obviously, the vanishing moment of the wavelet function characterizes the flatness of the high-pass filter at and the low-pass filter at . The larger the vanishing moment is, the closer the performance of the filter is to an ideal filter. Since the energy of music is concentrated in low frequencies, the larger the vanishing moment of the filter, the less low-frequency energy leaks into the high-frequency subimages during wavelet decomposition, and the smaller the high-frequency coefficients located in the smooth area of the music. Therefore, it is more conducive to music compression. Similarly, because the high-frequency energy leaks to the low-frequency relatively less, there may be more high-amplitude coefficients in the edge or texture area of the music in the high-frequency submap.
In order to minimize the number of high-amplitude coefficients, the frequency band of the signal of infinite length is strictly limited. According to the sampling theorem, strict sampling of the subband signal can satisfy these two conditions. However, for a signal of finite length, after the signal is linearly filtered by wavelet transform, the data size of the subband signal will be larger than that of the original signal, which will cause the boundary extension.
The support length and vanishing moment of the function are a priori independent. But the vanishing moment order and the length of the support are related. Taking orthogonal wavelets as an example, ifhas p-th order vanishing moments, then its support length is at least 2p − 1. For a given vanishing moment order, Daubechies wavelet has the smallest support. In this sense, Daubechies wavelet is optimal. Therefore, when we choose specific wavelets, we are faced with the trade-off between vanishing moment order and support length. If the music is very smooth between the edge points, then we must choose a wavelet with high-order vanishing moments to generate a large number of small-value wavelet coefficients. If the density of edge points increases, the support length should be reduced at the cost of reducing the vanishing moment order. This is because the wavelet whose support includes edge points produces large-scale coefficients. However, for mobile multimedia music, the support cannot be too long, because the support is too long, the calculation amount is too large, and there is no use value, and the increase in the calculation complexity in exchange for not significantly increasing the PSNR value is not worth the loss.
In the wavelet analysis of the signal, the analysis and reconstruction system must meet the following two conditions. 1. Complete reconstruction: the original signal can be completely reconstructed from its subband signal. 2. The sum of the data size of the subband signal is not much as the data size of the original signal; otherwise, the compression efficiency of the encoding will be reduced.
If the complete reconstruction condition is satisfied, the data size of the subband signal during strict sampling will increase, and condition two cannot be satisfied. However, if condition two is met, information will be lost, the reconstructed signal will change, and complete reconstruction cannot be achieved. In order to meet the above two conditions at the same time, the signal must be extended to form an infinite signal to reduce the loss of information.
The length of the original signal is N, and the length of the filter is L. The original signal is extended by [L − 2]/2 points to form the extended signal . There are four commonly used extension methods as follows:(1)Zero continuation is(2)Period continuation is(3)The zero continuation of the boundary is(4)The continuation of the symmetry point isThe boundary distortion is not only related to the correlation of the signal itself near the boundary and subsampling of the transformation result but also related to the nonlinear phase characteristics of the filter. When the filter does not have a linear phase, the weaker the correlation of the signal near the boundary, the greater the distortion of the reconstructed signal at the boundary. In actual music data compression, the more commonly used wavelet bases are the tightly supported orthogonal wavelet bases and the biorthogonal wavelet bases. Except for the Harr wavelet (Harr wavelet has poor regularity), the compactly supported orthogonal wavelet base has no symmetry. The symmetry of the tightly supported orthogonal wavelet function is equivalent to the linear phase characteristic of its filter. Whether the filter has a linear phase is great for eliminating the distortion near the boundary.
4. Multimedia-Assisted Music Teaching System Based on AI Technology
This article combines AI technology and multimedia technology to construct an auxiliary music teaching system. The intelligent music teaching system constructed in this paper is shown in Figure 1.
The music multimedia teaching system must be designed and developed based on the B/S model, so that users can complete the above work online. According to the above requirements, a local area network system can be established first, and then the local area network can be connected through the Internet network to form a music multimedia teaching system. According to the above analysis, the physical structure of the system can be drawn, as shown in Figure 2.
In order to improve the development efficiency and system performance, this paper uses the three-tier framework technology that is currently used in the music multimedia teaching system, which divides the entire music multimedia teaching system into user interface layer, business logic layer, and data storage layer. The combination of the above technologies lays the foundation for the efficient implementation and operation of the system, and its working principle is shown in Figure 3.
The music multimedia teaching system is composed of five parts: music resource management, system management, homework management, interactive management, and basic data setting, as shown in Figure 4.
The timing diagram of the text resource classification adding function is shown in Figure 5.
The timing diagram of the text file playback function in the music resource viewing module is shown in Figure 6.
Based on the analysis of system functions and three-tier structure, the system establishes multiple database storage tables. The relationship between the tables is shown in Figure 7.
The system structure can be divided into three steps: data access, business logic, and system UI design according to the application of logic functions. In order to realize the complex functions of the system, the system adopts a layered approach. The so-called layering means that the system is divided into different modules, and the modules directly adopt loose coupling methods to realize logic calls. Through mutual calls between various modules, a system application architecture is formed, and a complex functional application model is constructed. The system consists of six projects, including one website project and five class library projects, as shown in Figure 8.
After constructing the above system, perform performance test on the system. The system proposed in this paper is applied to music-assisted teaching through AI technology and multimedia technology. Therefore, in the experiment, the music processing effect of the system proposed in this paper is mainly evaluated, the teaching effect is evaluated, and the results shown in Tables 1 and 2 and Figures 9 and 10 are obtained.
From the above experimental research, the multimedia-assisted music teaching system based on AI technology proposed in this paper can effectively improve the effect of modern music teaching.
For AI to be combined with real music teaching guidance and become a usable tool, more understanding of this art form is still needed. To integrate artificial intelligence technology into music classrooms, we first start with the construction of smart classrooms, making musical instruments and auxiliary teaching programs. Moreover, it is connected with digital functions and platforms to analyze the data on the activities and performance of students in music classes provided on the platform and provide feedback information. At the same time, it is necessary to explore the teaching design and implementation of the independent form of AI music courses, learning methods, and learning discussions. It includes teaching resources, course objectives, course content, course activities, course organization and implementation, course design and skills, and basic operation plans for teaching practice. This article applies AI technology to music teaching and combines multimedia technology to construct a multimedia-assisted music teaching system based on AI technology to improve the scientificity and effectiveness of music teaching.
The labeled datasets used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The author declares no conflicts of interest.
This study was sponsored by Shandong University of Arts.
D. B. C. Kiliç, “Pre-service music teachers’ metaphorical perceptions of the concept of a music teaching program[J],” Journal of Education and Learning, vol. 6, no. 3, pp. 273–286, 2017.View at: Google Scholar
J. Zhang and D. Tao, “Empowering things with intelligence: a survey of the progress, challenges, and opportunities in artificial intelligence of things,” IEEE Internet of Things Journal, vol. 8, no. 10, pp. 7789–7817, 2020.View at: Google Scholar
E. Gun, “The opinions of the preservice music teachers regarding the teaching of orchestra and chamber music courses during distance education process,” Cypriot Journal of Educational Sciences, vol. 16, no. 3, pp. 1088–1096, 2021.View at: Google Scholar
X. Shengmin, “Analysis on the innovative strategy of national music teaching in colleges from the perspective of visual communication,” Studies in Sociology of Science, vol. 7, no. 6, pp. 52–55, 2017.View at: Google Scholar
S. K. Kim, N. Sahu, and M. Preda, “Beginning of a new standard: Internet of media things,” KSII Transactions on Internet and Information Systems, vol. 11, no. 11, pp. 5182–5199, 2017.View at: Google Scholar
F. L. Reyes, “A community music approach to popular music teaching in formal music education,” The Canadian Music Educator, vol. 59, no. 1, pp. 23–29, 2017.View at: Google Scholar
V. K. Jones, “Voice-activated change: marketing in the age of artificial intelligence and virtual assistants,” Journal of Brand Strategy, vol. 7, no. 3, pp. 233–245, 2018.View at: Google Scholar
C. Johnson, “Teaching music online: changing pedagogical approach when moving to the online environment,” London Review of Education, vol. 15, no. 3, pp. 439–456, 2017.View at: Google Scholar
S. Y. Hong and Y. H. Hwang, “Design and implementation for iort based remote control robot using block-based programming,” Issues in Information Systems, vol. 21, no. 4, pp. 317–330, 2020.View at: Google Scholar