Abstract

One of the most serious limitations to the practical utilization of solid-state gas sensors is the drift of their signal. Even if drift is rooted in the chemical and physical processes occurring in the sensor, improved signal processing is generally considered as a methodology to increase sensors stability. Several studies evidenced the augmented stability of time variable signals elicited by the modulation of either the gas concentration or the operating temperature. Furthermore, when time-variable signals are used, the extraction of features can be accomplished in shorter time with respect to the time necessary to calculate the usual features defined in steady-state conditions. In this paper, we discuss the stability properties of distinct dynamic features using an array of metal oxide semiconductors gas sensors whose working temperature is modulated with optimized multisinusoidal signals. Experiments were aimed at measuring the dispersion of sensors features in repeated sequences of a limited number of experimental conditions. Results evidenced that the features extracted during the temperature modulation reduce the multidimensional data dispersion among repeated measurements. In particular, the Energy Signal Vector provided an almost constant classification rate along the time with respect to the temperature modulation.

1. Introduction

In a seminal paper by Zaromb and Stetter [1] the possibility to approach the quantitative and the qualitative analysis of simple and complex gas mixtures with solid-state sensor arrays and intelligent signal processing algorithms was introduced. Since that, countless examples of this approach appeared in literature and, among the used sensors technologies, metal-oxide semiconductor gas sensors were largely investigated [2–5]. Even if these researches corroborated the initial hypothesis, the practical application of solid-state sensors is still limited in practice by nontolerable instability of sensors signals. This is manifested as a nonpredictable temporal variation of the sensor signal whose more evident consequence is the spread of sensors responses when the sensor undergoes the same gas exposure. Consequently, calibrated sensor response patterns become, even at short time scale, obsolete providing false evaluations and requiring, as countermeasure, frequent calibrations. The instability of sensor signals derives from two main sources. The first is clearly found in the nature of the chemical and physical processes occurring in the sensor [6], such as ageing (e.g., the reorganization of the sensor surface) and poisoning (e.g., irreversible analyte binding). The second source of instability, often neglected, pertains to uncontrolled changes of the measurement conditions due for instance to fluctuations of the sample flow rate or the temperature of gas and sensors. In addition, other issues related to the experimental setup can induce either memory effects (such as systematic errors due to repeated measurement sequences) or short-term effects (system warm-up, thermal trends). These effects may also trigger the degradation of samples that in some cases can give rise to nonreproducible sensor responses that can be attributed to sensors drift. The sources of sensor instability due to nonaccurate measurement procedure can hopefully be counteracted by careful experimental designs.

The removal of those effects of physical and chemical origins resulting in sensors instability is a complicated task that requires a deep knowledge of the physics and the chemistry involved in the sensing process and, even more complicated, in the control of those technological steps related to the sensor fabrication process. Nonetheless, disregarding the β€œphysical” approach to sensor stability, several authors demonstrated that it is possible to improve the sensor performance by an intelligent selection of the sensor features [7], by an optimization of the sensor operating conditions [8], and by a clever design of signal processing algorithms [9].

Feature extraction is defined as the operation necessary to extract from the temporal evolution of the sensor signal a set of synthetic descriptors that can be used in computations aimed at identifying the gas components in a mixture and their concentration. Features are practically related to the measure of the sensor signal in two conditions defined by the absence and the presence of the sample. The signal measured in these two states can be combined in order to obtain different quantities. For instance, in case of chemically sensitive resistances the simplest features that can be defined are, 𝑅gasβˆ’π‘…ref, and (𝑅gasβˆ’π‘…ref)/𝑅ref where 𝑅gas and 𝑅ref are the sensor resistance measured in presence of the sample and the reference atmosphere, respectively. It is known that even these simple quantities have different stability properties [10]. A methodology suggested to improve the sensor performance consists in the time modulation of the sensor signal obtained changing at least one of the sensor operating conditions. In case of metal-oxide semiconductor gas sensors, since the sensor properties are strongly dependant on the sensor temperature [11], the modulation of temperature is an easy method to obtain a time variable sensor signal. In these sensors temperature modulation is expected to alter the kinetics of both adsorption and reaction processes taking place at the sensor surface. Temperature modulation leads to the development of response patterns, which may be characteristic of the species being detected. In other words, by retrieving information from response dynamics, new response features are obtained that can confer more selectivity to metal oxide sensors. Indeed, several authors have developed this strategy and applied several techniques to extract features that are important for the discrimination or quantification of gases. Fast Fourier Transform (FFT) and Discrete Wavelet Transform (DWT) have been used as feature extraction tools in several studies [12–17]. Various researches shown that the features extracted from temperature modulated gas sensors are more stable with respect to sensors operated at fixed temperature [18–22]. An important set of parameters to be adjusted is defining the temperature modulation profile. In case of simple sinusoidal modulation some results evidenced the existence of optimal modulation frequencies whose application results in a general improvement of selectivity and stability [14, 15].

Concerning the expected efficiency of the methods based on optimized features extraction, signal processing, and operative conditions it is important to consider that it is rather unlikely that these approaches can remove the degradation of physical and chemical properties of a sensor, but since the instability is also due to the fluctuations of the experimental conditions signal processing optimization can be considered as the search of features that are less sensitive to those quantities that fluctuate among repeated measurements.

In this paper, we illustrate the stability properties of a number of distinct features derived from systems theory, a discipline defining the instruments to study and represent complex time evolving systems.

In this study, the features are simultaneously extracted from a temperature-modulated array of metal-oxide semiconductor gas sensor. The stability is evaluated in a four-classes pattern recognition problem assuming that improved stability results in a better classification of the four classes. Sensors were operated in a discontinuous mode temperature modulation where they are normally kept nonpowered and the temperature modulation is applied only during the time interval when feature extraction is performed [23].

2. System Theory and Sensors Features Extraction

Elements of systems theory have been used in the past to model sensors behavior considering not only the steady state changes but also the dynamic evolution of signals. Attempts to model these behaviors using either time series or analytical approximation have been shown in the past [24–27], and the use of specific methods to extract only the dynamic information was also demonstrated [28, 29]. Nonetheless, only recently one of the central concepts for the analysis of dynamics systems, the phase space, has been taken into consideration. In order to define the phase space let us consider a system whose state is completely described by 𝑛 scalar variables. Different states corresponds to different points in an 𝑛-dimension vector space defined by an orthonormal basis where each direction corresponds to one of the scalar variables. The main property of such space is the unique correspondence between points in the space and states of the system.

Practically, the information gained about a system, such as a sensor, is contained in the temporal evolution of the measured signal. Nonetheless, from the time dependence of this scalar quantity a number of novel variables can be generated. For instance, in mechanical systems, the observation of the position with the time gives place to variables such as velocity (first derivative) and acceleration (second derivative) from which the phase space is constructed.

As a consequence, a system, observed through a single scalar quantity for a period of time, is represented by a sequence of points in the phase space. Once suitably parameterized with the time, this sequence forms a trajectory containing all the captured information about the phenomena underlying the system evolution. The relation between the observed signals and the internal variables forming the phase space has been demonstrated by the embedding theorem of Takens [30]. According to this theorem, the variables of the phase space are given by a number of differentiable transformations of the observed signal such as a set of independent coordinates π‘₯(𝑑),π‘₯(𝑑+𝜏1),π‘₯(𝑑+𝜏2),β‹―,π‘₯(𝑑+πœπ‘›). Each variable of this set represents the experimental time series at different time samples (after a specific time delay πœπ‘›, a multiple of the sampling time step Ts). Any dynamic state of the system is therefore represented in a multidimensional phase space as 𝑦(𝑑)=[π‘₯(𝑑),π‘₯(𝑑+𝜏1),π‘₯(𝑑+𝜏2),β‹―,π‘₯(𝑑+πœπ‘›)]. The number of coordinates defining the phase space and the values of the time delays πœπ‘› must be chosen suitably in order to obtain an optimal description of the phenomena. There are sufficient conditions to avoid false projection and ambiguities of the trajectories.

From a general point of view, a chemical sensor can be considered as a dynamic system whose response signal temporally evolves following, with its proper dynamics, the concentration of the analytes. The information is in the temporal evolution of a single scalar quantity (i.e., in the sensor signal) and from this time dependence, a number of new variables can be generated. The minimum set of variables able to describe the state and the evolution of the sensor comprises the response signal (i.e., the observed variable) and its first derivative. Therefore, a phase space can be defined by the orthonormal basis formed by the observed signal and its first derivative. The use of the phase space in the field of gas sensors was first introduced in 2003 [31]. In the Figure 1 the responses of SnO2 sensor in presence of different gas concentrations and the relative trajectories spanned in the phase space are shown.

The description of the trajectory in the phase space give rise to novel features related to the underlying phenomena that can be used to build more accurate gas identification models. The area spanned by the trajectory when it undergoes a transformation (e.g., during the adsorption) is a rather straightforward descriptor to quantify the process. In this quantity both the magnitude and the velocity of the sensor response are considered. However, the area is a descriptor that overlooks some characteristics of the trajectory evolution (i.e., the shape of the trajectory). This is the reason to include other descriptors, for example, the dynamic moments (DMs), that can describe also the morphology of the trajectories [32].

Among the possible quantities that can bring such notions, the following DMs are here discussed, all of them are defined as combination of the phase space variables:

1DM2=𝑛𝑛𝑖=0π‘₯𝑖𝑦𝑖(1)DM3𝑃𝐡=√22𝑛𝑛𝑖=0ξ€·π‘₯2π‘–π‘¦π‘–βˆ’π‘₯𝑖𝑦2𝑖(2)DM3𝑆𝐡=√22𝑛𝑛𝑖=0ξ€Ί2π‘₯3𝑖π‘₯+32π‘–π‘¦π‘–βˆ’π‘₯𝑖𝑦2𝑖(3)DM3𝑋=12𝑛𝑛𝑖=0ξ€·π‘₯3π‘–βˆ’3π‘₯𝑖𝑦2𝑖(4)DM3π‘Œ=12𝑛𝑛𝑖=0ξ€·π‘₯3π‘–βˆ’3π‘₯2𝑖𝑦𝑖(5) where π‘₯(𝑑)|𝑑𝑖𝑑+π‘˜π‘– and 𝑦(𝑑)|𝑑𝑖𝑑+π‘˜+πœπ‘–+𝜏=π‘₯(𝑑+𝜏)|𝑑𝑖𝑑+π‘˜+πœπ‘–+𝜏are portions of the sensor signal (𝑑i=0,1,2,3,…;π‘˜=1,2,3and 5 s; 𝜏=1,2and3s). Each DM describes a different morphological feature of the trajectory, so that the collective use of more than one moment is usually required for an exhaustive description of a phase space trajectory. The first gas sensor signal described by DMs was the quartz microbalance [33], whereas in the example here illustrated they are applied to temperature modulated metal-oxide gas sensors. In Figure 2 typical trajectories in the phase space of a tungsten oxide gas sensor in the presence of the gases are shown. The exposure to different gases and mixtures results in evidently distinct trajectories. Thes differences are transformed in quantities by the application of the dynamic moment.

Phase space is an example of a principle of the classical signal theory applied to the study of the time evolution of a gas sensor signal. Following the suggestions of signal theory let us consider a gas sensor array as a mathematical object, where the observable signals of each sensor are part of a multidimensional array. When the sensors simultaneously interact with the same chemical compounds, the signals of the sensors becomes characterized by a certain degree of mutual correlation. The amount of correlation may change according to the quality and quantity of the gas species to which the array is exposed. One of the quantities that can express the relationship between the sensors of an array is the mutual energy. Given two generic signals π‘₯(𝑑) and 𝑦(𝑑), the mutual energy in the interval (𝑑0,𝑑1) is given by the following definition:

πœ‰π‘₯𝑦=ξ€œπ‘‘1𝑑0π‘₯(𝑑)𝑦(𝑑)𝑑𝑑.(6) The value of the energy likely depends on the affinitie between the gas species present and the sensor active layer. Hereafter, we call energy vector (EV) the following quantity:

||πœ‰EV=1,1πœ‰1,2β€¦πœ‰1,π‘˜πœ‰2,1πœ‰2,2β€¦πœ‰2,π‘˜β€¦πœ‰π‘˜βˆ’1,π‘˜βˆ’1β€¦πœ‰π‘˜,π‘˜||,(7) where π‘˜is the number of sensors in the array, and the generic vector element πœ‰π‘₯𝑦 is defined by (7). This method was originally proposed by Martinelli [34] and later more extensively applied to gas sensors arrays by Vergara et al. [35].

3. Experimental

3.1. Sensor Fabrication

Sensing layers were deposited over an integrated four microhotplates fabricated using microsystems technology. Basically, each sensor was formed by a gas-sensitive layer, interdigitated electrodes, insulating layers and a poly-silicon heater, all stacked on a 0.2β€‰πœ‡m thick Si3N4 membrane grown by low-pressure chemical vapor deposition (LPCVD). The micro hotplate was characterized by a temperature coefficient of resistivity (TCR) equal to 6.79Γ—10βˆ’4. Heaters resistance was also used to measure the actual temperature of the sensor membranes. The power consumption was about 80 mW when working at 480∘C and the thermal response of the coated membranes was measured to be near 20 milliseconds (thermal response was estimated to be about 10  milliseconds in bare devices). The technological fabrication steps of the substrates are reported elsewhere [36, 37]. Finally, the 4-element microhotplate array was packaged on a standard TO-8 package (see Figures 3 and 4). Each chip had 4 membranes, that can be functionalized with proper sensing layers, the size of which was 450Γ—450πœ‡m2 and the electrode area was 50Γ—50πœ‡m2.

The sensor array was formed by four thick-film metal oxide layers. The following materials were used: 1% Pt-doped and 1% Pd-doped tin oxide (Sensors A and B) and 1% Pt-doped and 1% Pd-doped tungsten oxide (Sensors C and D). These were deposited using a drop coating technique. A paste was prepared dispersing the active material (SnO2 or WO3 nanopowders + 1% Pd or Pt) in glycerol. This process was repeated four times in order to obtain four different layers on the different membranes that integrated a micro-sensor array. The as-deposited films were dried in an oven at 180∘C for 2 hours. Subsequently; they were annealed in situ at 480∘C for 2 hours. The purpose of annealing was to ensure a good adherence of the active material to the sensor substrates and to burn out the organic vehicle (i.e., glycerol) used in the paste. The selection of these materials was based on previous experience for an analogous gas analysis application [38].

3.2. Experimental-Setup

An array composed of seven micro-hotplate gas-sensors was placed in a 20 mL volume air-tight test chamber to measure vapors of acetaldehyde, ethylene, ethanol and ammonia. This chamber was connected to a continuous flow system that allowed us to obtain, with high reproducibility, different concentrations of pollutant gases diluted in dry air. Mass flow controllers provided the dilution of test gases (at the desired concentrations) in dry air. The total gas flow was set to 200 mL/min and kept constant. The moisture level was kept to 10% of relative humidity (measured at 30∘CΒ±1∘C) during all the experiments.

The signals of the micro-gas sensor array were measured when the operating temperature of sensors was modulated using an optimized multisinusoidal signal. The multisinusoidal signal was generated using a written-in-house MATLAB code. A set of six modulating frequencies (optimal frequencies selected in a systematic way) was employed to synthesize the temperature-modulating multisinusoidal signal. For the sake of simplicity, the same multisinusoidal signal was applied to the heating element of every sensor within the array. The frequencies composing this signal were 12.8 mHz, 25.6 mHz, 38.5 mHz, 92.9 mHz, 339.7 mHz and 682.7 mHz. The first three frequencies are known to be more important for gas identification, while the last three frequencies are helpful both for identifying and quantifying the gases considered [39]. The range of modulating frequencies is far below the cut-off frequency of the metal oxide coated membranes, which is the inverse of their thermal response (i.e., (1/20)10βˆ’3π‘ βˆ’1=50Hz) [22].

3.3. Databases

The following compounds were tested: acetaldehyde (50 ppm), ethylene (50 ppm), ethanol (25 ppm), and ammonia (50 ppm). Five sessions of measurements were performed. Each session consisted in 12 replicas of each exposure, resulting in a total of 240 measures; samples sequence, in each session, was randomized. Among the measurement sessions, sensors were powered off. Each measurement took 312 seconds corresponding to the time of application of the multisinusoidal temperature modulation. The whole experiment was performed in four months. During the entire measurement process, before and after each measurement, both sensors and their heating elements were not powered. Finally, the sensors were powered when the analyte was already into the sensors cell.

The sampling rate of the sensor signals was set at 100 samples per second, which was far higher than the frequency range applied to the temperature modulating signal. The peak to peak amplitude of the multisinusoidal current signal applied to the heater was equal to 2 mA. It corresponds to a sensor operating temperature range between 220∘C and 450∘C. Figure 5 shows a typical transient response from a Pd-doped SnO2 gas sensor in the presence of 50 ppm of ammonia when the measurement time was set to 312 s.

3.4. Data Analysis

From the recorded signals the previously defined features called dynamic moments and energy vectors were calculated. For sake of comparison the following standard quantities were also calculated and considered in the analysis:

Steady state feature extractionΔ𝑅/𝑅0, where the Δ𝑅 is the shift of the resistance between the instant value of the measure and the initial baseline (𝑅0) according to this formula:

𝑑Δ𝑅𝑖𝑅0=π‘…ξ€·π‘‘π‘–ξ€Έβˆ’π‘…0𝑅0𝑅𝑑𝑖=155𝑗=1π‘…ξ€·π‘‘π‘–βˆ’π‘—ξ€Έ(8) with 𝑑i=0,1,2,3,…

(i)The integral of the sensor signal INT(π‘₯(𝑑𝑖))π‘˜=βˆ«π‘‘π‘–π‘‘+π‘˜π‘–π‘₯(𝑑)𝑑𝑑 with π‘˜ values considered in the data analysis are 1, 2, 3 and 5 (𝑑i=0,1,2,3,…).(ii)The Fast Fourier Transform (FFT) coefficients of the sensor signal calculated on the entire temperature modulation applied to the sensor [40];

In order to reduce the measurement time all the features, except FFT, have been calculated in the first 20 seconds of each measurement.

Partial Least Square Discriminant Analysis (PLS-DA) has been utilized as classifier to compare the performance of features. For validation purposes, the entire dataset was split in training and testing sub-dataset (144 and 96 measurements, resp.). The number of latent variables used for PLS-DA models [41] was optimized minimizing the generalization errors applying the Leave-One-Out Cross-validation (LOO-CV).

4. Results and Discussion

Collected data was analyzed in order to study the performance of the defined features in a shorter time with respect to the whole time during which the temperature modulations was applied. To this scope, only the first twenty seconds of the sensors response were considered. Since FFT requires the entire modulation period to be efficiently calculated it was not included in this analysis. Features were calculated every second in the first 20 s considering sensor signal lengths of 1 s, 2 s, 3 s, and 5 s.

The classification rates are summarized in Figure 6. The results obtained in validation and calculated considering a signal portion of 1 s are shown. All the features shown in Figure 6 achieved more than 93% of correct classification in training. Among these features, EV achieved the highest classification rate equal to 91%. Furthermore, EV provides discrimination rate higher than 80% in the first second of measurement the classification rate reaches 90% after 5 s. Common features, such as Δ𝑅/𝑅0, give the worst performance in almost all the twenty seconds. Finally, it is important to note that the classification rates of EV and DMs are quite constant over the entire measurement time. EV results are even promising for real applications, in particular considering that a short measurement time results in reduced sensor power consumption. It is important to remark that the application of dynamic feature extraction contributes to a reduction of the measurement time as it was observed since several years ago applying a multiexponential fit to the signal of SnO2 based gas sensors [42].

The performance of features such as EV, Integral and DMs are clearly dependent on the signal behavior and their results could change considering different time windows. This effect is shown in Figure 7 where a summary of the classification rate of the above features considering a time window of 2 s is shown using a box-whisker plot.

DMs’ have the highest classification rate (hMD, about 90%), but this occurs only in a particular situation determined by a particular matching of the signal with the requirements of DM. The box-plot for these descriptors is characterized by a low mean value (mMD) with respect to the hMD confirming a large variability in the first 20 s. A different behavior is observed for EV, where the distribution of the classification rate is less dispersed than that of DMs. Finally, the integral shows the same order of variability of the EV but, generally, a lower classification rate. A supplementary investigation has been done considering the performance of the features calculated using 3 s and 5 s of the sensors signal portion. In this case, the classification rate was lower than the previous cases.

A further improvement was obtained considering more EVs calculated at different times but using for each calculus a time lengths of 1 s. EVs were calculated at 3 s and 8 s. In this case the classification rate of validation reached 96% and almost 100% in training. In Figure 8 a summary of the best performance obtained in validation for each feature is shown.

The best result is obtained using a combination of two different EVs. This means that EVs did not provided totally overlapped information. These results are better appreciated considering that FFT, calculated with the whole experiment time of 312 s, provided the 86% of correct classifications.

Concerning the performance of DMs, results show that it is possible to increase the classification rate but only in a precise interval of the signal evolution and combining different DMs. This last detail reinforces the efficiency of the recognition power of EV that was almost independent from the temperature modulation applied and, on the other hand, the difficulties to determine an optimized configuration for DMs features.

EV promising results are fundamentally due to the combination in these features of the relations between couples of sensor signals putting in evidence the cross-correlation among sensors, and that this quantity is expected to be weakly sensitive to any instability source.

5. Conclusions

The stability of different kinds of features has been investigated utilizing an array of optimized multisinusoidal signal temperature modulated metal oxide gas sensors. For this measurement setup, results have evidenced that the dynamic features achieve better performance with respect to the standard feature extraction such as Δ𝑅/𝑅0, the integral or the FFT. The DMs have shown a good discrimination but the performances are strongly dependent by the considered portion of the signal evolution. Instead, the EV has shown an almost constant classification rate along the time with respect to the temperature modulation being also able to obtain more than 90% of correct classification rate in the first 5 s. This aspect is of fundamental importance for practical applications since this feature is able to accurately identify and classify different species using few seconds of the signal response independently from the position on the profile of the temperature modulation.

Moreover, further increment in sensing performance is obtained when more Evs, calculated at different time times are taken into account.

Finally conventional sensor arrays using the same type of construction technology have been thoroughly investigated and are widely utilized today in many critical applications. It remains as a standing question, however, how to maximize their performance and simultaneously minimize the time required for it. This is an important question, because sensors composing an array can convey different attributes of the process being monitored, and, when properly conditioned, can complement each other. In this frame, a more complex classification task and the utilization of quality/variety of chemo-sensing technologies using this feature extraction procedure will require more deep investigations.