Abstract

The surging advance in micro- and nanotechnologies allied with neural learning systems allows the realization of miniaturized yet extremely powerful multisensor systems and networks for wide application fields, for example, in measurement, instrumentation, automation, and smart environments. Time and location context is particularly relevant to sensor swarms applied for distributed measurement in industrial environment, such as, for example, fermentation tanks. Common RF solutions face limits here, which can be overcome by magnetic systems. Previously, we have developed the electronic system for an integrated data logger swarm with magnetic localization and sensor node timebase synchronization. The focus of this work is on an approach to improving both localization accuracy and flexibility by the application of artificial neural networks applied as virtual sensors and classifiers in a hybrid dedicated learning system. Including also data from an industrial brewery environment, the best investigated neural virtual sensor approach has achieved an advance in localization accuracy of a factor of 4 compared to state-of-the-art numerical methods and, thus, results in the order of less than 5 cm meeting industrial expectations on a feasible solution for the presented integrated localization system solution.

1. Introduction

Sensor networks have established themselves in scientific, industrial, and military applications. Military reconnaissance and surveillance, for example, employing distributed microphone or hydrophone arrays, can be named here. In the last decades, numerous nonmilitary, commercial applications could be observed, which incubated the creation of more and more powerful and increasingly smart and networked data loggers. The development was boosted by the soaring progress in microelectronics and MEMS technology, sensor technology, communications, and computational intelligence with a strong emphasis on artifical neural systems. This advanced the integration of wireless, autonomous, or even self-sufficient multisensor systems and sensor networks and opened, due to miniaturization and cost-effectiveness, new application fields from process control and automation to Ambient-Intelligence, for example, [1], Internet-of-Things (IoT), Cyber-Physical-Systems (CPS), Cyber-Physical-Production Systems (CPPS), Industrie4.0, and numerous biomedical tasks. The key vision for this development is the concept of Smart-Dust, introduced, for example, by the University of Berkeley [2, 3], which formulates the idea to integrate in a cubic millimeter multisensors, processing, communication, energy supply, and potentially actuators to a Smart-Sensor corresponding networks of numerous such units. The ITRS roadmap [4] clearly points out the advantageous development of microelectronics, MEMS, sensors, and packaging technology, for example, 3D integration technology, in particular on the so-called More-than-Moore direction of development [5] for Systems-in-Package (SiP) solutions, which allows the increase of information processing in the named nodes themselves. More and more complex methods of computational intelligence, neural networks, and learning systems, thus, become amenable for the realization of distributed intelligent integrated sensory systems or sensory swarms. Though the Smart-Dust vision has not completely become reality, the named advance in micro- and nanotechnologies already allows with regard to sensory diversity/bandwidth, achievable measurement uncertainty, and power dissipation minimization the realization of suitable sensor swarms for distributed measurement in industrial applications in the measurement and instrumentation field [6, 7].

One important aspect in the practical application of sensory networks or swarms with multisensory process parameter registration, for example, of temperature, pressure, humidity, conductivity, or impedance spectrum, is the logging of time and location context. A synchronized timebase, potentially addional location sensors, and apt signal processing are required for the task of context acquisition. Evaluation of distributed swarm measurements will be of value only, if time and location of the registration are available for swarm data visualization and analysis. In many application scenarios, including indoor, outdoor, or even maritine tasks, the radio signal itself and its attenuation (Received-Signal-Strength-Indicator (RSSI)) can serve as sensory source and baseline for localization.

However, in industrially relevant scenarios of closed, large-scale containers, for example, stainless-steel fermentation tanks in brewery industry shown in Figure 1, strong attenuation in the medium and reflections from container walls hamper communication itself, and, thus, also localization.

In our research, an alternative localization system has been conceived to locate the units of the sensor swarm based on magnetic technology and inspired by existing tracking systems and technology, for example, helmet tracking in military applications or instrument or pill tracking in medical applications. The developed approach and system scales to the required industrial tank sizes. It employs integrated magnetic sensors together with algorithms from computational intelligence to robustly calculate the sensor node localization from magnetic field emitting beacons. In this paper, the basic electronic system and an extension by neural networks as virtual sensors in a supervised approach for the significant improvement of the localization quality and the simplified adaptation to changed or new application scenarios are presented. In Section 2, we will outline our approach, system architecture and electronics, scenarios, data acquisition, and the synchronization approach with the corresponding classification experiments. In Section 3 virtual sensor methods and corresponding parameter settings will be given. In Section 4, before concluding, our experiments and results will be presented.

2. Magnetic Localization System

Magnetic localization systems are in widespread use since more than two decades and related publication and patent activity, for example, [8, 9], can be observed. Numerous concepts and systems solutions can be found, which achieve a localization or a tracking of one or potentially several objects. These systems can be conceived in two dual ways by either employing an potentially integrated, magnetic marker, which is located by an external stationary sensor system or sensor array, or alternatively employing a plurality of external stationary magnetic field generator units, working as beacons, which are detected by magnetic sensors within the regarded mobile sensory nodes. The first approach is commonly employed in medical tracking systems, for example, in systems of Motilis [10] or Matesy [11], which employ permanent magnets as markers, for example, in a pill size arrangement moving through the human body. Tracking takes place at rather short distance. Application to a sensor swarm faces difficulties, as the discerning of multiple markers, that is, sensor nodes in the swarm, has not been satisfactorily solved so far. Naturally, instead of deploying permanent magnets in the sensory nodes, also actively controlled magnetic field generation by coil setups or spinning magnet setups could have been considered. However, with regard to size and power budget of autonomous miniaturized systems. The second approach with active beacons and integrated magnetic sensing in the swarm nodes, thus, is preferred in most applications, for example, in systems Polhemus [12] and Ascension [13] as well as medical applications of instrument or endoscopic task tracking. Also, this variant is more fortunate for and compatible with the development of Smart-Pills, for example, in the Vector-project [14]. Basically, two beacon control methods can be distinguished, employing AC-fields (e.g., systems of Polhemus [12]) or DC fields (e.g., systems of Ascension [13]), which in the latter case means that the beacons work in a time multiplexing scheme; that is, each beacon emits a field at a defined time frame, which allows the sensor and the the localization algorithms to uniquely identify the source. However, this feature also gives rise to the synchrony of the beacon control and sensor swarm units time bases, that is, requiring an approach for maintaining synchronization.

AC-field generation additionally, allows the use of frequency multiplexing, that is, the simultaneous activity of several beacons at different frequencies. However, this requires more effort on the receiver side. Also, quasi-DC approaches are less vulnerable to the presence of magnetic materials and eddy-current induced fields.

The various system solutions, for example, for indoor localization [1518] or underground animal tracking [19] and so forth, have employed magnetic sensors from the spectrum of available principles, for example, SQUIDs, miniaturized coils, Hall elements, for example, [20], field plates, or magnetoresistive sensors with various operating principles, summarized as XMR sensors. Compass sensors, for example, for the mobile phone market, or automotive tasks have incubated the conception of new sensory units, for example, with 9 axes, including acceleration, Gyros, and magnetic sensors in 3D arrangement [21, 22]. These rather novel chips on the market will support the compact implementations of the system and approach presented in this paper even better. For reasons of sensitivity, that is, due to weak beacon fields or large distances between sensor and beacon, Anisotrope-Magneto-Resistive (AMR) sensors are very attractive as they feature high sensitivity and offer favorite linearity properties and temperature range, as well as small size and low-power operation.

2.1. Pursued Magnetic Localization Approach

In our research, based on the outlined state-of-the-art and the particular constraints of the regarded application domain, a magnetic localization system, including synchronization capability, for a sensor swarm has been developed. The system employs the concept of quasi-DC field generation by stationary field generating beacons, realized by dedicated coils and suitable computer controlled power electronics. Further, AMR sensors AFF755B [23] of Sensitec have been employed in 3 axial 3D arrangement, which has been integrated by various packaging technologies [24], for example, standard PCB, or Active-Multi-Layer (AML) [25].

Figure 2 shows the block diagram of our magnetic localization system, which has been implemented and employed in four different demonstrators.

In our 3D-AMR magnetic sensor, each AMR channel is connected to an AD 8290 instrumentation amplifier, which provides a gain of 50, control of the bridge, and power-down of bridge and amplifier under control of the microcontroller (atmel XMEGA256A3). Three such channels serve for , , and registration and sampling takes place in a ternary scheme of the exciting magnetic field, computing the mean of positive and negative phase voltages, and , in (1). Only the magnitude of the flux density is required (see (2) and (3)): Here, , the bridge supply is , and the sensitivity value is taken from the Sensitec datasheet with the typical value of 11.7, which advocates the calibration of for each sensor instance. In our work, we will comprise this step by neural network learning. In (4), Bio-Savarts-Law is employed to establish a relationship between the flux density values obtained from measurement and the aspired distances by resolving for the distance in (5): Here, is the permeability in vacuum, is the number of windings of the coils, is the DC current driving the coil, and is the coil radius. As this simple model only works fine, if the sensor is located on the principle coil axis and the sensor instance sensitivity is known, the estimated distances can be out of scale, which is fixed in a first processing step by a heuristic global correction or scaling factor added in (6). The computation of is explained in Section 2.3.2(e): Thus, RSSI equivalent distance values are obtained for the case of the magnetic localization system and can be treated with the standard methods given in Section 2.3.

2.2. Demonstrator and Integration Issues

In order to validate and optimize the localization system under changing environmental condition and different scale, three experimental setups or demonstrators were conceived.

The first demonstrator is located in our ISE (integrated sensory systems) lab. Six coils are placed in a cylindrical fashion similar to the shape of a steel tank. The six coils are separated into two rings of three coils each on two levels, rotated by an angle of . An --movable sledge with attached scales serves for ground truth position acquisition. Relevant technical data is surveyed in Table 1. In the following, this setup is referred to as ISE demonstrator.

The brewery demonstrator (see Figure 3) is a real brewery container in smaller scale. It was temporary set up at Technikum Warstein to validate the localization system in the industrial environment where parasitic magnetic fields of, for example, large pumps or other heavy machinery might cause problems. The dimensions are bigger than those of the ISE demonstrator, so in total 12 coils separated into 3 rings of 4 coils each were installed. The coils itself were also larger in diameter. The coil positions were appropriately determined as a mandatory baseline for localization. Relevant technical details are summarized in Table 1. A suiting reference system for ground truth sensor position determination was established inside the tank, which for obvious reasons was not filled with liquid in these experiments. Also, acquiring a larger number of points than the registered 30 locations without proper climbing support was not easy, especially within the limited duration of availability of the experimental setup for this work.

Here, a lower limit for the achievable localization error is in particular given by the uncertainty of the actual sensor center determination.

Additionally, the obtained data matches well with data acquired from the target industrial process. The reason for this is that the experimental tank was placed in close proximity to the production environment including fermentation tanks, so it can be rightfully assumed that it was subject to the same sources of influence like pumps, heaters, coolers, and so forth. Due to the random nature of the switching activity of these devices with regard to the coil switching of our localization scheme, as well as enforced confidentiality on process details, a comprehensive investigation is very difficult. But, due to our ternary switching of the coil fields and ensuing differential processing in the sensory nodes, all sources of influence quasiconstant in each coil’s time slot will be significantly suppressed or even canceled out.

The last demonstrator is a small, down-scaled, and mobile version of our localization system and was specially designed and build for the presentation at Hannover Messe Industrie (HMI) 2013. Figure 4 shows the demonstrator at our booth with the electronics on the table and a monitor showing a MATLAB GUI in the background. The tank is modeled by a tube of acrylic glass. Also the holding construction of the six coils is made of this material. See Table 1 for comparison to the other demonstrators. This mobile demonstrator will be called HMI demonstrator in the following.

The coil placement, similar to the general problem of anchor node placement in wireless sensor networks [26, 27], was an important consideration in our work. While a symmetrical distribution of the coils is not essential, equidistant placement of the coils helps in making sure that in every possible sensor position in the tank there is a minimum of three coils that generate fields of sufficient strength for proper localization. Also significant asymmetrical positioning of the coils will aggravate the angle problem discussed in Section 2.1 and will proportionally degrade the localization accuracy. We investigated 3 different topologies parallel planes, cylindrical, and ellipsoidal placement of coils. The HMI demonstrator has a very unfortunate aspect ratio and scale, which could be partly compensated by the most promising ellipsoidal coil arrangement.

The overall goal of the described research is the achievement of a compact 3D integrated data logger for a sensor swarm in distributed measurements. In the first development step, standard printed-circuit board (PCB) version of 3D-AMR-sensor (see Figure 5 (left)) [28] and the complete datalogger (see, e.g., Figure 4) have been conceived. The 3D-AMR-sensor has already seen implementation in various 3D integration technologies [24]. Figure 5 (right) shows the example of the Active-Multilayer-Technology (AML) first version with a first-cut design size of . The bulky connectors visible in Figure 5 are required only for the modular development systems and, of course, will be obsolete for the integrated target system. Further substantial size reduction by 3D layout optimization can be expected. AML technology is one favorable option to encapsulate and integrate the complete aspired data logger.

2.3. Standard Localization Algorithms
2.3.1. Standard Algorithms in Wireless Sensor Systems

In the majority of wireless sensor systems, RF-communication signals serve as the information source for the localization approaches. The RSSI, again, is the most common indicator to estimate the distance between a sender and receiver pair in the network, for example, a stationary beacon and a sensor node. Based on four or more distance estimates, quite similar to data visualization approaches, triangulation, multilateration [2931], or multidimensional scaling (MDS) methods, in particular the nonlinear Sammon’s mapping (NLM), [32] are employed in standard approaches for sensor location or coordinates estimation. Sammon’s iterative mapping is computationally demanding and requires a postprocessing step, denoted as conformal mapping, to compute the actual absolute coordinates from the relative information. Further, employed gradient descent technique might not converge to the best solution. The computationally fortunate multilateration, which is based on least squares optimization or Moore-Penrose pseudoinverse, or the standard NLM both work fine for the magnetic system pursued here and the localization results of these unsupervised state-of-the art methods will be compared to those of the newly proposed ones.

2.3.2. Enhanced Algorithms for Wireless Sensor Systems

The issues of high computational complexity , potential local optimum solution, and required post processing for absolute coordinate obtainment motivated the recent development of an advanced approach. In the particular scenario faced here, mobile sensors individually have to estimate their position with regard to stationary beacons of known number and position. This can be tackled well with the NLM recall (NLMR) variant [33], reducing the computational complexity to and immediately returns absolute coordinates. This approach along with advanced optimization methods for achievement of better solution quality has been introduced in [34] and will also serve, in the variations briefly outlined below for a self-contained presentation, for comparison and extension of the newly proposed localization methods.

(a) NLMR. The NLMR for localization as introduced in [34] has the following simplified cost function with as step or iteration variable: where where is the distance between the currently mapped recall datum and the previously mapped training data samples in the high dimensional space. The distances in the new space can be found using standard or advanced optimization methods.

(b) Gradient Descent. The gradient descent technique for NLMR is from [33, 34]. The equations are with where is the new position, MF is the magic factor which reduces with time, is the current position and is the cost function at the current position. MF is initialized to 1. We keep reducing the MF by 10% everytime we find a better fitness.

(c) NLMR-Simulated Annealing. We use the basic simulated annealing [34] where we start with a relatively high temperature () which is reduced () over the number of cycles and reduce the chances of choosing a bad solution as the temperature decreases (accept any solution if ). The new solutions are found by a Markov chain shown in (9) with a random between and where (energy factor) reduces over time. The algorithm runs for 1000 iterations to get the best solutions. The number of iterations required was found heuristically.

(d) NLMR: Particle Swarm Optimization. Standard particle swarm optimization described in [35] is used with and without inertia. Having no inertia helped in faster convergence of the algorithm. 300 particles were used with 150 generations to find the best results in the experiments.

(e) Correction Factor . A scale error in distance estimation became obvious from measurements that is introduced by the model (see (3) to (5)). This reduces the accuracy of the algorithms. Multilateration is mostly robust to this scale error because it uses only the differences in distance and not absolute distance, while the other methods require a correction. The correction factor , shown in (6), can be found by determining the ratio of the model estimated and actual distances. We investigated suitable -settings by computing this ratio for all samples of each data set and found that there was only 1% variation in the result. So, in the simplest case just taking one representative sample to compute the correction factor already significantly improves the results. More sophisticated search strategies to find the correction factor, for example, using unsupervised hyper-heuristics, will be considered for future work.

2.4. Data Acquisition

The data sets acquired from the three described demonstrators for the ensuing experiments have been collected by either a wired standard or a wireless proprietary measurement system. The wired one is a Data Translation DT9816 data acquisition board (DAQ), which is controlled by MATLAB. The 3D-AMR-sensor module is directly wired to the DAQ, and the amplified sensor voltages are measured by analog input ports with 16-bit ADCs and are immediately available in MATLAB for signal processing and localization computation. The wireless system (see also Figure 2) corresponds to the target architecture of the final data logger, which in the first step has been implemented as a modular development PCB system. This development board (see Figure 4) features process sensors and a radio module for host PC via a gateway communication, for example, for configuration and measured process data transfer. For conversion of the 3D-AMR-sensor voltages the 12-bit ADC of the C atmel XMEGA 256A3 is used in time-multiplex. Table 2 gives an overview of the three representative data sets chosen as the baseline for the following investigations. The ISEL data, in particular, serves with a 13 13 equidistant spatial sampling with a a 10 cm pitch in a plane for the elucidation of the spatial localization error distribution. ISEL was recorded two times with two different sensor instances, which will be denoted as and in the following.

Figure 7 shows raw data from the ISEL data sets. The noise level is quite substantial in comparison to the actual signal. High frequent noise is alleviated by multiple sampling of each DC plateau, for example, 10,000 times, and computing the plateau mean. This approach has been chosen instead of a low pass filter because the edges of the magnetic DC plateaus also serve as synchronization signals and thus have to be as steep as possible. Other sources of error, for example, stationary magnetic fields, as the earth magnetic field, can be canceled out by either standard AMR-sensor flipping or the ternary coil switching introduced in this work, which is more efficient from the point of view of energy conservation in the sensory node [28].

The substantially lower resolution of the C ADC required a more sophisticated read-out approach to avoiding loss of distance resolution. A zooming technique was applied that by differential measurement, offset autozeroing, and scaling to full scale makes maximum use of the C ADC 12-bit resolution [24]. Thus, competitive localization results to the DAQ could be achieved on the integrated data logger target platform, which was employed to acquire HMI data set.

2.5. Synchronization Issues

The introduced magnetic localization concept and system crucially depends on the knowledge of the timing of each coil’s activation in the respective autonomous wireless sensor node. This requires a synchronization between the clock in the coil switching unit and the clock in each sensor node. As timebases commonly show a significant uncertainty, in particular, when they are expected to be small, cheap, and low-power, repeated synchronization is required. In our case, the tolerable or recoverable deviation limit is determined by 50% of the duration of a single coil’s switching cycle. In wired versions, the synchronization information easily can be made available by an extra trigger line. Also, in RF-based wireless sensor networks synchronization can be achieved by communications. However, in the given container scenario, deficiencies of RF hamper data communication in general and localization as well as synchronization in particular. A very straightforward idea is now to derive the coveted synchronization information also from the emitted magnetic field. Indeed, this has been invented already in [8], however, with the sensor, denoted as sync pick-up coil, stationarily located very close to the emitting coil and attached by long wires to the sensor itself. A lock-in amplifier is used for the synchronization step there.

The industrial scenario investigated in our case does not allow such a fortunate arrangement. The magnetic sensor employed for localization has to be employed also to measure the data for synchronization and, thus, is remote and at varying distances and orientations with regard to each of the coils. This scenario aggravation requires additional engineering effort. A resource hungry sampling of a sufficient time window around the expected first rising edge in the magnetic field has to be carried out. In the very first-cut solution, a heuristic threshold detector had been conceived in our previous work, which detects the magnetic field rising edge of a coil being switched on. Based on the difference of the detected edge and timestamp to the expected timestamp the local wireless autonomous sensor node’s clock will be cyclically readjusted. The underlying technical problem of finding an edge or pulse in substantially noisy electrical or magnetic field data has been visited before in communications, networking, and, most important, magnetic head data reading in mass storage, for example, in [36]. The task mentioned last comes closest to our interest and activities. In the work presented here, we investigate the SVM classifier (SVM-C) technique as trainable edge detector. It is applied with the parameter settings of unnormalised data obtained from the learning task of 10,000 and kernel function is RBF with . The input feature space is represented by a 2,000 samples wide sliding window at an increment of 250 samples and the output of the classifier represents the two classes “edge” and “no edge.” The processing structure of SVM based edge detection system is illustrated in Figure 9. The experimental data for the synchronization investigations has been extracted from the wireless sensor node prototype in the HMI demonstrator in continuous sampling mode; that is, whole localization cycles were sampled, whereas in contrast to this, for localization only parts of the coil switched-on plateaus have to be sampled. Three recorded raw data sets (“syncraw1,” “syncraw2,” and “syncraw3”) of localization cycle, each contains 262,144 samples of acquired ADC 12-bit values, were used in the experiment. Each raw data set contains 33 different “edge” shapes and levels and 1,028 “no edge” events, including some spurious switching activities from other sources in the environment that superpose like crosstalk. These samples have been extracted from several localization cycles and labeled by a human supervisor. Figure 10 shows in the top strip the sampled data of one localization cycle. In the learning phase, “syncraw1” was split into two parts by hold-out sampling method resulting in two subdata sets with similar class distribution and data size in order to generate the classifier with optimum parameters. The remaining two raw data sets (“syncraw2,” “syncraw3”) were employed in the testing phase to analyze performance of trained classifier. This gives 66 examples in “edge” class and 2,056 in “no edge” class. The results are shown in Table 3. The overall classification rates of SVM-C and the heuristic method are 98.822% and 97.88%, respectively. But these results look a bit too optimistic, as just the “no edge” events are ruled out well, while the “edge” events still lack a comprehensive number of correct classifications or detections, which means that synchronization cycles occasionally might be missed. However, this is not a major problem, as the system does not need synchronization for each localization cycle. Nevertheless, improvement of this classification subsystem is aspired and is underway.

Figure 10 shows an excerpt from “syncraw2” and “syncraw3” data of the length of one localization cycle in the upper strip and the edge detections of SVM-C versus heuristic method as well as the coil switching control signal time points in the lower strip. The visible constant lag between coil switching control signal and the observed actual edge locations is due to the delay in the currently used power electronics for coil driving. Obviously, the SVM-C solution in the given straight form can already be dealing with noise, spurious switching activities, or crosstalk as well as coils in various distances, while the heuristics fails to do so in a significantly higher number of cases.

Future work has to tackle performance increase along with effort reduction with regard to energy consumption, for example, reducing sampling rate and/or window size. The number of support vectors currently employed is currently computed as 907 with 2,000 features or dimensions each. Possible benefits of feature computation and sequential approaches [36] as well as larger data sets should be regarded for a lean and efficient embedded implementation in follow-up work.

3. Neural Virtual Sensors

Virtual sensors are an established engineering concept to obtain the equivalent of sensory registration that is not directly amenable to measurement either due to lack of physical transduction principle or due to too expensive available physical transduction principle. A well known example of the latter case is knock-detection in combustion engines, where available but prohibitively expensive pressure sensing is replaced by a feasible acoustical sensing principle [37]. The implied, often nonlinear, mapping task can be well implemented by suitable artificial neural networks, such as, for example, Multi-Layer-Perceptron with Backpropagation learning (MLP), Fahlman’s Cascade Correlation (CC) network, Radial-Basis-Function (RBF) networks, or Support-Vector-Regression (SVR) networks [37, 38].

3.1. Motivation

In this paper, the most promising neural network candidates, for example, RBF and SVR networks, are investigated as neural virtual sensors to improve localization quality. The basic idea of the localization process including standard method from Section 2 and two different enhancing approaches with neural virtual sensors are illustrated in Figure 11. Two main lines of investigation with the supervised neural virtual sensor approach are depicted by two branches in the figure. The first one employs the model estimated distances as input variables and remaps these to new corrected distances, followed by the standard localization algorithms of Section 2 for coordinate calculation. This method, which requires the actual coil and sensor positions for ground truth distance calculation, will be denoted as the distance-to-distance (D2D) approach. The second one directly maps the acquired sensor voltages to the sensor coordinates or position, completely omitting any model as well as omitting standard localization algorithms. This will be denoted as as the voltage-to-coordinates (V2C) approach. In both cases, representative training data must be provided for the supervised mapping generation in the neural virtual sensors.

The motivation of the proposed approach and its two variations comes from the well known weakness of distance estimation as expressed in (5) and (6). The employed model assumes the sensor to be situated on the principal axis of the respective coil, an assumption that is rarely met in actual sensor locations in container volumes. This implies that the stronger the sensor position deviates from the principal axis of the regarded coil, the larger the resulting error of the estimated distance from the sensor to the corresponding coil will be. Figure 12 illustrates this effect for one -plane of the ISE demonstrator. The error in the center is quite small because the sensor comes closest to the principal coil axes due to the cylindrical arrangement.

The effect, underlying the illustration in Figure 12, is well known and algorithmic correction schemes have long been suggested [8, 9]. The advantage of the suggested supervised learning approach is that also a calibration of the localization system with regard to instance specifics is achieved. In the referred to patents, also the straight estimation of the sensor location from magnetic sensor readings has been investigated by look-up-table (LUT) mechanisms. The advantages of RBF or SVR approaches with regard to LUT in size, generalization, and so forth are well known and obvious.

3.2. RBF Networks

Regarded RBF networks and tool implementations, in particular, differ in determination mechanism and size of the hidden layer and choice of the employed kernel function, for example, the Gaussian function where is the index of the hidden layer, is the center of the corresponding basis function, and is the spread which determines the sensitivity of the neuron. The output layer then performs a linear transformation of the hidden neurons activations to the target output values. It is calculated as with and being the weights. The centers are learned form the training set and the weights are optimized while training [39, 40]. In this work, the implementation from MATLAB with the parameters spread and performance goal is employed. A more resource efficient version of the RBF is Platt’s Resource-Allocating (RAN) Network for Function Interpolation [41]. RAN allows the growth of the hidden layer from scratch and spread of every kernel function to be adjusted during training [41] and can be for future leaner realizations.

3.3. SVM Regression

Support vector regression (SVR) [42] is an extension of the well established Support-Vector-Machines (SVMs) in order to solve the regression problem of learning and predicting continuous domain data. SVR generates models from the training set that perform with best fit in a linear function and result with a minimum deviation in the loss function. Using -insensitive loss function to reduce the error to zero for all points that are smaller than , in some training points, however, this error is beyond ; to deal with unfeasible constraints, the slack variable is introduced in the optimization problem. The optimization problem of -insensitive support vector regression (-SVR) [42] can be formulated as where determines the trade-off between the model complexity and the tolerance of the deviations larger than . The regression function is given by transforming the problem in (13) into its dual problem subject to and where is the number of support vectors (s) and and are Lagrangian multipliers. The kernel function can be chosen as radial basis function (RBF). Applying the so-called kernel trick allows tackling of a nonlinear regression problem with linear estimation by mapping the data set into a higher dimensional space. The RBF kernel function is computed as The optimum generalization performance of SVR is based on the setting of model parameters which is usually assigned as level of typical noise in the training data, as well as parameter , and the kernel parameter . For finding a convergence point of the optimum SVR prediction performance, a grid-search method is commonly suggested [43], as independent characteristics to prediction model of and .

4. Experiments and Results

The data sets introduced in Section 2.4 will serve now for experimental validation according to the outline in Figure 11 of the proposed methods. For this aim, the data sets have to be sampled to generate appropriate training and test sets for the supervised learning of the neural virtual sensors. With regard to the moderate but sufficient available data size, the hold-out approach was adopted. Table 4 summarizes the selected training sets. The residual data of each demonstrator are saved as test sets.

For the data set the measured points are orthogonally located in one -plane which can be seen in Figure 12. The training data contains 25 input-target pairs which are marked by the filled circles in the corresponding following error maps (Figures 14 and 16). The BREW data set positions are spatially less regularly distributed (see Figure 8). Every position with even index is used for training and the positions with odd index are used for test set, resulting in a training data set size of 165 trials and a test data set size of 160 trials. For the HMI data set, there are 44 different positions, whereas at each height (-position) 4 different --positions where acquired. This results in 11 -planes of 4 --positions each. The training data set is composed of 6 -planes and the test set contains the remaining 5 -planes, whereby test and train -planes alternate.

For D2D remapping networks, which correspond to path 2 in Figure 11, two different network architectures with suitable parameter setting ranges have been investigated, based on a standard MATLAB implementation. The varied parameters are the spread () and the performance goal, which is defined as the mean squared error of the training data. The architecture examined first is a network with input and output layer size equal to the number of coils. With being the number of coils and being the number of hidden layer neurons, the network architecture can be referred as . The second architecture consists of individually trained networks of topology. The number of networks grows linearly with the number of coils and can be more greedy with regard to resources, but hidden layers can be individually grown, with some similarity to RAN [41], and convergence commonly is easier. This architecture will be denoted as in the following.

For V2C mapping the same approach will be pursued. However, in the case of the multinetwork architecture only three coordinates have to be generated, independent of the number of coils. So the architecture for V2C mapping and one single net is and for multiple networks it is , obviously alleviating resource issues and the training process. The V2C approach is illustrated by path 3 in Figure 11. All the presented results are achieved using the multiple network architectures for both D2D and V2C.

For determining an optimum RBF parameter set, a basic sensitivity analysis has been carried out with regard to mean localization error minimization and generalization maximization. The investigated RBF parameters are the performance goal and the spread. First, the performance goal is set to fixed values of either 1, 0.1, or 0.01 to limit the effort to a one-dimensional search. For these three different settings the spread is swept. With this approach a local, suitable optimum combination of performance goal and spread quality can be achieved, which returns a minimized distance error and hence localization error. Figure 13 shows one example of a spread sweep for the BREW data set and D2D remapping. The localization error is computed for either the entire data set (training + test data set) and for the test data set. The optimum spread settings for those two data sets and analysis runs are not identical. Currently, the RBF spread which performs best for the test set is chosen. This approach can be applied for all network architectures for D2D remapping and for V2C, except for D2D with multiple network architecture case. The spread is swept for each network individually to make sure that an optimum spread can be found for each coil. If the criteria would be the localization error, there would be no way to extract the best spreads because the multilateration performs a transformation from an M-dimensional input to the 3-dimensional output. So, in case of D2D remapping with architecture the criteria are the distance error which can be calculated before computing multilateration. For each coil, there is a specific RBF spread which results in a minimum distance error.

SVR is employed as the second method in the entire experiments with identical train and test data sets to RBF part of the work. Here the LIBSVM [44] library was implemented on MATLAB platform. Input and supervised learning data for D2D and V2C investigations were identical to the RBF case, too. Applying a grid search method, to cover a wide spectrum of parameter space in searching, model parameters and are determined in the range of [1, 100,000] and [0.1, 100], respectively. Parameter is usually defined to the level of typical noise in the training data. In the training phase, the pair of parameters and delivering the minimal mean square error of the model validation process will be selected to generate the prediction model. The particular setting values of for the ISEL1, BREW, and HMI data are 0.03, 0.01, and 0.04 for D2D and 0.01, 0.01, and 0.03 for V2C, respectively.

The outlined experiments are conducted for each data set with RBF and SVM, each performing D2D and V2C mapping. Each best performing network is trained and recalled at least 3 times to make sure that random initialization effects do not affect the results. The results are presented in the following two subsections.

To put the upcoming results and improvements into perspective, in addition to standard multilateration, we have applied the advanced methods from Section 2.3 to all three raw, -corrected distance data sets (see (6)). The achieved localization quality is shown in Table 5, which shows substantial improvements to multilateration for all methods, but, in particular, for the PSO based method.

4.1. Distance to Distance

The data set has a mean localization error of 22.80 cm and a maximum localization error of 41.27 cm applying standard multilateration. By setting the coil distance scale factor to its optimum value of 1.35, the mean error can be reduced to 3.60 cm and the maximum error is reduced to 15.03 cm. The D2D remapping approach applied to the data set leads to a further improvement. The error map in Figure 14 shows that the maximum localization error is reduced by a factor of 8 compared to the initial results of Figure 12 which are achieved without any scaling factor or neural virtual sensor. The mean error is reduced by a factor of 21 to just 1.05 cm for the test data set.

Table 6 summarizes the results for RBF and SVR in D2D mapping of data. The two networks are compared side by side for each of the data sets.

Without D2D remapping, the mean localization error for the BREW data set is 13.18 cm and the maximum error is 135.66 cm. By comparing the mean localization errors of both approaches for the test sets from Table 6, it can be seen that the RBF generalizes better than the SVR.

The mean localization without D2D remapping for the HMI data set is 11.75 cm and maximum error is 121.61 cm. Compared to the actual demonstrator dimensions (see Table 1) which is the smallest of all three demonstrators the initial error for the standard method (multilateration without D2D remapping) is very high. This is due to the use of the built-in ADC of the C which has a resolution of 12 bit compared to 16 bit of the DAQ and a different sensor which is less sensitive than the one used with the first two data sets. Both methods improve the localization result. See Table 6 for more details.

To better assess these results and improvements, in addition to standard multilateration, we have applied the advanced methods from Section 2.3 to all three -corrected, D2D remapped distance data sets. This has been done for RBF and SVR for complete data sets as well as test data only (Figure 15). The achieved localization quality is given in Table 7, which again shows substantial improvements both to the results before D2D remapping as well as to standard multilateration application for all methods. This has been summarized in Figure 15. As before, the PSO based method is taking the lead in result quality.

4.2. Voltage to Coordinate

The results for V2C mapping approach are found to be comparable in terms of localization error to the previous approach. Figure 16 shows the error map for RBF-V2C applied to data set. The mean localization error for data set is higher than the one of D2D followed by standard multilateration but still is in an acceptable range of just 2.14 cm, which is in the order of the current sensor’s dimensions and, thus, sufficient for the regarded application. Table 8 summarizes and compares RBF and SVR for V2C in the same way as previously provided for the D2D investigations. The SVR approach for data performs superior to the RBF approach, but both approaches lag behind all previously obtained D2D results.

For the BREW data set the mean localization error for V2C and RBF is nearly twice the one of D2D (RBF) with ML. The maximum error unfortunately is also increased. Nevertheless,one advantage of the V2C approach is the significantly smaller number of neurons of just 27, while D2D (RBF) and ML required 333 neurons. A trade-off of result quality and computational resource can be considered. The SVR approach for BREW data performs superior to RBF approach but also lags behind D2D results.

For the HMI data set, an increase in mean localization error can also be observed but is not so expressed as observed for the first two regarded data sets. Comparing the results from Table 8 with those from Table 6 results do not differ significantly, except for the mean localization error for the test data set where D2D (RBF) and ML perform 0.93 cm better than V2C. However, the SVR V2C approach outperforms RBF-based methods and D2D approaches in general for HMI test data.

4.3. Discussion

The overall results given in Tables 6 and 8 show that all proposed methods are feasible and provide substantial result improvements with regard to raw model output data and standard multilateration. The methods from Section 2.3 are already powerful and completely unsupervised. However, they have to rely on the model output and offer no calibration options. The D2D extension, in particular when combined with the methods from Section 2.3, offers calibration options and remarkable result improvement at substantial computational cost, dependence on the model for distance calculation, and the necessity to obtain representative supervised data for each application scenario. These methods are best sited for host-based, postmeasurement localization computation.

The V2C approach, though it did not give top performance in most of the investigations, still is appealing, as it is not depending on a model and is computationally much less demanding, because V2C is linear in complexity with regard to the number of coils and requires smaller network sizes (see first row of Table 6 and of Table 8, resp.). Thus, it is better suited for on-line node-based localization computation and offers calibration options with the same need met in D2D of obtaining representative supervised data for each application scenario. Also, sometimes mediocre results of V2C are partly due to lack of rotation invariance with regard of the rotation of the sensor node itself. Either sufficient rotated examples have to be provided or an invariance transform has to be added in future improvements.

Clearly, a trade-off in result quality and resource demand is offered by the different proposed methods. This is illustrated in Figures 17 and 18 by putting the performance and required cost for both methods D2D and V2C side by side for RBF and SVR, respectively. This looks on first sight too much in favor of D2D, as existing overhead of computing the distances from magnetic measurements and computing coordinates from transformed distances is not yet included in the given effort calculation. V2C approach does not need these steps at all. The final choice of method depends on the applications requirements on localization accuracy, allowable resource expenses, and the potential need for node-based localization computation during measurement.

5. Conclusion

This paper presented our work on a artificial neural network based enhancement of a dedicated magnetic localization system for a wireless integrated autonomous sensor swarm for distributed process parameter measurement in industrial environment, such as, for example, large scale stainless containers or fermentation tanks. The concept has no limitation on the sensor swarm size. The focus of our work was on the improvement of the initially achieved mediocre localization accuracy of the basic electronic system by means of a learning system based on neural virtual sensors adapting to the regarded scenario and system instance. RBF and SVR techniques were investigated in D2D, along with standard and advanced distance to coordinate computation and V2C approaches for three different demonstrators from lab to industrial scale.

The supervised learning approach achieved significant improvements, even for uncalibrated sensors and measurement set-up in all investigated configurations. The most fortunate method combination of D2D by SVR followed by PSO-based distance to coordinate computation returned a factor large than 4 of improvement for the BREW data from industrial scenario. The mean error of about 3 cm is less than the size of the currently employed sensor node, bringing the localization results well in the order of the expectation met, for example, in brewery industry as well as in challenging smart-environment applications. However, the presented learning system could also be abstracted to the benefit of nonmagnetic, that is, RF-based, localization.

The mandatory synchronization for the pursued magnetic localization approach motivated the extension of the described approach by a learning SVM-based edge detector for coil switching time detection and sensor clock correction. Simulations on data from the HMI demonstrator showed the viability of the approach and, thus, the overall system.

Future work will improve the system with regard to artificial neural network size reduction, robustness issues, lean synchronization techniques, swarm visualization, self-x mechanisms, improved coil-switching power electronics, and 3D integration.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The authors would like to gratefully acknowledge the support of the German BMBF, mst-AVS, Project PAC4PT-ROSIG Grant no. 16SV3604 for funding the baseline of this follow-up work and the contributions of L. Rao to the first-cut sensor node SW and of S. Carrella for collecting the raw data for the BREW data set in Warsteiner brewery together with D. Groben.