Research Article

Image Geolocation Method Based on Attention Mechanism Front Loading and Feature Fusion

Table 3

Performance comparison between the proposed method and NetVLAD on the Pitts 30k dataset [81].

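| Noise filtering layer | Feature aggregation layer | PCA | Dimension | R@1 | R@5 | R@10 | R@20 |
|---|---|---|---|---|---|---|---|
| × | NetVLAD | × | 32768 | 79.45 | 90.10 | 92.77 | 95.19 |
| × | NetVLAD | √ | 512 | 77.52 | 89.41 | 92.59 | 95.39 |
| × | NetVLAD | √ | 1024 | 78.92 | 90.10 | 92.87 | 95.48 |
| × | NetVLAD | √ | 2048 | 79.55 | 90.42 | 92.97 | 95.38 |
| × | NetVLAD | √ | 4096 | 79.50 | 90.23 | 92.9 | 95.19 |
| √ | NetVLAD | × | 32768 | 80.99 | 91.09 | 93.57 | 95.48 |
| √ | NetVLAD | √ | 512 | 80.34 | 91.37 | 93.97 | 95.76 |
| √ | NetVLAD | √ | 1024 | 81.24 | 91.61 | 94.07 | 95.77 |
| √ | NetVLAD | √ | 2048 | 81.41 | 91.62 | 93.85 | 95.85 |
| √ | NetVLAD | √ | 4096 | 81.29 | 91.37 | 93.63 | 95.6 |
| √ | NetVLAD + SPP + GeM | × | 48128 | 83.67 | 92.36 | 94.16 | 95.77 |
| √ | NetVLAD + SPP + GeM | √ | 512 | 82.88 | 92.39 | 94.91 | 96.24 |
| √ | NetVLAD + SPP + GeM | √ | 1024 | 83.73 | 92.81 | 94.91 | 96.20 |
| √ | NetVLAD + SPP + GeM | √ | 2048 | 84.01 | 92.78 | 94.81 | 96.11 |
| √ | NetVLAD + SPP + GeM | √ | 4096 | 83.92 | 92.62 | 94.5 | 95.95 |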

“×” means that the operation named in the column header is not applied to the model, and “√” means that it is applied. “Dimension” denotes the dimension of the final descriptor; “R@N” denotes Recall@N.