Introduction to visa

Kang Yu

2021-04-19

Imaging Spectroscopy (also known as Hyperspectral Remote Sensing, HRS) technology and data are increasingly used in environmental sciences, and nowadays much more beyond that, thus requiring accessible data and analytical tools (especially open source) for students and scientists with a diverse background. Therefore, I come up with such a idea since i was a PhD student at the University of Cologne, inspired by the growing community of R and R users. I have been mainly working with spectral data of plants, and that is reason i use the name VISA for this package, with the aim to facilitate the use of imaging spectroscopy techniques and data for the extraction of vegetation signatures for plant stresses and biodiversity.

Future development of this tool has a long-term goal to include: i) implement the state-of-the-art applications of vegetation spectral indicators, ii) provide a platform to share vegetation spectral data to address certain questions of interest or applications in a broad context, and iii) make it compatible with more data formats and tools, such as the r package hsdar1.

Currently, visa can be installed via my GitHub repository visa, by devtools::install_github("kang-yu/visa"), and its submission to CRAN is in progress. This vignette will introduces the design and features of visa from the following aspects:

Data

The visa package intends to simplify the use and reduce the limit of data format and structure. visa uses two data formats, and a hacking use of R’s data.frame and, a S42 class specifically for visa.

Built on data.frame

Why i call it a hacking use is because a data.frame is a table organized by variables, and it is ideal to store every spectral band in as a variable. Imaging that you have thousands of columns and you have to refer to thousands of bands when using data.frame. Then, why not just store all the spectral bands, ie. the spectral matrix in a single variable, like the example data NSpec.DF.

This will be ease your coding for analysis, and you write your argument as ‘y ~ spectra’ instead of ‘y ~ band1 + band2 + band3 + …

# check the data type of `NSpec.DF`
class(NSpec.DF) 
class(NSpec.DF$spectra) 

Storing spectra matrix as a variable in data.frame

S4 class Spectra and SpectraDatabase

There are already a lot of r packages for spectral data analysis, and some of them use the S4 class, e.g. the hsdar package. visa also supports the S4 format but in a simplified version, using only five slots currently.

# check the data type of `NSpec.DB`
class(NSpec.DB) 
class(NSpec.DB@spectra) 

Storing spectra matrix as a slot of the S4 class

Notice that the small difference of accessing data in two types of data, i.e., using $ and @, respectively.

Functions

Computing correlation matrix

The first idea of writing this package was to compute the correlation matrix for the thorough analysis of correlations between, on one hand, the combinations of spectral bands, and on the other hand, the vegetation variables of interest.

Here gives the example using the cm.nsr function, which can be used for non-spectra data as well.

library(visa)
data(NSpec.DF)
x <- NSpec.DF$N # nitrogen
S <- NSpec.DF$spectra[, seq(1, ncol(NSpec.DF$spectra), 10)] # resampled to 10 nm steps
cm <- cm.nsr(S, x, cm.plot = TRUE)
#> [1] "The max value of R^2 is 0.5333"
#> [1] "i_460" "j_450"

Plotting correlation matrix

The correlation matrix plot is the plot of correlation coefficients (r/r2) by bands in x- and y-axis.

# use the output from last example
# cm <- cm.nsr(S, x)
# Plotting the correlation matrix
ggplot.cm(cm)
#> [1] "The max value of R^2 is 0.5333"
#> [1] "i_460" "j_450"

Plot of correlation matrix

More Examples and Details

The computation of SR and NSR follow the equations, e.g.:

\(SR = \lambda_i / \lambda_j\)

\(NSR = (\lambda_i - \lambda_j)/(\lambda_i + \lambda_j)\)

To know more about the NDVI, please also check on Wikipedia3.

Example data NSpec.DB

The first type is the ‘NSpec.DB’ in the default S4 class ‘Spectra’.

library(visa)
# check the data type
class(NSpec.DB) 

[1] “Spectra” attr(,“package”) [1] “visa”

# data structure
# str(NSpec.DB)
# print the first 10 columns
knitr::kable(head(NSpec.DB@spectra[,1:10]))
350 nm 351 nm 352 nm 353 nm 354 nm 355 nm 356 nm 357 nm 358 nm 359 nm
0.0107850 0.0103620 0.0105130 0.0107780 0.0105690 0.0104170 0.0104270 0.0105240 0.0104590 0.0103860
0.0105900 0.0101850 0.0101270 0.0102320 0.0101690 0.0101270 0.0101720 0.0102500 0.0101840 0.0102270
0.0094211 0.0088921 0.0091092 0.0095035 0.0093468 0.0090576 0.0089967 0.0091336 0.0090165 0.0089895
0.0104040 0.0098426 0.0098587 0.0101030 0.0100700 0.0100920 0.0100790 0.0099628 0.0097052 0.0096571
0.0104650 0.0102510 0.0101960 0.0102150 0.0101590 0.0098056 0.0097447 0.0099987 0.0099463 0.0097695
0.0123460 0.0122110 0.0121620 0.0121480 0.0120900 0.0121430 0.0120990 0.0119900 0.0121290 0.0121080

Example data NSpec.DF

The second type is a data.frame format, i.e., NSpec.DF.

# check the data type
class(NSpec.DF) 

[1] “data.frame”

# check whether it contains the same data as 'NSpec.DB'
knitr::kable(head(NSpec.DF$spectra[,1:10]))
350 nm 351 nm 352 nm 353 nm 354 nm 355 nm 356 nm 357 nm 358 nm 359 nm
0.0107850 0.0103620 0.0105130 0.0107780 0.0105690 0.0104170 0.0104270 0.0105240 0.0104590 0.0103860
0.0105900 0.0101850 0.0101270 0.0102320 0.0101690 0.0101270 0.0101720 0.0102500 0.0101840 0.0102270
0.0094211 0.0088921 0.0091092 0.0095035 0.0093468 0.0090576 0.0089967 0.0091336 0.0090165 0.0089895
0.0104040 0.0098426 0.0098587 0.0101030 0.0100700 0.0100920 0.0100790 0.0099628 0.0097052 0.0096571
0.0104650 0.0102510 0.0101960 0.0102150 0.0101590 0.0098056 0.0097447 0.0099987 0.0099463 0.0097695
0.0123460 0.0122110 0.0121620 0.0121480 0.0120900 0.0121430 0.0120990 0.0119900 0.0121290 0.0121080

Accessing data

spectra

wavelength

Compatibility

Data format conversion

as.spectra

as.spectra.data.frame

Future development

Regarding compatibility for future development, special focuses will be put on:

visa believes:

“Software probably makes knowledge gaps, but should not be due to the access to software.”


  1. Lukas W. Lehnert, Hanna Meyer, Joerg Bendix (2018). hsdar: Manage, analyse and simulate hyperspectral data in R↩︎

  2. The S4 object system http://adv-r.had.co.nz/S4.html↩︎

  3. Normalized difference vegetation index (https://en.wikipedia.org/wiki/Normalized_difference_vegetation_index)↩︎