Separating spectral mixtures in hyperspectral image data using independent component analysis: validation with oral cancer tissue sections

(1)

(2)

Separating spectral mixtures in hyperspectral image data using independent component analysis:

validation with oral cancer tissue sections

Jeng-Ren Duann,

a,b,c

Chia-Ing Jan,

d,e,f

Mang Ou-Yang,

g

Chia-Yi Lin,

b,h

Jen-Feng Mo,

b

Yung-Jiun Lin,

b

Ming-Hsui Tsai,

h,i

and Jin-Chern Chiou

b,g

a

China Medical University, Graduate Institute of Clinical Medical Science, Taichung, Taiwan 404

China Medical University, Biomedical Engineering Research and Development Center, Taichung, Taiwan 404

c

University of California San Diego, Institute for Neural Computation, La Jolla 92093

d

National Yang-Ming University, Department of Dentistry, Taipei, Taiwan 112, New Zealand

e

China Medical University and Hospital, Department of Pathology, Taichung, Taiwan 404

f

China Medical University and Beigang Hospital, Department of Pathology, Yunlin, Taiwan 651

g

National Chiao Tung University, Department of Electrical Engineering, Hsinchu, Taiwan 300

h

Department of Otolaryngology, China Medical University, Taichung, Taiwan 404

i

China Medical University Hospital, Department of Otorhinolaryngology, Taichung, Taiwan 404

b

Abstract. Recently, hyperspectral imaging (HSI) systems, which can provide 100 or more wavelengths of emission autofluorescence measures, have been used to delineate more complete spectral patterns associated with certain molecules relevant to cancerization. Such a spectral fingerprint may reliably correspond to a certain type of mol- ecule and thus can be treated as a biomarker for the presence of that molecule. However, the outcomes of HSI systems can be a complex mixture of characteristic spectra of a variety of molecules as well as optical interferences

due to reflection, scattering, and refraction. As a result, the mixed nature of raw HSI data might obscure the extrac- tion of consistent spectral fingerprints. Here we present the extraction of the characteristic spectra associated with keratinized tissues from the HSI data of tissue sections from 30 oral cancer patients (31 tissue samples in total), excited at two different wavelength ranges (330 to 385 and 470 to 490 nm), using independent and principal com- ponent analysis (ICA and PCA) methods. The results showed that for both excitation wavelength ranges, ICA was able to resolve much more reliable spectral fingerprints associated with the keratinized tissues for all the oral cancer

tissue sections with significantly higher mean correlation coefficients as compared to PCA (p < 0.001). © The Authors.

Keywords: oral cancer; hyperspectrum; autofluorescence; independent component analysis; principal component analysis;

keratinization.

Paper 130440R received Jun. 26, 2013; revised manuscript received Oct. 16, 2013; accepted for publication Nov. 4, 2013; published online Dec. 16, 2013.

1 Introduction

In the past 10 years, clinical studies have shown that multispec- tral-based imaging technologies have outperformed the gold standard white light imaging method in terms of early cancer detection specificities and sensitivities.

1–4

However, such multi- spectral methods have limited themselves by comparing the differences in the amplitudes of two to three emission wave- lengths in the known ranges associated with certain fluoro- phores, such as nicotinamide adenine dinucleotide plus hydrogen (NADH) and flavin adenine dinucleotide (FAD), col- lagen, oxy- and deoxy-hemoglobin, etc.

5–7

As a result, research- ers have expanded the wavelength range to more than 100 spectral channels for each image pixel using a so-called hyper- spectral technique.

8–12

The hyperspectral image has been reported to be useful not only for aiding cancer detection for diagnostic purposes, but also for determining the tumor margin, evaluating the tumor base after tumor resection, and surveying

Address all correspondence to: Jin-Chern Chiou, China Medical University, Biomedical Engineering Research and Development Center, 91 Hsueh-Shih Rd., North District, Taichung 404, Taiwan. Tel: 886-4-22053366 x 1682; Fax:

886-4-22073495; E-mail: [email protected]

and examining a vast area in a less invasive manner to provide a

global screen for early cancer detection.

9

Recently, we devised a microscope-based relay-lens hyperspectral imaging (HSI) sys- tem that has proven feasible for projecting the image plan from an optical system, such as a microscope, to an image acquisition unit [e.g., a charge-coupled device (CCD) camera].

The scanning of the image plan can be achieved using a rela- tively low-cost moving platform to move the relay-lens system with less precision because the scanned image plan has first been magnified through the projection optical path.

13

However, the acquisition from hyperspectral images may reflect a complex mixture of different sources. For example, the mixture can come from the underlying reorganization of bio- chemical fluorophores in and around a tumorigenesis area.

14

Such a reorganization includes an increased metabolism (as

reflected by the increasing NADH) due to neoplastic develop-

ment, a collapsed collagen matrix caused by tumor infiltration,

and decreasing FAD concentration due to tissue remodeling. On

the other hand, some excited fluorophores irrelevant to cancer-

ization, as well as optical interferences caused by the imaging

hardware, such as optical noise, reflection, scattering, and

refraction, can also be included in the image mixture. As a

result, it should be noted that the intensity alteration at a

specific

(3)

wavelength might result from the net effect of the mixture of different biochemical fluorophores and thus yield different results depending on how the HSI data are projected.

A recent approach that uses independent component analysis (ICA) to process the biomedical signal and image data was successfully applied to electroencephalography (EEG) and func- tional magnetic resonance imaging (fMRI) data to separate the mixture of these data into independent brain blood

oxygenation-

level dependent (BOLD) activities, as well as the underlying electrical oscillations associated with specific brain proc- esses.

15–17

Here, we used ICA to decompose the HSI data obtained from 31 oral cancer tissue sections that contained well-developed and identifiable keratinized tissues from the fixed, excised oral cancer biopsies of 30 patients. We hypoth- esized that after separating the distinct spectral features associ- ated with different molecules as well as the optical

interferences,

we would be able to easily identify the independent spectral component associated with the keratinized tissue given that ICA provides both the spatial and spectral information for each of the independent components, showing which area in the tissue contains what characteristic spectrum.

15,17

After the independent spectral components of the keratinized tissues from the different patients/samples were identified, we could further test the consistency of the keratinized tissue-related spec-

tra across different patients using a simple correlation analysis, plotting the mean and standard deviation of the multisample spectra and computing the coefficient of variation (COV) across different wavelengths.

In the past, a similar approach using principal component analysis (PCA) was also documented for analyzing the HSI data of cancerous tissues.

18

We therefore also compared the ICA results to the principal spectral components derived using the PCA method. The results show that the ICA method extracted more consistent keratinized tissue-related spectra with much higher correlations at both excitation wavelengths.

EMCCD was 1000 × 1000 pixels with 10-μm pixel size. As a result, the devised microscope-based HSI system generated a three-dimensional data cube with an in-plane resolution of 30 × 10 μm in the spatial domain and 2.8 nm in the spectral domain. Finally, since a 20× objective was used in the fluores- cence microscope, this resulted in a spatial resolution of 1.5 × 0.5 μm for this study.

3 Cancerous Tissue Preparations and Labeling

2 Microscope-Based Hyperspectral Imaging System

A microscope-based HSI system (for more details, see Hsieh et al.

13

) was used to produce the HSI data of thin, frozen sections

cut from excised oral cancer specimens. Briefly, a regular inverted fluorescence microscope was connected to a custom- designed and manufactured embedded relay-lens system capable of decomposing the two-dimensional (2-D) image of a thin-sliced object, one line at a time, into continuous spectra.

By scanning through the entire 2-D object space, the relay-lens system produced a data cube with two dimensions in the spatial domain and one in the spectral domain (thus, x by y by λ).

8,13

The embedded relay-lens system consisted of a moving relay- lens set with a fixed slit of 30 μm width and a hyperspectrom- eter, spanning a spectral range of 400 to 1000 nm, to convert the

slit image into continuous spectra. The slit image was relayed from the sliced object placed on the object platform of the microscope, which was lit by a Xenon light source filtered into two wavelength ranges, 330 to 385 nm and 470 to 490 nm, for excitation. Inside the embedded relay-lens system, a relay-lens was mounted on a stepping motor-controlled plat- form that could move the relay-lens in 10-μm steps to cover the entire field of view of the slice sample. On the other side of the embedded relay-lens system, an electron multiplying CCD (EMCCD) was mounted to take a snapshot of the HSI data pro- jected from the image slit. The image resolution produced by the

Thirty-one tissue sections of fixed oral cancer specimens, each containing clearly identifiable keratinized tissues, from 30 patients (aged 56.8 Æ 9.2 years, all males) were used to test the feasibility of finding the spectral fingerprints associated with keratinized tissues using ICA and PCA decomposition.

These patients were all diagnosed with oral cancer with mod- erate- to well-differentiated squamous cell carcinoma in their oral cavity and/or tongue (stage II or higher). Before surgery, all patients were informed that the excised tissues would undergo hyperspectral image acquisition and be further proc- essed by the ICA- and PCA-based image analysis procedure as a further examination. They also provided their signed informed consent. The use of the excised cancerous tissues as well as the image acquisition and analysis procedure for the HSI data were approved by the Institutional Review Board of China Medical University Hospital, Taichung Taiwan.

All the procedures used complied with the Declaration of Helsinki.

The regular pathological diagnosis procedure with hema- toxylin and eosin (H&E) staining was used to prepare the sec- tion slices. An experienced pathologist (C.-I. J.) viewed the section slices under a regular microscope, rated the stage of cancer development, and labeled the regions with

keratinization.

The same sample slides then underwent hyperspectral image acquisition using the aforementioned system (see Sec. 2) and according to the locations labeled by the pathologist. The HSI data were then transferred to a computer server located in a different department for analysis by a different data analyst (J.-R. D.). Before the HSI data were transferred, the identifica- tion information linking the image data to the personal demo- graphic data and the pathological diagnoses was removed for deidentification purposes. Note that the information regarding the spatial location of the keratinization in the samples was also not revealed to the data analyst while he was analyzing the data.

4 Data Preprocessing and Image Reconstruction Before each section slide was scanned by the HSI system, two calibration images were obtained: (1) a dark field image with no light to the camera, which was used to remove the system noise and (2) a reference blank image, for which an area on the slide was scanned with all the layers of glass but without the tissue section sample. The latter calibration image was used to remove

the nonuniformity caused by uneven illumination, periodic scan line stripping, and the reflection from the glass medium.

11,13

After data preprocessing, the HSI data cube was first recon- structed into a series of 2-D image slices, each representing a 2-D autofluorescence intensity of the thin tissue slice at an emis-

sion wavelength. The two image dimensions corresponded to the spatial x- and y-axis of the 2-D object slice. In order to fit the limited memory space in the computer server used for the ICA decomposition, as well as to reduce the total compu- tation time, each 2-D image slice was downsampled in space

•

(4)

to one fourth (500 × 500) of the original image size

(1000 × 1000). Prior to downsampling, the 2-D image was spa- tially smoothed using a Gaussian spatial filter with a full width at half maximum of 2 pixels (1 μm) in each dimension.

The processed data cube was then converted into a 2-D (250;000 × 1000, pixels by wavelengths) surrogate data set for further ICA decomposition.

also used to select the principal spectral components from the PCA results.

6 Consistency of the Derived Autofluorescence Spectra of the Keratinized Tissues

5 ICA Decomposition and Component Selection Prior to ICA decomposition, the surrogate data were first decomposed using PCA so as to determine the component num- ber to be decomposed into via the ICA. The criterion to deter- mine the component number was the principal dimension that preserved >98% of the data variance. The PCA result was also stored so it could be compared to the ICA result.

The surrogate data for each tissue sample were then decom- posed using ICA into the previously determined component number. The Infomax ICA algorithm, as developed by Makeig et al.

16

and implemented in the FMRLAB (http://

sccn.ucsd.edu/fmrlab),

15

was used to decompose the surrogate data. The ICA process was used to maximize the independence by maximizing the sparseness or the kurtosis of each indepen- dent component (IC), which was derived via a linear decompo- sition using a 2-D inverse weight matrix. During the training phase of ICA decomposition, the optimization algorithms of gradient descent was used to iteratively adjust the elements of the inverse weight matrix so as to maximize the sparseness among the ICs. The iterative adjustment was halted when the total amount of adjustment was <10

−7

or when the maximum iteration had been reached (maximum steps ¼ 512 was used here). Because of the data size, it could take up to 8 h for each ICA decomposition process to be completed.

After the ICA process was finished, a linear back-projection was performed to multiply the input surrogate data with the inverse weight matrix to compute the independent spectral com-

ponents that composed the HSI data. Each independent spectral component gave the spatial distribution in pixels, with relative intensities indicating the strengths (activations) with which the independent spectral component contributed to the data vari- ance. This spatial distribution, also called the component map, was then standardized by removing the mean map and dividing it by the standard deviation of the component map.

A threshold of a t-value >1.5 was applied to find the highly significant pixels, and the characteristic autofluorescence spec- trum across the entire wavelength span was computed by aver- aging the back-projected autofluorescence spectra of the selected highly participating pixels. As a result, the ICA gave two important pieces of information: one, in the spectral domain, showing the characteristic autofluorescence spectrum and the other, in the spatial domain, indicating which area in the tissues produced the derived independent spectrum.

Given that the samples of cancerous tissue sections were pre- pared with H&E staining and annotated by an experienced pathologist, the spatial labels of keratinized tissues in the sam- ples were finally compared with the results of the ICA compo- nents to determine the accuracy of the decomposition result.

Since the ICA decomposition used all of the domain data in the process without any a priori knowledge regarding the spatial labels, the ICA-based spatial clustering process could thus be considered as blind. One IC that best matched the spatial loca- tions of the labeled keratinized tissue in the oral cancer sample was selected as the target component. This same criterion was

After the independent and principal components (PCs) with component maps overlapping with the keratinized tissues in the oral cancer samples were identified, the characteristic auto- fluorescence spectra associated with these selected

components

were computed for a further test of consistency. This consistency

test was conducted by computing correlations between the characteristic spectra of any two of the 31 tissue samples (thus yielding 465 possible pairs). Finally, the mean correla- tions were computed for each of the four conditions

[two decomposition methodsðICAversusPCAÞ × two excitation wavelength ranges (330 to 385 nm versus 470 to 490 nm)]. In addition, we also tested the consistency by plotting the standard

deviations along with the mean spectrum obtained using differ- ent decomposition methods on the HSI data under different exci-

tation wavelengths. These plots provided clear visualization of the consistency of the derived spectra by checking the envelope spanned by the standard deviations. In order to check in which part along the wavelength axis the derived spectra exhibited bet-

ter consistency, we further computed the COV by computing the ratio of the standard deviation and mean spectrum at each wave-

length. As a result, the COV could be plotted as a function of wavelength for each of the four conditions (two decomposition methods × two excitation wavelength ranges).

7 Results

The data preprocessing and ICA decomposition were conducted on a Windows 2008 Server with an 8-core CPU and 24 GB of memory installed. A batch of routines implemented in the FMRLAB and running under a MATLAB® environment (Mathworks, MA) were used to derive the ICA results in a batch line mode. All the decomposition processes could be con- verged with <512 training steps. For each HSI data cube, at least

three decompositions were performed to guarantee the consis- tency of the outcomes. All the ICA results were stable, except for the fact that only the rankings of the output ICs might some- times differ slightly in different decompositions.

Generally speaking, ICA can be thought of as a grouping or clustering process that first separates the underlying indepen- dent processes (or sources) acquired in a mixed manner via an image or signal acquisition system and then groups the acquisition elements (e.g., pixels or voxels) to which a certain IC has significantly contributed. The example shown in this study is similar to the applications demonstrated by Duann et al.

15

and Siegel et al.,

19

wherein the spatial distributions of similar temporal behaviors were extracted from mixed data sets. Here the ICA process also provided two pieces of informa- tion, one being the autofluorescence spectrum that can be con- sistently found in the significant voxels and the other being the spatial distributions defining where those voxels are. In Fig. 1, the first column shows images of the same two tissue samples acquired with white light illumination. These images were iden- tical to those on which the pathologist’s diagnoses were based.

The white bar at the bottom of the image indicates the physical size of 100 μm. The second and third columns in Fig. 1 give the results of the ICA decomposition applied to the HSI data of two (out of the 31) tissue samples: sample #1 at the top and sample

#2 at the bottom; excitation wavelength range #1 (330 to

(5)

Fig. 1 Independent component analysis (ICA) and principal component analysis (PCA) decomposition results of two exemplar cancerous tissue sam- ples with the sample images acquired under two different excitation wavelength ranges (F1: 330 to 385 nm; F2: 470 to 490 nm). The first column gives the images of the two cancerous tissue samples acquired under white light illuminance; the second and third give the ICA results of the hyperspectral images (ICA at F1 and ICA at F2); and the fourth and fifth show the PCA results of the hyperspectral images (PCA at F1 and PCA at F2). The trace under each component image plot shows the autofluorescence spectrum of the keratinized tissues as highlighted in the images. The white bar at the bottom of the white light microscopic image indicates the size of 100 μm.

385 nm) on the left and excitation wavelength range #2 (470 to 490 nm) on the right. The colored pixels in the spatial maps highlight the locations where their characteristic spectra highly resemble the spectrum shown at the bottom trace. The different colors (from red to yellow to white) indicate how strongly the

individual pixels’ spectra resemble the bottom trace, with the white color indicating the highest and red indicating the lowest level of resemblance. Some of the independent spectral compo- nents revealing optical artifacts, such as background speckle noises, were also separated from the HSI data (Fig. 2). Note

Fig. 2 Example artifactual independent spectral components decomposed from the three-dimensional hyperspectral image data using ICA.

•

(6)

that these artifactual components also show specific fluores- cence spectral patterns that might affect cancer tissue detection

if not removed beforehand. On the other hand, the PCA also gives the spatial and spectral information, as did the ICA (the fourth and fifth columns in Fig. 1), but with different result- ant characteristic autofluorescence spectra due to different decomposition principles.

Figure 3 illustrates the mean spectra (thick black lines) and the standard deviation (shade) of the spectra associated with the

keratinized tissues in the 31 tissue samples. As can be seen, ICA method results in more consistent characteristic spectra of keratinized tissues in both excitation wavelength ranges with smaller standard deviations across the entire wavelength range. ICA also gives sharper peaks in both cases. On the other hand, both ICA and PCA result in more consistent spectra from all HSI data under the excitation wavelength range of 470 to 490 nm. When applied to the HSI data excited by the wave- length range of 330 to 385 nm, both ICA and PCA result in higher uncertainty at the rising edges of the extracted character-

istic spectra.

In order to examine the consistency of each decomposition result across the entire spectrum, we further computed the COV by dividing the standard deviation of the resultant spectra with the mean spectra at each wavelength across different tissue sam-

ples. As a result, the COVs are plotted as a function of wave- length in Fig. 4. The COVs of ICA results are plotted in red and the COVs of PCA are plotted in blue; the solid lines depict the results decomposed from the HSI data excited by the wave- length range of 330 to 385 nm and the broken lines the results decomposed from the HSI data excited by the wavelength range of 470 to 490 nm. The results again show that the resulting spec-

tra from ICA decomposition are quite consistent across different specimens (COVs <1.0 for most of the wavelength range).

Fig. 4 Coefficient of variation of the resulting spectra as a function of wavelengths of the two decomposition methods applied to the hyper- spectral imaging data excited at different wavelength ranges. The red lines show the results of ICA decomposition and the blue the results of PCA decomposition. Solid lines indicate the results of the excitation wavelength range of 330 to 385 nm and broken lines the excitation wavelength range of 470 to 490 nm.

However, the PCA decomposition gives only consistent spectra up to 650 nm and then the COVs increase largely and reach peaks at 700 nm for the HSI data excited with wavelength range of 330 to 385 nm and 760 nm for the HSI data excited with wavelength range of 470 to 490 nm.

Table 1 gives the mean correlation coefficients between the characteristic autofluorescence spectra of any two of the sample

tissues derived by different decomposition methods applied to

Fig. 3 Mean (solid lines) and standard deviation (shades) of the spectra of the keratinized tissues from the entire 31 oral cancer tissue samples as found

using ICA and PCA decomposition methods. F1 indicates the excitation wavelength range from 330 to 385 nm and F2 indicates the excitation wave-

length range from 470 to 490 nm.

(7)

Table 1 Mean correlation coefficients (mean Æ standard deviation) between the autofluorescence spectra of the 10 sample tissues derived via various decomposition methods applied to different hyperspectral image data obtained at different excitation wavelength ranges.

330 to 385 nm

Independent component analysis

Principal component analysis

0.92 Æ 0.09

0.85 Æ 0.21

470 to 490 nm

0.97 Æ 0.03

0.89 Æ 0.24

the HSI data and at different excitation wavelength ranges. As can be seen, the ICA applied to the HSI data acquired at exci- tation wavelengths ranging from 470 to 490 nm yields the most consistent spectral fingerprints (with the highest mean correla- tion and lowest standard deviation) of keratinized tissues in the oral cancer tissue sections as compared to other conditions. On average, the ICA method outperformed the PCA method when applied to the HSI data acquired at both excitation wavelength ranges (p < 0.001 for both excitation wavelength ranges).

8 Discussion and Conclusion

Although autofluorescence spectroscopy has long been used to differentiate cancerous tissues from normal tissues, the reported

evidence in the literature and the detection criteria used in com- mercially available testing devices are sometimes incompatible.

As summarized by Ramanujam,

20

distinct excitation wave- lengths are needed in order to elicit maximal emission

wavelengths for different endogenous fluorophores. As a result, any approach that targets different cancerization-related endog- enous fluorophores may use different excitation wavelengths and find peak differences at different maximal emission wave- lengths, thereby facilitating the ideal differentiation between cancerous and normal tissues. In order to have a more complete view using the characteristic autofluorescence spectroscopy associated with the endogenous fluorophores related to cancer development, it might be desirable to look at a wider spectrum so as to establish spectral biomarkers for cancer diagnosis and staging.

11

In light of this, researchers are now actively pursuing imaging devices and data analysis strategies for HSI.

8

However, one of the main obstacles in extracting reliable bio- markers for different biochemical fluorophores is the complex mixture of inputs during the acquisition of HSI data. The HSI data acquisition may involve autofluorescence emitted from various fluorophores, such as NADH, FAD, collagen matrix, elastin, etc., and other optical interferences. The autofluores- cence of irrelevant fluorophores, as well as the artifacts, may sometimes contaminate the autofluorescence spectrum of the targeted fluorophores and, in turn, deteriorate the

reproducibility

of the resulting spectrum. Such a mixture problem would also affect traditional analysis methods in which the differences in fluorescence intensity at certain wavelengths that are excited at UV or other wavelengths are used to differentiate cancerous from normal tissue. On the other hand, the decrease or increase of fluorescence intensity is a measure relative to the baseline.

As

such, whether or not a reliable baseline can be obtained is criti- cal to the validity of a traditional analysis method. Therefore, removing the effects of possible baseline drifting and separating the overlapping effects of two or more driving factors on the fluorescence intensity changes at the same emission wave- lengths should shed light on the search for more reliable

autofluorescence biomarkers for any fluorophores that actively participate in cancerization.

To achieve this goal, one may consider using an algorithm, such as ICA or PCA, to separate and summarize the portions of data variance from a more specific process of cancerization.

Such algorithms can separate the input data into components with minimum overlap in terms of statistical properties. For example, PCA, in general, finds a smaller set of uncorrelated variables that gives as good representation as possible based on second-order statistics.

21

It thus tries to summarize as much data variance as possible into only the top few PCs. To achieve this, PCA finds this first PC that accounts for the largest possible variance, and each succeeding component in turn accounts the highest possible variance under the constraint that it be orthogonal to the preceding components. As a result, PCA can be efficiently implemented using, for example, singu- lar value decomposition to convert a set of observations of pos- sibly correlated variables into a set of linearly uncorrelated variables. ICA, on the other hand, separates the observation into statistically independent components based on higher- order statistics (e.g., kurtosis). To achieve this, ICA uses the algorithms to maximize the non-Gaussianity of the estimated components, which are not necessarily orthogonal to each other.

22

Such nonorthogonal transforms that are sensitive to the higher-order statistics may help to fit into the detail struc- tures of the data variance and find the underlying true sources.

23

In the past, PCA has been used to separate underlying spectroscopic processes in fluorescence emission spectral data;

18

the PCA method has been known to efficiently summa- rize data variance using the first few PCs (usually, the first five PCs tend to account for >90% of data variance). It may thus be likely that one PC can combine more than one characteristic autofluorescence spectrum in the HSI data. How many and which characteristic spectra would be combined in a PC depends on the variance of the different principal spectral com- ponents and could vary across different HSI data. Consequently, as shown in this study, the PCA resulted in less consistent spec- tral fingerprints, even though the keratinized tissues in the tissue

sections could be identified equally well from the HSI data.

Compared to PCA, ICA has proved capable of decomposing electroencephalograph/megnetoencephalograph (EEG/MEG) and fMRI data into reliable ICs that demonstrate more consistent behavior across different subjects.

15–17

The extracted ICs repre- sented more purified brain EEG electrical oscillations and fMRI BOLD signals with less contamination from other irrelevant brain activations and/or artifactual components. In line with these findings, our ICA results revealed more consistent inde- pendent characteristic spectra associated with keratinized tissues

in the HSI data from various cancer specimens of different patients.

Although the present study has proven the process of ICA useful for reliably separating the independent characteristic spectra associated with keratinized tissues from the mixture of HSI data of different oral cancer tissue sections, this can only be considered a first step toward finding the hyperspectral fingerprints for cancer development and diagnosis. As keratin- ized tissues are mostly found during very late stages of cancer- ization and the biochemical composition seems to be more consistent in keratinized tissues,

24,25

the ability to detect the presence of keratinized tissues is of no direct diagnostic value for early cancer detection. Keratinized tissues were used in this study simply to test the feasibility of the proposed

•

(8)

8. R. A. Schultz et al., “Hyperspectral imaging: a novel approach for microscopic analysis,” Cytometry 43(4), 239–247 (2001).

9. H. Akbari et al., “Cancer detection using infrared hyperspectral imag- ing,” Cancer Sci. 102(4), 852–857 (2011).

10. G. M. Palmer et al., “Optical imaging of tumor hypoxia dynamics,”

J. Biomed. Opt. 15(6), 066021 (2010).

11. A. M. Siddiqi et al., “Use of hyperspectral imaging to distinguish normal, precancerous, and cancerous cells,” Cancer 114(1), 13–21 (2008).

12. R. T. Kester et al., “Real-time snapshot hyperspectral imaging endos- copy,” J. Biomed. Opt. 16(5), 056005 (2011).

13. Y. F. Hsieh, M. Ou-Yang, and C. C. Lee, “Finite conjugate embedded relay lens hyperspectral imaging system (ERL-HIS),” Appl. Opt. 50(33), 6198–6205 (2011).

14. C. F. Poh et al., “Fluorescence visualization detection of field alterations in tumor margins of oral cancer patients,” Clin. Cancer Res. 12(22), 6716–6722 (2006).

15. J. R. Duann et al., “Single-trial variability in event-related BOLD sig- nals,” Neuroimage 15(4), 823–835 (2002).

16. S. Makeig et al., “Mining event-related brain dynamics,” Trends Cogn.

Sci. 8(5), 204–210 (2004).

17. M. J. McKeown et al., “Spatial independent activity patterns in func- tional MRI data during the Stroop color-naming task,” Proc. Natl.

Acad. Sci. U. S.A. 95(3), 803–810 (1998).

18. N. Ramanujam et al., “Cervical pre-cancer detection using a multivari- ate statistical algorithm based on laser induced fluorescence spectra at multiple excitation wavelengths,” Photochem. Photobiol. 64(4), 720–

735 (1996).

19. R. M. Siegel et al., “Spatiotemporal dynamics of the functional archi- tecture for gain fields in inferior parietal lobule of behaving monkey,”

Cereb. Cortex 17(2), 378–390 (2007).

20. N. Ramanujam, “Fluorescence spectroscopy of neoplastic and non-neo- plastic tissues,” Neoplasia 2(1–2), 89–117 (2000).

21. C. Bugli and P. Lambert, “Comparison between principal component analysis and independent component analysis in electroencephalograms modelling,” Biom. J. 48(2), 1–16 (2007).

22. P. Comon, “Independent component analysis, a new concept?,” Signal Process. 36(3), 287–314 (1994).

23. T. P. Jung et al., “Imaging brain dynamics using independent component analysis,” Proc. IEEE 89(7), 1107–1122 (2001).

24. S. Kuroki et al., “Epithelialization in oral mucous wound healing in terms of energy metabolism,” Kobe J. Med. Sci. 55(2), E5–E15 (2009).

25. S. D. Silva et al., “Fatty acid synthase expression in squamous cell car- cinoma of the tongue: clinicopathological findings,” Oral Dis. 14(4), 376–382 (2008).

method with easily identifiable ground truth. To apply the pro- posed approach to the HSI data of other types or stages of can- cerous tissue, samples should be straightforward. Basically, the same procedure still applies. Therefore, an important next step would be to test if any hyperspectral fingerprints can be found for different molecules that can reflect the stages of cancer development. It would be valuable, moreover, to devise a port- able system that could be used in physicians’ offices to help cli- nicians screen suspicious spots of epithelial mucosa in, for example, the oral cavity and thereby determine whether or not a biopsy is necessary.

Acknowledgments

The authors would like to thank Mr. Yao-Fang Hsieh and Mr.

Chih-Hsien Chen for their constant calibrations of the hyper- spectral imaging device. This study was supported in part by the National Science Council (NSC 100-2218-E-039-001) of Taiwan. J.R.D., M.O.-Y., Y.-J. L., and J.-C.C. were further sup- ported by the “Aim for the Top University Plan” of National Chiao Tung University and the Ministry of Education of Taiwan.

References

1. P. M. Lane et al., “Simple device for the direct visualization of oral- cavity tissue fluorescence,” J. Biomed. Opt. 11(2), 024006 (2006).

2. P. Sharma et al., “The utility of a novel narrow band imaging endoscopy system in patients with Barrett’s esophagus,” Gastrointest. Endosc.

64(2), 167–175 (2006).

3. M. A. Kara et al., “Endoscopic video-autofluorescence imaging fol- lowed by narrow band imaging for detecting early neoplasia in Barrett’s esophagus,” Gastrointest. Endosc. 64(6), 679–685 (2005).

4. W. L. Curvers et al., “Endoscopic tri-modal imaging for detection of early neoplasia in Barrett’s esophagus: a multi-centre feasibility study using high resolution endoscopy, autofluorescence imaging and narrow band imaging,” Gut 57(2), 167–172 (2008).

5. R. Drezek et al., “Autofluorescence microscopy of fresh cervical-tissue sections reveals alternation in tissue biochemistry with dysplasia,”

Photochem. Photobiol. 73(6), 636–641 (2001).

6. Y. Wu, W. Zheng, and J. Y. Qu, “Time-resolved confocal fluorescence spectroscopy reveals the structure and metabolic state of epithelial tis- sue,” Proc. SPIE 6430, 643013 (2007).