Categorizing Sequences of Laryngeal Data for Decision Support

Download Presentation

Categorizing Sequences of Laryngeal Data for Decision Support

Loading in 2 Seconds...

- 58 Views
- Uploaded on
- Presentation posted in: General

Categorizing Sequences of Laryngeal Data for Decision Support

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

The 4thInternational Conference onElectrical & ControlTechnologies

Categorizing Sequences of Laryngeal Datafor Decision Support

A. Gelžinis٭, E. Vaičiukynas٭, E. Kelertas٭, M. Bačauskienė٭, A. Verikas*٭, V. Uloza°, A. Vegienė°

٭Department of Electrical & Control Instrumentation, Kaunas University of Technology, Lithuania

* Intelligent Systems Laboratory, Halmstad University, Sweden

°Department of Otolaryngology, Kaunas University of Medicine, Lithuania

- Can we be more dedicated 2 basic human needs?
- eHealth investigates modern best practices in the use of Information and Communication Technologies (ICT) as tools for enhancing health promotion, health protection, quality of service, accessibility and efficiency in all aspects of health care delivery (to meet all health needs of society).
- COST Action 2103 “Advanced Voice Function Assessment” combine previously unexploited techniques with new theoretical developments to improve voice production models and analysis algorithms with a view to voice quality & diseases.

Categorizing Sequences of Laryngeal Data for Desicion Support

- In clinical practice, decision making is quite often based on subjective evaluation of patient data.
- Long-term goal of our work is decision support system for automated diagnostics of laryngeal diseases, using such information sources:
- voice signal of patient (recorded with microphone);
- images of vocal folds (laryngeal videostroboscopy);
- questionnaire data.

- In this experiment we explore kernel-based correlative learning techniques on information extracted from videostroboscopy (sequences of images) for categorizing laryngeal disorders.

Categorizing Sequences of Laryngeal Data for Desicion Support

- Well-established technique for measuring the glottal gap, glottic closure, glottic angle and the angular velocities of vocal fold abduction and adduction (with single-ﬂash-timing).
- Multiple fundamental frequencies may be recorded in the case of some diseases, such as polyps, nodules, and cysts (with multiple-ﬂash-timing).
- Duration = 20 – 80 seconds
- Resolution = 768 × 576 pixels
- Frame rate = 25 frames / second
- Data used is from Department of Otolaryngology, Kaunas University of Medicine, Lithuania.

Categorizing Sequences of Laryngeal Data for Desicion Support

- Mass lesions of vocal folds = [ healthy | diffuse | nodular ]
- Nodular lesions (localized thickenings):
- nodules
- polyps
- cysts

- Diffuse lesions:
- papillomata
- hyperplastic
- laryngitis w/ keratosis
- carcinoma

Categorizing Sequences of Laryngeal Data for Desicion Support

Categorizing Sequences of Laryngeal Data for Desicion Support

- Same incident light - different RGB values (by different acquisition equipment). Transform space!
- Similarity of two colors from their distance in the space is evaluated by using L*a*b color space.
- Multi-scale retinex theory-based color image en- hancement mitigates capturing condition variation.
- Color distribution is expressed by 3D color histogram of N bins as a random N-vector, which consists of N = 4096 (16×16×16) bins.
- Compact description of histogram is obtained by removing not so important bins from all images.

Categorizing Sequences of Laryngeal Data for Desicion Support

- Kernel Principal Component Analysis (Scholkopf)
- orthogonal transformation of coordinate system of data
- powerful method of extracting nonlinear features for classiﬁcation (finding nonlinear relationships in data)
- nonlinear generalization of PCA via the kernel trick
- linear PCA in the feature space = KPCA in the input
- helps avoid overﬁtting the distributions in the possibly inﬁnite-dimensional feature space F (or RKHS)
- only the ﬁrst few important eigenvectors are kept, effectively regularizing the resulting distributions
- first principal component = most of variability in the data

- Solving the eigenvalue problem of kernel PCA is to diagonalize the kernel (data covariance) matrix.

Categorizing Sequences of Laryngeal Data for Desicion Support

pattern

analysis

algorithm

kernel

identifiedpattern(data class)function

data

subspace

f(x)=Σαiκ(xi,x)

K(X,Z)

LS-SVM

Kernel trick: K(X, Z) = ‹ K(·,X), K(·,Z) › = ‹ φ(X), φ(Z) ›

Categorizing Sequences of Laryngeal Data for Desicion Support

Categorizing Sequences of Laryngeal Data for Desicion Support

- Primal representation (inputs)
- Dual representation (features)

Categorizing Sequences of Laryngeal Data for Desicion Support

- Primal problem (margin maximization)
- Dual problem (quadratic programming)

Categorizing Sequences of Laryngeal Data for Desicion Support

- Lagrangian with Lagrange multipliers α:
- Conditions for optimality (saddle point):
- Primal representation / Dual representation

Categorizing Sequences of Laryngeal Data for Desicion Support

- Primal problem (margin maximization)
- Dual problem (quadratic programming)

Categorizing Sequences of Laryngeal Data for Desicion Support

- Lagrangian with Lagrange multipliers α:
- Conditions for optimality (saddle point):
- Primal representation / Dual representation

Categorizing Sequences of Laryngeal Data for Desicion Support

- Ensemble matching methods generally consider the task of obtaining a similarity function which operates on pairs of sets of feature vectors, or pairs of ensembles.
- Video is considered as an ensemble of feature vectors, where each vector is extracted from corresponding video frame – image.
- The alignment of the two subspaces spanned by the elements of the two ensembles is used as a measure of similarity.
- The kernel principal angle is the angle between the principal subspaces of two matrices, each matrix composed of feature vectors as columns.

Categorizing Sequences of Laryngeal Data for Desicion Support

- Principal angles are angles between a pair of vector sets in two linear subspaces (also relates to principal correlation).
- If θ = 0, the cosine is 1 and vector sets are parallel.
- If θ = π/2 , the cosine is 0 and vector sets are orthogonal.
- Quantities cos(θk) are often referred to as principal correlations or canonical correlations of the matrix pair.

Categorizing Sequences of Laryngeal Data for Desicion Support

- Kernelized Modified Gram-Schmidt Orthogonalization used
- Kernel trick was used to compute principal angles in the feature space induced by a minor Gaussian (RBF) kernel:K(x, z) = ‹ φ(x), φ(z) › = kminor(x,z)
- Instead of dealing with the set-elements x in input space, we work with their mappings in feature space Φ(x), which are accessible only as scalar products.
- Using a linear minor kernel (polynomial degree 1) is equivalent to computing principal angles in input space.
- Positive-definite kernel (similarity metric) for our LS-SVM:
- K (Ai, Aj) ≡ f (Ai, Aj) = ∏k=1..ncos2(θk)

Categorizing Sequences of Laryngeal Data for Desicion Support

- Available data from 30 patients has:
- 10 patients evaluated as belonging to the nodular class;
- 12 patients evaluated as belonging to the diffuse class;
- 8 patients evaluated as belonging to the healthy class.

- There were 100 image frames in each 4 s image sequence, but only 10 from the middle were used.
- Data normalized to 0 mean and variance 1.
- Polynomial kernel of degree q = 1, 2, 3 in KPCA.
- Gaussian kernel as a minor kernel to estimate the KPA kernel (measure over a pair of matrices).
- Due to the small number of data points (image sequences) leave-one-out test was chosen for %.

Categorizing Sequences of Laryngeal Data for Desicion Support

Categorizing Sequences of Laryngeal Data for Desicion Support

Categorizing Sequences of Laryngeal Data for Desicion Support

Categorizing Sequences of Laryngeal Data for Desicion Support

- Least squares support vector machine (LS-SVM) was constructed in Matlab for classification into 3 classes, namely healthy | nodular | diffuse.
- Kernel function (KPA kernel) employed by LS-SVM was designed to be used over a pair of matrices, rather than over a pair of vectors, to kernelize principal angles between subspaces.
- Different number of kernel principal components was extracted from N-vector histogram & tested.
- Disease recognition is done on the premises, that:
- different disorders generate different subspaces
- principal angles between subspaces can be measured

Categorizing Sequences of Laryngeal Data for Desicion Support

- In this paper, we have investigated an approach to an automated analysis of sets of vocal cord images aiming to categorize laryngeal disorders.
- Sequences of color laryngeal images were categorized into the healthy, nodular and diffuse classes using an LS-SVM with a kernel deﬁned over a pair of matrices.
- Bearing in mind the high similarity of the decision classes, encouraging classiﬁcation performance was obtained when testing the developed tools on data recorded during routine videostroboscopy.
- Relatively high classiﬁcation accuracy obtained (85,7%) encourages further studies in this area.

Categorizing Sequences of Laryngeal Data for Desicion Support