RANDOMIZED NUMERICAL LINEAR ALGEBRA FOR KERNEL MATRIX COMPRESSION
Abstract
Matrices are widely used to store data in applications across Data Science, Computer Science, Statistics, and Applied Mathematics. Matrix computations such as matrix multiplication, matrix inversion, eigenvalue decomposition, and singular value decomposition are essential in real-world applications. Unfortunately, many of these operations are so expensive in time and memory that they become prohibitive when the scale of the data is large. Moreover, when the data contains a large amount of meaningless information, called noise, machine-precision matrix operations are not necessary, and one can sacrifice a reasonable amount of accuracy for computational efficiency. In Machine Learning, Linear Algebra, Partial Differential Equations, and Optimization as well, the data usually boils down to an m × n matrix A, and it is often very helpful to derive matrix approximations to the original matrix A when the data is otherwise unmanageable. We seek an approximation matrix A_k of a particular rank k (considerably smaller than m and n), in other words, a low-rank approximation. Methods such as Singular Value Decomposition and QR Decomposition can be used to achieve such matrix approximations, but these algorithms usually take a lot of time: superlinear in the number of nonzero elements of the matrix. We want algorithms that remain practical in applications where the data sets form extremely large matrices. So, in this thesis, we primarily focus on a new approach called Randomized Numerical Linear Algebra. We will introduce two methods, the first known as Random Projections and the second known as Random Sampling. Random projections have recently emerged as a powerful method for dimensionality reduction. We will present experimental results carried out on matrices generated by kernel functions, and work with a specific type of kernel function called Green's function.
We will perform low-rank approximation on these matrices using Randomized Numerical Linear Algebra and report approximation errors to better understand the behavior of the new approach, known as RandNLA. We will show that using a sparse random matrix gives additional computational savings. Projecting the data onto a random lower-dimensional subspace yields results comparable to conventional dimensionality reduction methods such as SVD and QR: the similarity of data vectors is well preserved under random projection, while this approach is computationally significantly less expensive than the conventional ones.
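A sparse random projection of the kind alluded to above can be sketched as follows. This is an assumed setup, not the thesis code: it uses the well-known {+1, 0, −1} construction of Achlioptas, in which two thirds of the projection matrix's entries are exactly zero, yet pairwise distances between data vectors are still approximately preserved.

```python
# Minimal sketch (assumed parameters): sparse random projection with
# entries sqrt(3)*{+1, 0, -1} taken with probabilities {1/6, 2/3, 1/6},
# scaled by 1/sqrt(k) so that squared norms are preserved in expectation.
import numpy as np

rng = np.random.default_rng(1)
n, d, k = 100, 1000, 400           # n data vectors in d dims, target dim k

X = rng.standard_normal((n, d))    # data vectors as rows (illustrative)

# Sparse projection matrix: most entries are zero, so computing X @ R
# needs far fewer arithmetic operations than a dense Gaussian projection.
R = rng.choice([np.sqrt(3.0), 0.0, -np.sqrt(3.0)],
               size=(d, k), p=[1 / 6, 2 / 3, 1 / 6]) / np.sqrt(k)

Y = X @ R                          # projected data, shape (n, k)

# Distances between vectors are approximately preserved.
orig = np.linalg.norm(X[0] - X[1])
proj = np.linalg.norm(Y[0] - Y[1])
```

The ratio `proj / orig` concentrates around 1 as k grows, which is the sense in which the similarity of data vectors survives the projection at a fraction of the cost of SVD or QR.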