DATAIA Seminar | David Degras "Joint Tensor Decomposition: Methods and Models"
 
Abstract
Joint Tensor Decomposition: Methods and Models
Resume
In science and industry, data often arise as tensors, or multidimensional arrays, collected along various dimensions such as time, space, or frequency. Examples include video sequences in computer vision, 2D+ images in engineering and biomedical research, audio signals, and text embeddings in natural language processing. Preserving tensor structure in analysis can provide significant statistical and computational advantages over routine vectorization methods. Tensors retain the inherent multidimensional relationships within data, leading to more accurate and interpretable representations of complex phenomena. Additionally, tensor operations enable efficient manipulation of high-dimensional data, resulting in substantial savings in computation time and memory usage.
However, the mathematical theory of tensors remains somewhat elusive and is still under active development. While the maximum rank of a matrix of given dimensions is well understood, determining the maximum rank of a tensor is an open problem. Similarly, while the rank of a matrix can be easily determined using established algorithms like QR or SVD, finding the rank of a tensor is generally NP-hard. There is ample room for theoretical advances in tensor algebra and geometry, as well as in tensor-based optimization and statistics.
In this talk, I will delve into the problem of joint tensor decomposition, which involves identifying common variations among multiple tensor datasets collected on the same objects or persons, also known as data fusion or integration in the literature. After reviewing standard tensor decompositions, I will focus on tensorial extensions of partial least squares (PLS) and canonical correlation analysis (CCA), presenting novel algorithms based on block coordinate ascent or Riemannian gradient optimization that can jointly decompose multiple tensor datasets of arbitrary orders. I will provide numerical convergence results and statistical guarantees within the context of factor models. Moreover, I will discuss algorithm initialization, higher-order factor components, and statistical inference methods such as bootstrap and permutation techniques. If time permits, I will showcase numerical evidence of the method’s performance and outline potential applications in the multimodal integration of neuroimaging data.
Affiliation
University of Massachusetts (Boston, USA)
Associated Professor - Department of Mathematics
Invited Professor (2023-24) 
Inria Saclay & CEA Saclay - MIND team
Biography
David Degras received his PhD in Statistics from the Université Paris 6, France, in 2008. He was a Postdoctoral Researcher at the Statistical and Applied Mathematical Sciences Institute (SAMSI) in 2010-11 and served as an Assistant Professor in the Department of Mathematical Sciences at DePaul University from 2011 to 2016. He is currently an Associate Professor in the Department of Mathematics at the University of Massachusetts Boston. His research interests include statistical learning, statistical computing, functional data analysis, convex and combinatorial optimization, and neuroimaging.
The seminar will take place on Thursday, April 25, 2024, from 12:30 to 2:00 pm at CentraleSupélec, amphithéâtre e.068 (Bouygues building) in Gif s/Yvette, and will also be broadcast by videoconference : https://teams.microsoft.com/l/meetup-join/19%3ameeting_M2ZkMDVlNTgtNzAwNS00YTlkLWI2NmMtMjhmNGZiODE3MTdj%40thread.v2/0?context=%7b%22Tid%22%3a%2261f3e3b8-9b52-433a-a4eb-c67334ce54d5%22%2c%22Oid%22%3a%2240a0f890-4bbf-4d69-92b9-e53f47d70bf7%22%7d.
Don't miss the announcement of a new DATAIA seminar!
Subscribe to our seminar mailing list by clicking here.
 
        