Institute for Mathematical Sciences Programs & Activities
Meeting the Challenges of High Dimension: Statistical Methodology, Theory and Applications
(13 Aug - 26 Oct 2012)
Organizing Committee · Visitors and Participants · Overview · Activities · Venue · Enquiries
Co-chairs
- Peter Hall (The University of Melbourne)
- Xuming He (University of Michigan)
- Yingcun Xia (National University of Singapore)
Members
- Zhidong Bai (National University of Singapore)
- Peter Bickel (University of California at Berkeley)
- Jianqing Fan (Princeton University)
- Mike Titterington (University of Glasgow)
The "curse of dimensionality … has plagued the scientist from the earliest days," wrote the mathematician Richard Bellman in 1961. Specifically, Bellman's curse relates to the fact that the number of data needed to adequately represent information in p-vectors grows exponentially as a function of p. The curse achieved particular prominence in statistics in the 1970s and 1980s, with the development of projection pursuit methods and related techniques in the contexts of regression and density estimation. These methods, sometimes viewed today as "classical" approaches to dealing with high-dimensional data, required the dimension to be smaller than, or at least of the same order as, sample size. That assumption is often violated today.
To at least some extent the curse needs a specific context in order to have the proportions asserted by Bellman and others. For example, problems involving functional data analysis (FDA) are, at least in terms of their mathematical concept, infinite dimensional, yet they suffer significantly less from the curse of dimensionality than do many contemporary finite-dimensional problems, for example those in genomics. This is because, in many but not all of the forms in which FDA is commonly encountered today, it lacks the missing ingredient of sparsity that Bellman had particularly in mind. An approach to FDA based on principal component analysis often manages to arrange the dimensional components in order of decreasing importance; or at least, it manages to order the first n components, where n denotes sample size. This is a boon to analysis, and it generally puts the famous curse into abeyance.
New types of data require new statistical methodologies, but just as importantly they imply that the questions that the analysis is directed at answering are new and distinctive. Those questions have changed fundamentally the connection between statistics and the areas of science, engineering and social science where the impact of statistics is greatest.
Thus, the topic of high-dimensional data analysis has many aspects, motivated by many applications, sometimes relying heavily on dimension reduction and variable selection, and sometimes co-habiting happily with more conventional multivariate methods. The program’s first workshop (13 - 24 Aug 2012) will address all of these aspects. They lie at the frontiers along which statistical methodology, the applications that motivate it, the questions that it answers, and the theory that underpins it, are advancing today. The program’s second workshop (1 - 12 Oct 2012) will continue to address challenges of high dimensional data analysis with more focuses on the methods and applications where sparsity is present.
- Workshop 1: 13 - 24 Aug 2012
- Tutorials: TBA
- Workshop 2: 1 - 12 Oct 2012
* Our office will be closed on the Monday, 20 Aug 2012, being Singapore public holiday.
Students and researchers who are interested in attending these activities are requested to complete the online registration form.
The following do not need to register:
- Those invited to participate.
Â
For general enquiries, please email us at ims(AT)nus.edu.sg.
For enquiries on scientific aspects of the program, please email Peter Hall at halpstat(AT)ms.unimelb.edu.au,
Xuming He at xmhe(AT)umich.edu or Yingcun Xia at staxyc(AT)nus.edu.sg.
Organizing Committee · Visitors and Participants · Overview · Activities · Venue · Enquiries