RPubs

by RStudio

deleeuw

Jan de Leeuw

Recently Published

Simultaneous Diagonalization in R/C

We present an R/C implementation of optimal simultaneous diagonalization of several real symmetric matrices using Jacobi plane rotations, with compact triangular storage of symmetric matrices.

about 8 years ago

Differentiability of Stress at Local Minima

Earlier papers have shown that stress is differentiable at local minima if certain conditions on the weights and dissimilarties are satisfied. In this note we show the result remains true without these additional conditions.

about 8 years ago

The Positive Orthant Method

The positive orthant method tries to find solutions to consistent systems of inequalities, and approximate solutions to inconsistent systems, by maximizing a fit measure based on the sign function and the absolute value function. We concentrate on systems of linear inequalities and develop a convergent majorization algorithm.

about 8 years ago

Majorization of Smoothed Absolute Values

We discuss two different even and convex non-negative smooth approximations of the absolute value function and apply them to construct majorization algorithms for least absolute deviation regression. Both uniform and sharp quadratic majorizations are constructed. As an example we use the Boston housing data. In our example sharp quadratic majorization is typically 10-20 times as fast as uniform quadratic majorization.

over 8 years ago

An Alternative Majorization for Multidimensional Scaling

The Cauchy-Schwartz majorization of the distance function in SMACOF is replaced by a majorization of the squared distance function. This leads to an interesting SMACOF alternative, which we call SMOCAF.

over 8 years ago

Homogeneity Analysis of Durant Bend Sherds

This paper is a non-technical and mostly graphical introduction to Homogeneity Analysis, also known as Multiple Correspondence Analysis. It is meant as an explanation and justification of a non-standard application of Correspondence Analysis to an example from archeology.

over 8 years ago

Multidimensional Array Indexing and Storage

We give R, C, and R->C code to access lineary stored multidimensional arrays and compactly stored multidimensional super-symmetric arrays.

almost 9 years ago

Higher Partials of fStress. Who Needs Them ?

We define fDistances, which generalize Euclidean distances, squared distances, and log distances. The least squares loss function to fit fDistances to dissimilarity data is fStress. We give formulas and R/C code to compute partial derivatives of orders one to four of fStress, relying heavily on the use of Faà di Bruno’s chain rule formula for higher derivatives.

almost 9 years ago

Pseudo Confidence Regions for MDS

We compute pseudo-confidence ellipses around MDS solutions, using a new fast implementation of the Hessian of the stress loss function.

about 9 years ago

Tweaking the SMACOF Engine

The smacof algorithm for (metric, Euclidean, least squares) multidimensional scaling is rewritten so that all computation is done in C, with only the data management, memory allocation, iteration counting, and I/O handled by R. All symmetric matrices use compact, lower triangular, column-wise storage. Second derivatives of the loss function are provided, but non-metric scaling, individual differences, and constraints still have to be added.

about 9 years ago

acobi Eigen in R/C with Lower Triangular Column-wise Compact Storage

The Jacobi method for computing eigenvalues and eigenvectors of a symmetric matrix is implemented in C using column-wise compact storage of the lower triangle. The complied C code can be loaded into R using the .C() interface. We compare the C implementation with an earlier version in pure R, and with the built-in eigen function in R.

about 9 years ago

Weighted Low-rank Approximation using Majorization

We give a majorization algorithm for weighted low-rank matrix approximation, a.k.a. principal component analysis. There is one non-negative weight for each residual. A quadratic programming method is used to compute optimal rank-one weights for the majorization scheme.

about 9 years ago

Some Majorization Theory for Weighted Least Squares

In many situations in numerical analysis least squares loss functions with diagonal weight matrices are much easier to minimize than least square loss functions with full positive semi-definite weight matrices. We use majorization to replace problems with a full weight matrix by a sequence of diagonal weight matrix problems. Diagonal weights which optimally approximate the full weights are computed using a simple semi-definite programming procedure.

about 9 years ago

Simultaneous Diagonalization of Positive Semi-definite Matrices

We give necessary and sufficient conditions for solvability of Aj=XWjX′ , with the Aj are m given positive semi-definite matrices of order n. The solution X is n×p and the m solutions Wj are required to be diagonal, positive semi-definite, and adding up to the identity. We do not require that p≤n.

about 9 years ago

Infeasible Primal-Dual Quadratic Programming with Box Constraints

This describes a C version of a infeasible primal-dual algorithm for positive definite quadratic programming with box constraints, proposed by Voglis and Lagaris. We also describe a .C() interface for R.

about 9 years ago

Factor Analysis as Matrix Decomposition and Approximation: I

A general form of linear factor analysis is defined, and presented as a method to factor a data matrix, similar in many respects to principal component analysis. We discuss necessary and sufficient conditions for solvability of the factor analysis equations and give a constructive method to compute all solutions. A follow up paper will present the corresponding algorithm.

about 9 years ago

Computing and Fitting Monotone Splines

A brief introduction to spline functions and B-splines, and specifically to monotone spline functions – with code in R and C and with some applications.

over 9 years ago

Exceedingly Simple Monotone Regression (with Ties)

A C implementation of Kruskal’s up-and-down-blocks monotone regression algorithm for use with .C() is extended to include the three classic ways of handling ties. It is then compared with other implementations.

over 9 years ago

Exceedingly Simple Monotone Regression

A C implementation of Kruskal’s up-and-down-blocks monotone regressionalgorithm for use wth .C(), and a comparison with other implementations.

over 9 years ago

Exceedingly Simple Sorting with Indices

We use the system qsort to write a routine that produces both the sort an the order of a vector of doubles.

over 9 years ago

Shepard Non-metric Multidimensional Scaling

We give an algorithm, with R code, to minimize the multidimensional scaling loss function proposed in Shepard’s 1962 papers. We show the loss function can be justified by using the classical rearrangement inequality, and we investigated its differentiability.

over 9 years ago

Multidimensional Scaling with Distance Bounds

We give an algorithm, with R code, to minimize the multidimensional scaling stress loss function under the condition that some or all of the fitted distances are between given positive upper and lower bounds. This paper combines theory, algorithms, code, and results of De Leeuw (2017b) and De Leeuw (2017a).

over 9 years ago

Multidimensional Scaling with Lower Bounds

We give an algorithm, with R code, to minimize the multidimensional scaling stress loss function under the condition that some or all of the fitted distances are larger than given positive lower bounds. Some interesting majorization theory is also given.

over 9 years ago

Multidimensional Scaling from Below

We give an algorithm, with R code, to minimize the multidimensional scaling stress loss function under the condition that all fitted distances are smaller than or equal to the corresponding dissimilarities.

over 9 years ago

Quadratic Programming with Quadratic Constraints

We give a quick and dirty, but reasonably safe, algorithm for the minimization of a convex quadratic function under convex quadratic constraints. The algorithm minimizes the Lagrangian dual by using a safeguarded Newton method with non-negativity constraints.

over 9 years ago

Multidimensional Scaling with Anarchic Distances

Using anarchic distances means using a different configuration for each dissimilarity. We give the anarchic version of the smacof majorization algorithm, and apply it to additive constants, individual differences, and scaling of asymmetric dissimilarities.

over 9 years ago

Least Squares Solutions of Linear Inequality Systems

We discuss the problem of finding an approximate solution to an overdetermined system of linear inequalities, or an exact solution if the system is consistent. Theory and R code is provided for four different algorithms. Two techniques use active set methods for non-negatively constrained least squares, one uses alternating least squares, and one uses a nonsmooth Newton method.

over 9 years ago

Discrete Minimax by Quadratic Majorization

We construct piecewise quadratic majorizers for minimax problems. This is appled to finding roots of cubics. An application to a Chebyshev versions of MDS loss is also outlined.

over 9 years ago

Majorizing Cubics on Intervals

We illustrate uniform quadratic majorization, sharp quadratic majorization, and sublevel quadratic majorization using the example of a univariate cubic.

over 9 years ago

Derivatives of Low Rank PSD Approximation

In De Leeuw (2008) we studied the derivatives of the least squares rank p approximation in the case of general rectangular matrices. We modify these results for the symmetric positive semi-definite case, using basically the same derivation. We apply the formulas to compute the convergence rate of Thomson’s iterative principal component algorithm for factor analysis.

over 9 years ago

Convergence Rate of ELEGANT Algorithms

We compute the convergence rate of the ELEGANT algorithm for squared distance scaling by using an analytical expression for the derivative of the algorithmic map.

over 9 years ago

Zangwill/Ostrowski Descent Algorithms

This note collects the general results I have used over the years to prove and study convergence of alternating least squares, augmentation, and majorization algorithms. It does not aim for maximum generality or precision.

over 9 years ago

An Alternating Least Squares Approach to Squared Distance Scaling

Alternating Least Squares and Majorization approaches to squared distance scaling are discussed, starting from a "lost paper" from 1975.

over 9 years ago

Block Relaxation as Majorization

We show all block relaxation problems can be reformulated as majorization problems.

almost 10 years ago

Pictures of Stress

A low-dimensional multidimensional scaling example is used to illustrate properties of the stress loss function and of different iteration methods

about 10 years ago

Gower Rank

In Multidimensional Scaling we sometimes find that stress does not decrease if we increase dimensionality. This is explained in this note by using the Gower rank. Two examples with small Gower rank are analyzed.

about 10 years ago

Exceedingly Simple Principal Pivot Transforms

Principal pivoting transforms are described and implemented using R routines with .C() wrappers.

over 10 years ago

RPubs by Jan

List of RPubs in 2016

over 10 years ago

Minimizing qStress for small q

We derive a majorization algorithm for the multidimensional scaling loss function qStress, with q small.

over 10 years ago

APL in R

We provide the main functions of the APL array language that do not have R equivalents, using .C() and .Call() for the C interfaces.

over 10 years ago

In Praise of QR

R and C code for linear least squares, solving linear equations, computing null spaces, and computing Moore-Penrose inverses.

over 10 years ago

Exceedingly Simple Permutations and Combinations

Generate the next permutation or combination in lexicographic order.

over 10 years ago

Singularities and Zero Distances in Multidimensional Scaling

We analyze the stationary equations for Euclidean MDS when there are row-singularities, in the form of zero distances, and column singularities , the form of linear dependencies.

over 10 years ago

More on Inverse Multidimensional Scaling

For a given configuration we find the dissimilarity matrices for which the configuration is a stationary point of the corresponding least squares Euclidean multidimensional scaling problem.

over 10 years ago

Full-dimensional Scaling

We discuss least squares multidimensional scaling in high-dimensional space, where the problem becomes convex and has a unique solution.

over 10 years ago

Exceedingly Simple Isotone Regression with Ties

The primary, secondary, and tertiary approach to ties in monotone regressions are implemented in R, on top of the AS 149 Fortran algorithm of Cran.

over 10 years ago

Second Derivatives of rStress, with Applications

Derivatives of the rStress loss function for MDS are used to derive sensitivity regions and Newton-type algorithms.

over 10 years ago

Differentiability of rStress at a Local Minimum

The MDS loss function rStress is differentiable, directionally differentiable, or not differentiable, depending on the value of the power r.

over 10 years ago

Minimizing rStress using Majorization

Majorization is used to construct a class of algorithms that minimize least squares MDS loss functions such as stress used by Kruskal, stress used by Takane et el, and the loss used by Ramsay. More generally, majorization allows us to fit any positive power of Euclidean distances to a matrix of dissimilarities.

over 10 years ago

Sign In

deleeuw

Jan de Leeuw

Recently Published