RPubs

by RStudio

paterno

Marc Paterno

Recently Published

Data Structure Profling

This document uses some microbenchmarks to choose which data structures to use as candidates for detailed performance studies, to replace std::map<int,int> in the SimPhotonsLite data product.

25 days ago

PDFastSimPAR physics validation: 2 of N

This is the second in my series of physics validation documents for my work on the PDFastSimPAR module. This document looks at the effects of changing the fast_acos algorithm.

2 months ago

Naive Parallel Minimization

This document provides the start of analysis of a naive parallel global minimization based upon random start locations for BFGS local minimization.

6 months ago

Dealing with Multiple Local Minima: the Rastrigin Test Function

This document shows some performance results of using the dlib library’s find_min_global function on the Rastrigin function in 2-5 dimensions.

7 months ago

Verification of HEPnOS and ROOT runs on csresearch

The purpose of this document is to verify that our timing measurements are well-understood.

about 1 year ago

Expression Template Benchmarking

This document explores the code generation and performance of expression templates in the context of [*dual numbers*](https://en.wikipedia.org/wiki/Dual_number). In particular, we are concerned with artithmetic operations on dual numbers that occur in the field of automatic differentiation.

about 1 year ago

Data Overview 2

about 1 year ago

CosmoSIS Integration Modules

A brief and informal description of CosmoSIS integration modules.

about 1 year ago

Data Overview

This is a brief look at the timing data from ICARUS workflow runs using HEPnOS on the csresearch machines.

about 1 year ago

Analysis of MH CosmoSIS Output

This is an example containing a few visualizations of the output of CosmoSIS, using the Metropolis-Hastings sampler with multiple MPI ranks.

almost 2 years ago

Programming GPUs: A taste of CUDA and Kokkos

The goal of this presentation is to give the flavor of GPU programming, as opposed to both CPU (“normal”) programming and the use of GPU-accelerated libraries or tools (such as are common in the machine learning field). It will not be sufficient for you to go out and start programming. I hope it is sufficient to help you decide whether you are interested in further study of GPU programming to support your own work.

about 2 years ago

Analysis of m-Cubes error estimates

This document contains preliminary analysis of the error estimates returned by the version of the m-Cubes algorithm used in our paper submitted to ISC HP22.

over 2 years ago

Phase 1 regions

Preliminary analysis of detailed Phase 1 region information, using the DES integrand.

over 3 years ago

Scaling analysis of per-rank performance of eventselection

This is a preliminary analysis of the scaling of the eventselection program, reading from HEPnOS.

over 3 years ago

Analysis of per-rank performance of events election

This is a continued analysis of the June 30 eventselection run, using HEPnOS.

over 3 years ago

PandAna Performance part 3

Preliminary analysis of reading speed of PandAna, and the effect of both HDF5 compression and striping of files on the Cori filesystem.

over 3 years ago

PandAna Performance part 2

Continued analysis of PandAna performance, this time with a larger dataset and more MPI ranks.

over 3 years ago

PandAna Performance

This is a (not yet complete) analysis of the parallel performance of PandAna.

over 3 years ago

A first look at grid job output

Initial results of snapshot runs.

over 3 years ago

Mathematica integration of Genz_1_abs in 5D

This document shows the results of using Mathematica 12.1.1 to numerically integrate the Genz_1 (absolute value) function in 5.

over 3 years ago

Timing analysis of June 30 run

Preliminary analysis of timing data from the June 30 HEPnOS run.

over 3 years ago

Parallel CUHRE subregions

Using the integrand Genz_1abs_5d|, this document contains a very preliminary analysis of the subregions produced by the parallel CUHRE algorithm.

almost 4 years ago

Comparing VEGAS and CUHRE for Genz 1 (absolute value) in 5d

This document shows a comparison of the speed of the VEGAS and CUHRE algorithms, as implemented in the CUBA (http://www.feynarts.de/cuba/) library, and wrapped by cubacpp (https://bitbucket.org/mpaterno/cubacpp).

almost 4 years ago

Genz function 1 in 8d

This document shows a performance comparison between the serial and parallel implementations of the CUHRE algorithm for a non-positive-definite integrand.

almost 4 years ago

Where CUHRE evaluates functions

This document shows where the CUHRE algorithm evaluates the function it is integrating. Unlike a Monte Carlo algorithm (such as VEGAS), CUHRE evaluates the function at a set of determinalistically chosen points.

almost 4 years ago