RPubs

CBDA Vignette version 1.0

Guide to the CBDA vignette for version 1.0.0 of the CBDA R package.

over 7 years ago

This is a summary of a set of 10 replications of a single experiment (i.e., experiment #6 with the Binomial dataset 3) to test the robustness of the CBDA-SL. Robustness is shown by the consistent selection of similar top features across replications. Each replication uses the same validation set for prediction purposes, but the CBDA-SL protocol is run with a different seed for each replication.

almost 8 years ago
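One way to put a number on "consistent selection of similar top features across replications" is the mean pairwise overlap between the top-feature sets. The sketch below uses Jaccard similarity, which is an assumption made for illustration and not a measure named in the entry above; the function names and the toy feature indices are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two collections of feature indices."""
    a, b = set(a), set(b)
    union = a | b
    # Two empty sets are treated as identical.
    return len(a & b) / len(union) if union else 1.0


def mean_pairwise_jaccard(top_features_per_replication):
    """Average Jaccard similarity over all pairs of replications."""
    pairs = list(combinations(top_features_per_replication, 2))
    if not pairs:
        raise ValueError("need at least two replications")
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Made-up top-feature sets for three replications (illustration only).
toy_replications = [[1, 2, 3, 4], [1, 2, 3, 5], [1, 2, 3, 4]]
score = mean_pairwise_jaccard(toy_replications)
print(round(score, 3))
```

A score close to 1 would indicate that replications run with different seeds select nearly the same top features.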

Results_Binomial_Combined

almost 8 years ago

Null_datset_5_9000_1000_8exps

almost 8 years ago

Results_Binomial_5_Combined

almost 8 years ago

Null_datset_3_9000_1000_10exps

almost 8 years ago

Results_Binomial_3_Combined

The last histograms combine all the experiments together to sift out the signal across all the different experimental designs for the CBDA and to compare with the Null dataset results.

almost 8 years ago

Null_final_9000_1000_10exps

The final CBDA histogram generated with the MSE metric on the combined results shows an almost flat distribution (i.e., no spikes/signals). Although the Null dataset has no signal in it, the KO filter algorithm returns a few spikes (i.e., false positives). Similarly, the CBDA histogram generated with the Accuracy metric returns some spikes (not matching the ones returned by the KO filter). From the Binomial dataset analysis and from the results in Figure X, we concluded that the MSE metric is the most reliable to drive CBDA feature mining. Thus, the Null dataset analysis confirms that the CBDA detects no signal if we use the MSE metric.

almost 8 years ago

Binomial_final_9000_1000_12exps

almost 8 years ago

Plot1

almost 8 years ago

DataSifter_R1_single_call_TOC

about 8 years ago

ADNI_MN3_accuracy_top50_10features_exp2

about 8 years ago

Binomial_dataset_5_final

about 8 years ago

DataSifter_R1

about 8 years ago

Binomial_dataset_3_final

about 8 years ago

Binomial_dataset_1_final

about 8 years ago

Binomial_dataset_final

about 8 years ago

Null_Dataset_1_final

about 8 years ago

Binomial 300-300 Amplitude 10

about 8 years ago

Binomial 300-900 Amplitude 10

about 8 years ago

Test Binomial 1 - 6 experiments - wrong

about 8 years ago

ADNI_MN3_top50_exp1

about 8 years ago

ADNI MN3 BALANCED

about 8 years ago

Binomial_dataset_Accuracy

The ranking is done here by accuracy of each single prediction

about 8 years ago

Binomial_dataset_MSE

The ranking is done here by MSE

about 8 years ago

CBDA - Binomial Dataset 1 Sd

about 8 years ago

Data Sifter v8

about 8 years ago

Data Sifter v7 - new obfuscation scheme

about 8 years ago

Binomial_dataset_new_sd

about 8 years ago

Binomial_dataset_5_new_sd

about 8 years ago

Binomial_dataset_1_new_sd

about 8 years ago

Binomial_dataset_3_new_sd

about 8 years ago

Null_Dataset

about 8 years ago

Null_Dataset_1

about 8 years ago

Null_Dataset_3

about 8 years ago

Null_Dataset_5

about 8 years ago

Data Sifter v4 - SINGLE INSTANCE

over 8 years ago

300-100 Exp1 Binomial

over 8 years ago

300-300 Exp1 Binomial

over 8 years ago

300-900 Exp1 Binomial

over 8 years ago

Data Sifter v4

over 8 years ago

DataSifter - plotly

over 8 years ago

Data Sifter V1 - 1 _4 MAX OBFUSCATION

over 8 years ago

Data Sifter V1 - MEDIUM OBFUSCATION

over 8 years ago

Data Sifter V1 - NO OBFUSCATION

over 8 years ago

Data Sifter V1 - MAX OBFUSCATION

over 8 years ago

Data Sifter v0

over 8 years ago

ABIDE - experiments 1 and 3

over 8 years ago

ADNI MN3 only

over 8 years ago

ADNI MN4 Binomial without Series Identifier

over 8 years ago

ADNI MN3 Binomial without SeriesIdent

over 8 years ago

ADNI MN4 Binomial without Series Identifier

over 8 years ago

ABIDE classification (5 Experiments)

Two categories

over 8 years ago

ADNI classification 2 stages (MN4 & Binomial, 4 experiments)

These are the results of classifying the ADNI patients as Normal, LMCI, MCI or AD (4 categories). The features selected at the end of the analysis of the 4 experiments are then merged with the features selected at the end of the analysis of 9 experiments previously analyzed with a binomial set (e.g., AD vs Normal only). This two-stage approach improves the AD class sensitivity, which is missed when using only a one-stage approach with multinomial classification (see http://rpubs.com/simeonem/ADNI_MN4).

over 8 years ago

ADNI classification 2 stages (MN3 & Binomial, 6 experiments)

These are the results of classifying the ADNI patients as Normal, MCI or AD. The groups MCI and LMCI are merged and labeled MCI (multinomial with 3 categories). The features selected at the end of the analysis of Exp 1 are then merged with the features selected at the end of the analysis of 9 experiments previously analyzed with a binomial set (e.g., AD vs Normal only). This two-stage approach greatly improves the AD class sensitivity, which is missed when using only a one-stage approach with multinomial classification (see http://rpubs.com/simeonem/ADNI_MN3_Exp1-3-7-9).

over 8 years ago

ADNI MN4 + Binomial

ADNI dataset classification results on Experiment 1 only. The methodology here is to: [STAGE 1] run the full CBDA-SL (i.e., 9000 jobs) on Exp 1; [STAGE 2] run a single CBDA-SL job merging the features selected by STAGE 1 with the binomial stage Normal vs AD (i.e., 9000x9 total jobs, see http://rpubs.com/simeonem/ADNI_Confusion_Matrix for details).

over 8 years ago

ADNI classification 1 stage - MN4

Multinomial classification with CBDA-SL with 4 categories on experiment 1

over 8 years ago

ADNI classification 2 stages

These are the results of classifying the ADNI patients as Normal, MCI or AD. The groups MCI and LMCI are merged and labeled MCI (multinomial with 3 categories). The features selected at the end of the analysis of Exp 1 are then merged with the features selected at the end of the analysis of 9 experiments previously analyzed with a binomial set (e.g., AD vs Normal only). This two-stage approach greatly improves the AD class sensitivity, which is missed when using only a one-stage approach with multinomial classification (see http://rpubs.com/simeonem/ADNI_MN3_exp1).

over 8 years ago

ADNI dataset - Multinomial Classification on Exp 1

This set of results is based only on Experiment 1. The strategy here is to merge the groups "LMCI" and "MCI" into one, to make it a multinomial with 3 categories (labeled as MN3).

over 8 years ago
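The label merge described in the entry above (folding "LMCI" into "MCI" to obtain the 3-category MN3 outcome) can be sketched as follows. This is a minimal illustration; the function name and the toy labels are hypothetical and are not taken from the ADNI data or the CBDA package.

```python
def merge_lmci_into_mci(labels):
    """Relabel every "LMCI" as "MCI"; all other labels are left unchanged."""
    return ["MCI" if label == "LMCI" else label for label in labels]


# Made-up diagnosis labels (illustration only).
toy_labels = ["Normal", "LMCI", "MCI", "AD", "LMCI"]
merged = merge_lmci_into_mci(toy_labels)
print(merged)
print(sorted(set(merged)))  # the three remaining categories
```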

ADNI results with Confusion Matrix

over 8 years ago

ADNI dataset results

CBDA-SL and Knockoff filter results over 9 experiments. No missing-value input has been used, because this is a real dataset with some NA values that will be filled by the missForest imputation algorithm. The other specs are: i) SSR of 40-60%, 60-80% and 100%; ii) FSR of 5-15%, 15-30% and 30-50%. These specs make a total of 9 different experiments.

over 8 years ago
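The count of 9 experiments in the entry above follows from crossing the three SSR settings with the three FSR settings. A minimal sketch of that grid is shown below; the variable names and the row-by-row numbering of experiments are assumptions for illustration and may not match the numbering used in the actual analysis.

```python
from itertools import product

# Ranges quoted in the entry, stored as (low, high) percentages.
# The single 100% setting is stored as a degenerate range.
ssr_ranges = [(40, 60), (60, 80), (100, 100)]
fsr_ranges = [(5, 15), (15, 30), (30, 50)]

# Hypothetical numbering: enumerate the SSR x FSR combinations in order.
experiments = [
    {"experiment": i, "ssr": ssr, "fsr": fsr}
    for i, (ssr, fsr) in enumerate(product(ssr_ranges, fsr_ranges), start=1)
]

for exp in experiments:
    print(exp)
```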

NULL dataset - 9000 jobs

Knockoff filter and CBDA-SL results on 12 experiments for a NULL dataset

over 8 years ago

Binomial_Test_Results 9000 jobs

over 8 years ago

Results Random/Binomial Dataset

over 8 years ago

CBDA-SL and Knockoff Filter Results on the Binomial Dataset

over 8 years ago

Gaussian dataset - CBDA-SL and Knockoff filter

Results on 30 experiments

over 8 years ago

Histograms CBDA-SL - first set of results

Absolute and relative frequency plots of 5 CBDA-SL experiments, each with 5000 iterations of the SuperLearner (SL) function. Dataset=MRI. Each plot represents the occurrences of each feature across the top 20 predictions returned by the SL function. The last 2 plots combine all the results in a single histogram.

over 8 years ago
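The absolute and relative frequency plots described in the entry above are built from counts of how often each feature occurs among the top-ranked predictions. The sketch below shows one way to compute such counts. It assumes that relative frequency means the fraction of top predictions containing the feature, which the entry does not specify, and the function name and toy data are hypothetical.

```python
from collections import Counter


def feature_frequencies(top_predictions):
    """Count feature occurrences across a list of feature subsets.

    Returns (absolute counts, relative frequencies), where the relative
    frequency is the share of predictions that include the feature.
    """
    counts = Counter()
    for features in top_predictions:
        counts.update(set(features))  # count a feature once per prediction
    total = len(top_predictions)
    relative = {feature: n / total for feature, n in counts.items()}
    return counts, relative


# Made-up feature indices for three top-ranked predictions (illustration only).
toy_top = [[1, 4, 7], [4, 7, 9], [2, 4, 8]]
absolute, relative = feature_frequencies(toy_top)
print(absolute.most_common(3))
```

Plotting `absolute` or `relative` as a bar chart over feature indices would give a histogram of the kind the entry refers to.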


simeonem

Simeone Marino

Recently Published