deaviz

High-dimensional visualization methods for data envelopment analysis (DEA), gathered into a single R package. It brings together techniques that have appeared in the literature but remained scattered and largely unimplemented: cross-efficiency matrix unfolding, the Porembski reference-set graph, PCA biplots, multidimensional-scaling colour-plots, self-organizing maps, the Costa bi-dimensional efficient frontier, parallel coordinates, radar charts, panel-data trajectory biplots, peer and reference networks, and a set of descriptive plots.

The methods follow the author’s PhD thesis (Ashkiani, 2019).

Design

Everything is built around one validated object, dea_data, which every function accepts (plain data frames are coerced automatically). Functions follow a consistent naming convention plot_<source>_<method>:

Most plotting functions share the same four arguments:

A shared colour scheme is used throughout: the colour-blind-safe Okabe–Ito palette for categories and viridis for continuous scales.

The Benchmarking package is the DEA engine. The modelling and plotting packages are in Suggests; each function checks for the ones it needs and errors informatively if they are missing.

Installation

# install.packages("devtools")
devtools::install_local("deaviz")          # from the package directory

# install the suggested engines you intend to use, e.g.:
install.packages(c("Benchmarking", "ggplot2", "plotly", "lpSolve",
                   "smacof", "kohonen", "MASS", "ggrepel"))

Quick start

library(deaviz)

# Build the data object (the example data is not i_/o_ prefixed,
# so name the inputs and outputs explicitly)
d <- dea_data(
  chinese_cities,
  inputs  = c("industrial_labour_force", "working_funds", "investments"),
  outputs = c("gross_industrial_output", "profit_and_tax", "retail_sales"),
  id      = "DMU"
)

# Analysis
eff <- compute_efficiency(d, rts = "crs")          # Farrell efficiency
ce  <- compute_cross_efficiency(d)                 # cross-efficiency matrix

# Visualization (static ggplot2 by default)
plot_efficiency_distributions(d)
plot_io_pca_biplot(d)
plot_cem_heatmap(ce)
plot_cem_unfolding(ce)
plot_io_costa_frontier(d)
plot_io_peer_network(d, size_by_peers = TRUE)

# The same plot, interactive (plotly), with labels turned off
plot_io_pca_biplot(d, interactive = TRUE, labels = FALSE)

With your own data you can skip the explicit inputs/outputs by prefixing columns i_ (inputs) and o_ (outputs); dea_data() then recognises them automatically.

Function overview

Area Functions
Data object dea_data(), as_dea_data()
Efficiency analysis compute_efficiency(), compute_multiplier_weights()
Cross-efficiency analysis compute_cross_efficiency(), compute_cross_efficiency_weights(), standardize_weights()
SOM analysis compute_som()
Descriptive (inputs/outputs) plot_io_distributions(), plot_io_scatter(), plot_io_heatmap(), plot_io_parcoo(), plot_io_radar()
Efficiency plots plot_efficiency_distributions(), plot_io_efficients()
Projections & frontier plot_io_pca_biplot(), plot_io_mds(), plot_io_costa_frontier(), plot_io_3dscatter()
Reference networks plot_io_lambda_network(), plot_io_peer_network()
Panel data plot_panel_io_biplot()
Self-organizing maps plot_io_som(), plot_io_som_components()
Cross-efficiency matrix plot_cem_heatmap(), plot_cem_unfolding(), plot_cem_weights_heatmap()
Data chinese_cities

Citation

If you use this package, please cite the thesis it is based on:

Ashkiani, S. (2019). Four Essays on Data Visualization and Anomaly Detection of Data Envelopment Analysis Problems [PhD thesis, Universitat Autonoma de Barcelona]. https://ddd.uab.cat/record/240333

See citation("deaviz") for the BibTeX entry.

Development Note

This package implements my DEA-Viz methods. An AI coding assistant (Claude, Anthropic) was used to help implement and refine the R code from my specifications and prototype functions; I designed the methods, and reviewed, tested, and validated all code, taking full responsibility for the software.

License

AGPL-3.