Ratio is an interdisciplinary research institute, with a research focus on the conditions of business and enterprise.

+4684415900

info@ratio.se

802002-5212

Sveavägen 59 4trp

Box 3203

103 64 Stockholm

Bankgiro: 512-6578

About

  • About us
  • About
  • Contact us
  • Media
  • News archive
  • Cooperations
  • Eli F. Heckscher Lectures

Research

  • Areas
  • Labour Market Research
  • Competitiveness Research
  • Climate and Environmental Research
  • Ongoing research
  • Working Paper Series
People

Publications

  • Publications
  • Publications
Search Swedish flag iconSV
Swedish flag iconSVSearch

About

  • About us

    • About
    • Contact us
  • Media

    • News archive
  • Cooperations

    • Eli F. Heckscher Lectures

Research

  • Areas

    • Labour Market Research
    • Competitiveness Research
    • Climate and Environmental Research
  • Ongoing research

    • Working Paper Series
  • People
  • Publications

    • Publications

      • Publications

    Using machine learning to select variables in data envelopment analysis: Simulations and application using electricity distribution data

    PublicationArticle (with peer review)
    Magnus Söderberg

    Abstract

    Agencies that regulate electricity providers often apply nonparametric data envelopment analysis (DEA) to assess the relative efficiency of each firm. The reliability and validity of DEA are contingent upon selecting relevant input variables. In the era of big (wide) data, the assumptions of traditional variable selection techniques are often violated due to challenges related to high-dimensional data and their standard empirical properties. Currently, regulators have access to a large number of potential input variables. Therefore, our aim is to introduce new machine learning methods for regulators of the energy market. We also propose a new two-step analytical approach where, in the first step, the machine learning-based adaptive least absolute shrinkage and selection operator (ALASSO) is used to select variables and, in the second step, selected variables are used in a DEA model. In contrast to previous research, we find, by using a more realistic data-generating process common for production functions (i.e., Cobb–Douglas and Translog), that the performance of different machine learning techniques differs substantially in different empirically relevant situations. Simulations also reveal that the ALASSO is superior to other machine learning and regression-based methods when the collinearity is low or moderate. However, in situations of multicollinearity, the LASSO approach exhibits the best performance. We also use real data from the Swedish electricity distribution market to illustrate the empirical relevance of selecting the most appropriate variable selection method.

    The article in total can be read here.

    Duras, T., Javed, F., Månsson, K., Sjölander, P., & Söderberg, M. (2023). Using machine learning to select variables in data envelopment analysis: Simulations and application using electricity distribution data. Energy Economics, 106621.

    Details

    Author

    Duras, T., Javed, F., Månsson, K., Sjölander, P., & Söderberg, M.

    Publication year

    2023

    Published in

    Energy Economics, 106621.

    Related

    Magnus Söderberg
    Professor

    magnus.soderberg@ratio.se


    Similar content

    A penalization approach for estimating inefficiency in stochastic frontier panel models
    Working paperPublication
    Tchatoka, F. D., Söderberg, M., Hakeem, M. A.
    Download
    Publication year

    2025

    Published in

    University of Adelaide, School of Economics and Public Policy Working Paper.

    Abstract

    Efficiency analysis is essential for evaluating the performance of entities that deliver essential or standardized services. The estimator proposed by Jondrow et al. (1982) is widely used in this context, but it has been criticized for several shortcomings: it tends to bias inefficiency estimates toward the mean, distorts the distribution, and misrepresents the conditional distribution of inefficiency—especially in cross-sectional data.

    Zeebari et al. (2023) propose a regularization-based alternative that aligns sample and theoretical moments; however, this method is primarily designed for cross-sectional applications and does not extend naturally to panel data.

    In response, this paper introduces a penalized mode estimator for unit inefficiency in panel data. The estimator accounts for heteroskedasticity in both inefficiency and idiosyncratic errors. A closed-form expression is derived, and Monte Carlo simulations demonstrate its superior performance compared to existing methods. An empirical application using data from electricity providers in Australia, Canada, and New Zealand highlights the practical advantages of the proposed approach.

    Scale properties and efficient network structures in the Swedish electricity distribution market
    Article (with peer review)Publication
    Söderberg, M., Vesterberg, M.
    Publication year

    2025

    Published in

    Journal of Regulatory Economics

    Abstract

    This paper examines the Swedish electricity distribution sector to highlight three key findings. First, we identify significant economies of scale among electricity distribution firms, indicating that larger firms operate more efficiently. Second, we explore alternative market structures and demonstrate that these can substantially reduce the aggregated costs of electricity distribution. Third, we use novel survey data to show that firms perceive the economic incentives for mergers to be insufficient. These findings suggest that policymakers should consider creating a regulatory environment that encourages consolidation and enhance efficiency in the sector.

    Selected publication

    Absolute income mobility and the effect of parent generation inequality: An extended decomposition approach
    Liss, E., Korpi, M., & Wennberg, K.

    Selected publication

    No evidence of counteracting policy effects on European solar power invention and diffusion
    Grafström, J., & Poudineh, R.
    Are CEOs judged on how cost efficient their firms are?
    Article (with peer review)Publication
    Månsson, K., Qasim, M., Söderberg, M.
    Download
    Publication year

    2025

    Published in

    Energy Economics

    Abstract

    This paper investigates whether executive boards consider firm-specific inefficiencies when they change CEOs in the Swedish electricity distribution sector. Firm-level inefficiencies are calculated using data from all Swedish electricity distributors from 2001 to 2022 and a data envelopment analysis (DEA) approach.

    DEA has advantages over standard financial key performance indicators since it controls for heterogeneity in inputs and outputs. It is also frequently employed by energy regulators to calculate relative cost inefficiencies.

    Our baseline approach uses a multilevel model and investigates the relationship between inefficiency and CEO between-effects. This analysis shows that 9–15 % of the variation in inefficiency can be attributed to the CEO effect.

    The second modeling approach quantifies the CEO effect using a synthetic difference-in-differences approach, focusing on firms that have changed CEOs. The results reveal that new CEOs reduce cost inefficiency more when they succeed CEOs who were forced to leave.

    Show more