Application settings

A chess board in the middle of a game, with black and white pieces in mid-position on a wooden board A wide-angle photo of the interior of a large Amazon fulfillment center, with workers in orange vests pushing wheeled carts down long aisles of merchandise You can tell a lot about the methods developed by an optimization community by the problems they use to illustrate their models and algorithms.

Computer scientists have long used games and stylized control problems to test out their advances in reinforcement learning.

Our work started by optimizing a fleet of trucks, before transitioning to fleets of locomotives. We have worked on optimal learning in drug discovery and materials science before we tackled optimizing the grid for PJM Interconnections. We did extensive research on a wide range of energy storage problems before starting a company, Optimal Dynamics, to take the research to the entire truckload industry.

This page is designed to communicate the vast diversity of applications that can be described as sequential decision problems.

A sample of human activities

Consider any list of fields of activities that involve people, and you can find a rich array of decisions. For example:

Sports – How to perform better personally, how your team should perform better, improving the performance of your fantasy football team.
Finance – Trading a stock portfolio, determining when to sell a stock.
Health – This is a big topic. Some samples are:
- Personal health – Taking a medication, designing a diet.
- Public health – Designing public information campaigns, distributing naloxone kits, enforcing mask mandates.
- Medical decision making – What drug to use, what dosage.
- Clinical trials – Which drugs to send through clinical trials (and for which disease), what population to target, where to set up clinics.
Business – This is another big umbrella, but it is any business setting where you want to cut costs, increase sales, improve profit margins, improve customer service, or improve the performance of a manufacturing process.
Laboratory research – You are trying to design a new drug, create a stronger material, bake a fluffier cake.
E-commerce – How much to bid for ad space on Google or Facebook, how to price a product, what URL to display to attract the most clicks.
Supply chain management – Designing inventory replenishment policies, choosing forecasting methods, selecting suppliers, implementing demand management strategies (including pricing).

Pick any setting, and you will find people trying to invent new materials and products, create software, develop new drugs, and improve a wide range of processes.

Types of decisions

We can describe decisions as coming in three broad flavors depending on whether they are acting on physical, financial, or informational resources.

Three columns showing Physical decisions, Financial decisions, and Informational decisions, each with example activities

Physical decisions

Without question the category of managing physical resources offers the richest class of applications for making decisions. Physical resources include people, machines, facilities, chemicals, a countless variety of inventories, trucks (tractors and trailers), trains (locomotives and cars), aircraft, and all the random assorted equipment that comes into play in each of these environments. Decisions control buying, selling, moving, and any of a range of modifications (repairs, setups, education / training).

The most common source of uncertainty in the management of physical resources is serving customer demands and requests that arrive randomly over time, sometimes with some advance notification. However, other sources of uncertainty involve the time required to perform a task, the quality of the task, and the usual flow of breakdowns and maintenance requests.

Financial decisions

The management of financial resources covers the entire world of investments, whether in financial investments (stocks, certificates, funds), funding major purchases, hedging against currency shifts, and managing loans. The decision made by Ford to arrange a massive leveraged loan package saved the company from bankruptcy during the 2008 financial meltdown.

The most visible financial decisions involve trades (buying and selling investments, managing portfolios) and pricing. Financial trading represents one of the largest and most lucrative areas of application of (deterministic) optimization software.

Uncertainty is the reason that finance is even a field, so it should not be surprising that finance has raised the art of making decisions under uncertainty to a high art. Investment firms and large banks have to show that they have the financial reserves to handle a wide range of events, where the most complex dimension is handling correlations.

Informational decisions

“Information” represents parameters, values, and the choice of functions in the form of rules, terms of a contract, and even methods for making decisions (where we have to choose the method). We might have to design the terms of a contract for purchasing an aircraft, or specify the conditions under which a contract can be exercised or cancelled.

Using an optimization model to make decisions is very familiar in operations research, but the decision of which model to use, how to design the objective function, and the choice of various parameters is often overlooked as important decisions.

A diverse set of problems

Wide-angle photo of a large supermarket interior with long aisles of merchandise To imagine the vast diversity of sequential decision problems, think of the largest supermarket or retail center, and the millions of items it may carry. This is how to envision the population of sequential decision problems. Just consider all the different types of decisions, the different sources, styles and flavors of uncertainty, the variety of objective functions, and the physics that govern the evolution of physical, financial and informational resources. Our universal modeling framework can handle any sequential decision problem, and covers any method for making decisions.

Given the diversity of sequential decision problems, we put the highest priority on describing and modeling problems before the traditional academic focus on algorithms.

Motivating applications

It is hard to emphasize the importance of a rich set of motivating applications if you are going to work on general methodology. This is the only way to avoid falling into the trap of creating what appears to be a general strategy for solving stochastic optimization problems, and then illustrating it on a single class of problems.

Click on a project below to jump to a brief description:

Truckload fleet management for North American Van Lines (1984–1986)
Real-time dispatching for Burlington Motor Carriers (1990s)
Locomotive optimization for Norfolk Southern Railway (1996–2008)
ADP-based truckload fleet simulator for Schneider National (2004–2008)
Drug discovery (2008–2012)
Optimal learning in materials science (2008–2014)
Modeling the PJM grid for high levels of offshore wind (2008–2012)
Energy storage research (2011–2020)
Modeling wind and prices (2010–2015)
Managing clinical trials (2016–2018)
Optimizing utility trucks after a storm (2016–2018)
Optimal Dynamics — Optimizing the truckload industry (2017–present)

Truckload fleet management for North American Van Lines (1984–1986)

A schematic of the LoadMAP model showing a forecast network of truckload moves over time on a grey grid, with blue piecewise-linear value-function approximations overlaid and a North American Van Lines truck in the bottom right corner Our first project with a truckload carrier modeled the fleet at an aggregate level (“trucks”) using an early version of what would grow into our work on approximate dynamic programming. This model, called LoadMAP, replaced the early deterministic models of the future (the grey network in the background) with a stochastic model using piecewise-linear value functions. This was the first production model using a stochastic model of future loads. The project was a finalist in the prestigious Franz Edelman competition conducted by Informs.

Real-time dispatching for Burlington Motor Carriers (1990s)

A network diagram of the MicroMAP model showing an 'assignment network' at t=0 with driver and task nodes, transitioning into a multi-period 'forecast network' for t=1, 2, 3 with arcs labeled for driver-to-task assignments and empty repositioning This project modeled each individual driver in the first time period at a high level of detail, making it possible to capture home domicile, truck type, and hours of service. The model, called MicroMAP, was an extension of LoadMAP since it blended a detailed assignment model for the initial dispatch with the nonlinear value-function approximations in the future.

Locomotive optimization for Norfolk Southern Railway (1996–2008)

Front view of a Norfolk Southern locomotive number 9533 emerging from a snowy forest, with the railroad's white-horse logo on the front This was our first production implementation of a locomotive planning model based on a formal representation as a dynamic program solved using approximate dynamic programming. The use of ADP allowed us to reduce an intractably large deterministic optimization model to a sequence of much smaller integer programs that could be solved with CPLEX. The model (developed by Belgacem Bouzaiene-Ayari) was called PLASMA (Princeton Locomotive and Shop Management system) and is still running in 2026. A complete discussion of PLASMA is given here.

ADP-based truckload fleet simulator for Schneider National (2004–2008)

A map of the continental United States overlaid with hundreds of glowing dots representing truckload driver locations, with an orange Schneider National tractor-trailer in the upper-left corner Based on the architecture of the locomotive optimization model, this model used approximate dynamic programming as an “optimizing simulator” to simulate the dispatching of drivers over a month. The goal was to study a wide range of policy decisions such as domiciling strategies and changes in work rules. The model, developed by Hugo Simao, won the Daniel H. Wagner Prize for Excellence in Operations Research Practice in 2009. This model was later licensed to Optimal Dynamics and still forms the foundation of their planning tools. For a complete discussion of the project for Schneider National, click here.

Drug discovery (2008–2012)

A diagram showing a list of candidate atoms (H, F, Cl, Br, I, Me) being attached at positions X and Y on a base molecular scaffold, illustrating combinatorial substituent selection in drug discovery Building on our work in optimal learning, we became involved in the process of drug discovery, which motivated the use of the recently developed knowledge gradient algorithm (by Peter Frazier), which we adapted to problems where beliefs were represented using a linear model. This work was the basis of the senior thesis of Diana Negoescu; the work won an honorable mention in the Doing Good with Good OR competition in 2009. For more on the knowledge gradient used to solve this problem, click here.

Optimal learning in materials science (2008–2014)

A 2D color-coded utility surface plotted against 'Inner water droplet diameter' on the vertical axis and 'Oil droplet diameter (nm)' on the horizontal axis, with high-utility regions in the upper-right (blue) and low-utility regions in red along the lower edge Motivated by the “exploration vs. exploitation” problem well known in approximate dynamic programming, we started a research initiative into optimal learning (efficient collection of information) which led to a major AFOSR contract developing optimal learning methods in the context of materials science. This work launched an entirely new set of learning methods motivated by different types of belief models and experimental settings (joint work with Peter Frazier). See the knowledge gradient page for the broader research lineage.

Modeling the PJM grid for high levels of offshore wind (2008–2012)

A montage of the PJM transmission grid overlaid with photographs of nuclear cooling towers, fossil-fuel plants, transmission lines, solar arrays, wind turbines, and a hydroelectric dam The MAOWIT study (Mid-Atlantic Offshore Wind Interconnection and Transmission), joint with the University of Delaware (funded by DOE), produced a detailed model of the PJM power grid (called SMART-ISO) that allowed us to undertake studies of increasing levels of power from offshore wind turbines. Developed by Hugo Simao, the goal was to model PJM’s planning process at a high level of detail, using a carefully calibrated model of energy from wind. See the SMART-ISO page for a detailed description.

Energy storage research (2011–2020)

A schematic showing wind energy (time-series chart and wind turbines) and grid prices (time-series chart and transmission lines) feeding into a Megapack battery storage system, which in turn supplies a building with demand load (time-series chart on the right) Over a 10-year period I supervised research into a wide range of energy storage problems, with different time scales and configurations. This class of applications posed different modeling challenges (especially in the modeling of uncertainty) and, most important, the need to use all four classes of policies — sometimes on the same configuration (as we did for the figure to the right). One application required the proper (and widely overlooked) problem of handling rolling forecasts, which astonishingly seems to have been overlooked in the inventory literature. This research is reviewed in more depth on the Energy storage research page.

Modeling wind and prices (2010–2015)

A line graph over 24 hours showing a single 'Forecast made at midnight' (blue), the 'Actual' wind realization (black), and a fan of 'Rolling forecasts, updated each hour' (multicolored) — illustrating how forecasts shift each hour relative to the actual observed wind Energy poses complex stochastic prices to capture the volatility of electricity prices and the dynamics of rolling forecasts of wind. Combine these issues with the correlations over space and time on multiple time scales, and you get an appreciation of the difficulty of stochastic modeling. This is described in considerable depth in chapter 10 of Reinforcement Learning and Stochastic Optimization.

Managing clinical trials (2016–2018)

A diagram titled 'Phases of Clinical Trials' showing four sequential blue arrow-shapes labeled Preclinical (lab and animal studies, with a mouse icon), Phase I (Safety Studies, tens of people), Phase II (Dose-finding Efficacy, hundreds), and Phase III (Confirm efficacy and safety, thousands) Drug testing involves a variety of sequential decisions, starting with the choice of drug, the choice of disease(s) the drug might be applied to, and the entire process of managing the trial through each of the different phases — a blend of information collection along with the management of significant levels of physical and financial resources. See chapter 14 of Sequential Decision Analytics and Modeling for an illustration of different types of policies for managing the testing process.

Optimizing utility trucks after a storm (2016–2018)

A utility truck with a cherry-picker boom positioned next to a tree with damaged limbs, with a utility worker on the ground assessing the damage after a storm The largest New Jersey utility, PSE&G, gave us the problem of optimizing the routing of utility trucks after a storm. The utility received a series of “lights out” calls from customers who had lost power but had to find where the power lines had been damaged. We modeled it as an information-collecting vehicle routing problem — a sequential decision problem with both a physical state (the status of the truck) and a belief state (where we thought an outage may have happened). See the Information-Collecting Vehicle Routing Problem page for research on this class of problems.

Optimal Dynamics — Optimizing the truckload industry (2017–present)

Cover of the Optimal Dynamics technical white paper titled 'Sequential Decision Analytics For The Truckload Industry,' authored by Warren B. Powell, Chief Analytics Officer at Optimal Dynamics, dated December 14, 2020 In 2017 Optimal Dynamics was founded to bring the work in trucking to industry, which required that we engage in all the dimensions required to make a decision technology successful — spanning data ingestion, decision-making, implementation, and business process change. I wrote a white paper that includes a section on “Designing policies” that illustrates how we are using all four classes of policies. Download the white paper here.