← Return to Blog Index

Size Curve Optimization

Fitting Demand by Store and Size
Blog Series #17 | Retail AI & Analytics
Traditional approaches use standard size curves based on population statistics: the classic bell curve with most demand in middle sizes. This assumes all stores and products follow the same distribution. Reality is far more varied. Using standard curves creates systematic mismatches where some stores consistently run out of certain sizes while accumulating others. Size curve optimization solves this challenge by matching inventory allocation to actual demand patterns across locations and product categories.

The Size Curve Problem

Traditional retail approaches treat size distribution as uniform—send every store the same percentage of XS, S, M, L, and XL inventory. This one-size-fits-all methodology seems logical when based on national population statistics, but it ignores a fundamental reality: customer demographics and preferences vary dramatically by location, channel, and product type.

The result? Predictable, costly patterns emerge across retail networks. College town stores perpetually stock out of small sizes while drowning in excess large inventory. Suburban family locations run short on medium and large sizes. Urban flagship stores serve different customer segments than outlet centers, yet receive identical size allocations. These systematic mismatches translate directly into lost sales, excessive markdowns, frustrated customers, and expensive inter-store transfers.

The Cost of Generic Size Curves:
  • Lost sales when popular sizes stock out early in the season
  • Deep markdowns on slow-moving sizes to clear excess inventory
  • Emergency transfers between stores, adding handling costs and delays
  • Customer dissatisfaction when preferred sizes are unavailable
  • Reduced inventory productivity and cash flow

Store-Specific Size Demand Patterns

Store-Specific Size Demand

Each store has a unique customer base with distinct size preferences reflecting local demographics, customer segmentation, and shopping patterns. A downtown professional store attracts different body types and style preferences than a college campus location or a suburban family shopping center.

Understanding Local Demand Drivers

Demographics matter. Age, income, ethnicity, and lifestyle create measurable differences in size distributions. Younger populations skew smaller. Areas with specific ethnic concentrations show different average body types. Higher-income neighborhoods often correlate with more fitness-focused lifestyles and corresponding size preferences.

Store format influences selection. Flagship locations in urban centers serve style-conscious customers seeking the latest trends in fitted silhouettes. Outlet centers attract value shoppers less concerned with cutting-edge styles, often showing broader size distributions including more extended sizes. Mall stores in family-oriented suburbs see balanced middle-size demand.

Competitive context shapes traffic. A store near a specialty plus-size retailer may see different size patterns as some customers segment themselves to competitors. Conversely, being the only option in a geographic area creates more representative demand across all sizes.

Machine Learning Identifies Store Patterns

  • Historical analysis: Examines 12-24 months of sales data by size and location
  • Clustering algorithms: Groups stores with similar size demand characteristics
  • Stockout correction: Adjusts for periods when sizes were depleted, revealing true latent demand
  • Demographic integration: Incorporates external data on trade area characteristics
  • Continuous learning: Updates curves as customer preferences evolve over time

Machine learning models identify store-specific size curves from historical sales data while controlling for stockout bias—the phenomenon where depleted sizes appear to have lower demand simply because they weren't available. By estimating unconstrained demand rather than merely observed sales, these models reveal true customer preferences.

Product-Specific Size Curves

Size demand varies not just by store, but by product type—even within the same location. The size distribution for athletic wear differs fundamentally from business casual or evening wear. Product characteristics drive these variations in predictable ways that machine learning can capture.

Category-Level Variations

Athletic and activewear typically shows a flatter, smaller-skewed distribution. Compression fabrics and performance fits encourage size-down behavior. Customers purchasing workout clothing often aspire to or maintain slimmer body types, shifting demand toward XS and S.

Outerwear shifts larger because layering requires additional room. A customer who wears Medium tops often purchases Large jackets. Cold weather markets show more pronounced shifting as heavier layering is common. This category also sees longer purchase cycles, with customers holding inventory across seasons.

Denim presents unique complexity. Two-dimensional sizing (waist and inseam) creates a matrix rather than a simple curve. A 32-inch waist might need a 30-inch inseam in one store but a 32-inch inseam in another, depending on customer height distributions. Stretch versus rigid denim further complicates the picture, as stretch enables size-down behavior impossible with traditional fabrics.

Women's categories show higher variability than men's due to less standardized sizing, more frequent style changes, and broader acceptance of size fluidity. The same customer might wear Small in one brand's tops, Medium in another, creating noise that machine learning must filter to identify true demand.

Product Attributes Predict Size Distribution

Machine learning models analyze which product characteristics predict size curves:

  • Fabric content: Stretch percentage, weight, drape characteristics
  • Fit philosophy: Slim, regular, relaxed, oversized styling
  • Price point: Premium products often skew smaller sizes
  • Seasonal timing: Spring/summer versus fall/winter buying patterns
  • Brand heritage: Established sizing reputations (runs small/large)
  • Style details: Cropped, fitted, loose elements influence size selection

The models learn these product-specific patterns by analyzing size sell-through rates across categories, brands, and styles. This enables accurate curves even for new products without sales history—the system predicts based on similar products' historical performance, adjusted for the specific store's customer base.

Handling Size Stockouts

A critical challenge in size optimization is the stockout bias problem. Historical sales data shows what sold, not what customers wanted. When Small sells out in week two of a twelve-week season, you've captured only a fraction of potential Small demand. The sales data falsely suggests lower Small demand compared to sizes that remained in stock.

The Censored Demand Challenge

Standard analytical approaches treat zero sales as zero demand. But zero sales when inventory exists means genuine lack of interest, while zero sales after stockout means suppressed demand. Conflating these scenarios produces systematically biased size curves that perpetuate the original allocation mistakes.

Machine learning models correct for this censoring by modeling latent demand rather than just observed sales. They identify when sizes likely stocked out by examining inventory trajectories and sell-through velocity. If Small sold its last unit in week two while Medium still had inventory in week twelve, the model infers substantial unmet Small demand.

Stockout Correction Methodology:

Velocity analysis: Compare early-season sell-through rates across sizes. Faster velocity indicates stronger demand that would have continued absent stockout.

Statistical estimation: Use survival analysis and censored regression techniques to estimate full-season demand from partial observations.

Cross-validation: Test predictions against stores that didn't stock out, validating the correction factors.

Conservative adjustment: Apply correction factors cautiously to avoid over-correcting based on limited early sales.

This correction is particularly important for new product launches where initial allocations are most uncertain. Early stockouts provide valuable demand signals that must be properly interpreted to improve subsequent replenishment and future product allocations.

Observed vs. Latent Demand After Stockout Correction

Cross-Store Size Reallocation

Despite optimal initial allocation, size imbalances inevitably develop as actual sales patterns deviate from forecasts. Some variation is random noise, but persistent patterns indicate opportunities for beneficial transfers. Machine learning enables dynamic reallocation, transforming size inventory from static to fluid, flowing toward demand continuously.

When to Transfer

Not every size imbalance warrants a transfer. Transfers incur real costs—handling, shipping, timing delays, and disrupted presentation. The expected benefit must exceed these costs. Machine learning models identify beneficial transfers by analyzing current inventory positions, predicted remaining demand, and transfer costs.

Transfer Decision Framework

  • Stockout risk: Stores with high probability of stocking out in popular sizes are priority recipients
  • Excess inventory: Stores with slow-moving sizes and low probability of natural sell-through become donors
  • Remaining season: Early-season transfers capture more value than end-of-season moves
  • Transfer costs: Distance, handling fees, and timing determine the cost threshold
  • Markdown risk: Compare transfer cost against probability and magnitude of markdowns if untransferred

The optimization considers the full network simultaneously. Rather than bilateral store-to-store decisions, it solves for the best set of transfers across all locations to maximize expected profit. This might mean Store A sends Medium to Store B, Store B sends Large to Store C, and Store C sends Small to Store A—a complex web impossible to identify manually.

Dynamic Reallocation in Practice

Weekly evaluation cycles assess transfer opportunities as new sales data arrives. Early weeks see more transfers as initial allocation errors become apparent. Mid-season transfers capture emerging opportunities from unexpected demand patterns. Late-season transfers focus on consolidating remaining inventory to strongest locations.

Automated execution is essential at scale. A national retailer might evaluate thousands of potential transfers weekly. Manual review is infeasible. Machine learning systems automatically flag high-value transfers while allowing human oversight for unusual situations or high-stakes decisions.

Performance tracking measures transfer effectiveness. Did the transferred size sell through at the destination? Would it have sold at the origin? What was the realized margin impact? These metrics train the system to improve transfer decisions over time.

Measuring Size Optimization Success

Effective size optimization should deliver measurable improvements across multiple dimensions. Track these metrics to evaluate program success and identify opportunities for refinement.

Key Performance Indicators

Full-Price Sell-Through

The percentage of inventory sold at regular price before markdowns. Target: 85-90% for core products. Size optimization should narrow the variance across sizes—if XS achieves 95% while XL achieves 70%, allocation remains imperfect. The goal is consistent high performance across all sizes.

Size-Related Markdown Rate

Units requiring markdown to clear, by size. Target: Less than 15% for core sizes. Extended sizes (XXS, XXL+) naturally show higher markdown rates due to longer tail demand, but even these should improve with optimization. Track markdown depth as well—deeply discounted sizes indicate severe overallocation.

Stockout Rate by Size

Percentage of store-days when sizes are unavailable. Target: Less than 5% for core sizes during peak season. Lost sales from stockouts often exceed the cost of modest overstocks, especially for high-margin products. Balance inventory productivity against service level.

Customer Satisfaction

Survey scores and return rates related to size availability. "Could not find my size" complaints should decline. Return rates for "wrong size" should remain stable or decrease—optimization shouldn't sacrifice fit quality for inventory efficiency. NPS and repurchase rates among size-constrained customers provide leading indicators.

Transfer Volume and Cost

Transfers should initially increase as optimization identifies opportunities, then decline as allocation improves. Track transfer cost as a percentage of sales—target less than 0.5%. High ongoing transfer volume indicates allocation models need refinement.

ROI Example: National Apparel Retailer
  • Baseline: $800M revenue, 25% markdown rate, 10% stockout rate
  • After Size Optimization:
    • Markdown rate: 25% → 21% (4% reduction × $800M × 50% margin = $16M savings)
    • Stockout reduction: 10% → 5% (5% × $800M = $40M recovered sales × 50% margin = $20M)
    • Transfer efficiency: Reduced 30% of transfers ($2M savings)
    • Inventory reduction: 5% lower working capital ($15M cash flow improvement)
  • Total annual impact: $53M or 6.6% of revenue

Implementation Considerations

Data Requirements

Successful size optimization requires clean, comprehensive data. At minimum, you need historical sales by size, store, product, and time period. Enhanced results come from inventory positions, customer returns by size, and store attributes (demographics, format, square footage). The more data, the better the models, but start with what you have and expand over time.

Organizational Readiness

Size optimization touches multiple functions—merchants, planners, allocators, store operations, and supply chain. Success requires cross-functional alignment on goals, processes, and decision rights. Resistance often comes from merchants comfortable with traditional intuition-based allocation. Demonstrate results through controlled pilots before full-scale rollout.

Common Implementation Pitfalls

Over-optimization: Pursuing perfect size curves delays implementation. Start with "good enough" curves and iterate based on results.

Ignoring constraints: Case pack minimums, presentation requirements, and supplier limitations constrain pure optimization. Build these into models from the start.

Analysis paralysis: Endless model refinement without deployment wastes opportunity. Launch pilots quickly, measure rigorously, improve continuously.

Change management failure: Technical solutions fail without organizational adoption. Invest in training, communication, and stakeholder engagement.

Technology Stack

Enterprise allocation systems, data warehouses, and machine learning platforms form the technology foundation. Cloud-based solutions offer flexibility and scalability without heavy upfront investment. Many retailers start with their existing allocation systems enhanced with size optimization modules, then evolve to more sophisticated platforms as capabilities mature.

The Bottom Line

Size curve optimization represents one of the highest-ROI applications of machine learning in fashion retail. Getting sizes right ensures customers find what they need, creating satisfaction that drives loyalty and lifetime value. The alternative—generic size curves—leaves money on the table through lost sales and unnecessary markdowns while frustrating customers.

Machine learning enables personalization at scale. Each store receives the optimal size mix for its specific customer base. Each product gets allocated according to its unique demand characteristics. Dynamic reallocation keeps inventory flowing toward opportunity throughout the season.

The technology is proven and accessible. The data exists in most retail systems already. The business case is compelling—improvements of 3-7% of revenue are typical, with payback measured in months, not years. The question isn't whether to optimize size curves, but how quickly you can start.

Is your business leaving millions in potential profit trapped in size misallocation? The customers are there, the demand is real—but only if you stock the right sizes in the right places.

← Return to Blog Index