Operating at the Limits
I get hired to make lines repeatable and fast - never just one or the other. Lock in a tight “normal” range, then step beyond it; that’s how you turn today’s stretch into tomorrow’s standard. That idea guided a project in California this season, when a long-time customer asked me to audit an AI grading upgrade for their cherry sorter.
The vendor literature promised spectacular defect detection and an operator-free future. I promised the owners two things: an honest verdict and a roadmap for squeezing every hidden benefit the brochure skipped. During preseason image tests we saw exactly what the new feature could do, and in the first hour of live running on the 2025 crop the results spoke for themselves: major- and minor-defect rates sat safely inside the sales team’s quality range - numbers that used to cost us a shift of constant tweaking. Great start, but I wanted to know how much further we could push.
The question that wouldn’t leave
That evening at dinner I said what I’d been thinking since deep-learning demos took over fruit-tech expos, industry publications, and LinkedIn posts two years ago:
“If every pack‑house can bolt the same AI grading upgrade onto similar sizer hardware, where’s the point of difference? And what happens when expert operators start using that SOTA gear too?”
It wasn’t a grand epiphany; just the moment I decided to figure it out. Tomorrow I’d pull the full season-to-date record, dissect it, and see where the shiny new upgrade still left fruit quality on the table.
Turning raw numbers into a real-time mirror
At 07:00 I fired up the laptop, aimed a handful of SQL queries at our TimescaleDB, and downloaded minute‑by‑minute OEE for the season—availability, throughput, quality, plus raw grade‑by‑grade counts. Nothing flashy: I tossed the CSVs into Grafana for a quick scan.
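For anyone curious, the pull looks roughly like the sketch below. TimescaleDB’s time_bucket() is real; the hypertable name, column names, and date filter are placeholders I’ve made up, not the customer’s actual schema.

```python
# A minimal sketch of the season pull, assuming a hypothetical hypertable
# called line_metrics with the columns shown; the real schema will differ.
import pandas as pd
import psycopg2

conn = psycopg2.connect("dbname=packhouse user=readonly host=localhost")

QUERY = """
SELECT time_bucket('1 minute', ts) AS minute,
       avg(availability)           AS availability,
       avg(throughput_tph)         AS throughput_tph,
       avg(quality_pct)            AS quality_pct,
       grade,
       sum(fruit_count)            AS fruit_count
FROM   line_metrics
WHERE  ts >= '2025-04-01'
GROUP  BY minute, grade
ORDER  BY minute;
"""

oee = pd.read_sql(QUERY, conn)             # minute-by-minute OEE plus grade counts
oee.to_csv("season_oee.csv", index=False)  # the CSV that went into Grafana
```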
The top-line plot looked great. But when I zoomed into a single grower lot, the quality component bobbed like a boat wake. The AI grading was strong, yet I still found myself nudging cutpoints whenever the wave grew. Variation clearly lived inside the lot, invisible until you stared hard enough.
Time for a clearer lens. I piped the data into a notebook, layered rolling averages and standard deviations, then tagged every reading with a colour band:
Dark green | within ±1 σ
Light green | between ±1 σ and ±2 σ
Amber | between ±2 σ and ±3 σ
Red | outside ±3 σ
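A rough sketch of that banding is below, on the same assumed column names; the rolling-window length is my choice for illustration, not something measured on the line.

```python
# Sketch of the rolling z-score banding; column names and the 60-minute
# window are assumptions, not the production settings.
import numpy as np
import pandas as pd

oee = pd.read_csv("season_oee.csv", parse_dates=["minute"])

# One quality reading per minute (the pull had one row per minute and grade).
quality = oee.groupby("minute", as_index=False)["quality_pct"].mean()

roll = quality["quality_pct"].rolling(window=60, min_periods=30)
quality["z"] = (quality["quality_pct"] - roll.mean()) / roll.std()

quality["band"] = pd.cut(
    quality["z"].abs(),
    bins=[0, 1, 2, 3, np.inf],
    labels=["dark_green", "light_green", "amber", "red"],
    include_lowest=True,
)

quality.to_csv("banded_quality.csv", index=False)  # feeds the next step
```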
To catch subtler drift I trained an IsolationForest on the same features and ran an HDBSCAN pass to group any stubborn clusters. I like to think this is what proper data science would look like, but I honestly wouldn’t know. Still, the rule was simple enough to explain on the catwalk: green means typical; red means “pause and look.” Once the banding was wired into the live dashboard, the path from database write to dashboard update averaged ≈ 70 ms thanks to hypertables and a materialized view, so the operator saw the colour shift while the effect of a change was still unfolding - not after the grower lot had been reconciled.
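If you want the flavour of that pass, the sketch below uses scikit-learn’s IsolationForest and the hdbscan package. The feature list, contamination rate, and cluster size are tuning guesses of mine, not the values running on the line.

```python
# Drift detectors, sketched: IsolationForest flags odd minutes, HDBSCAN
# groups the repeat offenders. Columns and parameters are assumptions.
import hdbscan
import pandas as pd
from sklearn.ensemble import IsolationForest

quality = pd.read_csv("banded_quality.csv", parse_dates=["minute"])
features = quality[["quality_pct", "z"]].dropna()

# fit_predict returns -1 for outliers; the contamination rate is a guess.
forest = IsolationForest(n_estimators=200, contamination=0.02, random_state=7)
quality.loc[features.index, "anomaly"] = forest.fit_predict(features) == -1

# Cluster the flagged minutes so a recurring pattern (say, every bin change)
# shows up as one labelled group instead of scattered dots.
flagged = quality.loc[quality["anomaly"] == True, ["quality_pct", "z"]]
quality.loc[flagged.index, "cluster"] = hdbscan.HDBSCAN(
    min_cluster_size=10
).fit_predict(flagged)
```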
The invisible culprit: mixing orchard blocks
We started a large grower lot at the usual 20 t/h. The band stayed green, then flashed amber—sometimes red—each time the forklift set a fresh bin on the infeed. The driver wasn’t clearing yard space; he was following the long-standing habit of feeding whatever bin sat nearest. The data told a subtler story: bins from different blocks of the orchard carried slightly different average size, colour, and firmness. The grader called it faithfully; the dashboard showed it live.
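To show what that looks like in numbers, the check is little more than a group-by once each bin carries a block tag; the figures below are invented for illustration only, not the grower’s data.

```python
# Illustration only: made-up per-bin summaries with an orchard block tag;
# the real traceability fields and values will differ.
import pandas as pd

bins = pd.DataFrame({
    "orchard_block": ["A", "A", "B", "B", "C", "C"],
    "avg_size_mm":   [28.1, 27.9, 29.4, 29.6, 26.8, 27.0],
    "avg_colour":    [0.82, 0.80, 0.88, 0.87, 0.75, 0.77],
    "avg_firmness":  [310, 305, 296, 298, 324, 320],
})

# Per-block averages: small offsets, but enough to move the grade mix each
# time a bin from a different block hits the infeed.
print(bins.groupby("orchard_block").mean().round(2))
```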
A quick request went to the forklift driver: feed by block number, one block at a time. Ten minutes later the band sat solid green, manual tweaks fell by roughly 60 %, and the quality variance inside the lot dropped under 2 %.
Proving there was still head-room
With variability under control I nudged the infeed up 2 t/h, to 22 t/h. Quality held, the band barely left light green, and the anomaly counter stayed below 1.8 %. Two extra tonnes an hour won’t break LinkedIn algorithms, but it means hundreds of cartons and, two weeks on, no frantic “double-check this pallet” visits from our export coordinator.
A tool that compensates for experience
The real prize is how we handle drift now. When the IsolationForest flags an outlier, the system bundles thirty minutes of contextual data and hands it to a small local LLM agent tuned for grading ops. The reply comes back in plain English:
“Sharp rise in ‘peddler’ grade. Check profile or adjust firmness threshold +2.”
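Under the hood the hand-off is not much more than a prompt and a POST. The endpoint, model name, and payload below are placeholders - one plausible way to wire the loop, not the actual agent stack on site.

```python
# Sketch of the anomaly-to-advice hand-off. Endpoint, model name, and payload
# shape are placeholders, not the real agent wiring.
import pandas as pd
import requests

def ask_grading_agent(quality: pd.DataFrame, flagged_minute: pd.Timestamp) -> str:
    # Bundle the thirty minutes of context leading up to the flagged reading.
    window = quality.set_index("minute").loc[
        flagged_minute - pd.Timedelta(minutes=30): flagged_minute
    ]
    prompt = (
        "You are a cherry grading assistant. Given the last 30 minutes of "
        "line metrics below, name the most likely cause of the anomaly and "
        "suggest one concrete operator action.\n\n" + window.to_csv()
    )
    resp = requests.post(
        "http://localhost:8000/v1/agent",            # hypothetical local endpoint
        json={"model": "grading-ops-small", "prompt": prompt},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()["text"]                       # plain-English advice
```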
It’s the same hunch a veteran grader would sense, surfaced in seconds, not seasons. That feed-forward loop is exactly what I mean by compensating for experience: capturing years of intuition and making it available on the night shift, in the next facility, or to a brand-new operator.
Lessons worth keeping
The sorter is only half the system. Brilliant optics and neural nets can’t rescue a line if bin flow injects chaos faster than the algorithm can learn.
Instrumented stress tests expose processes, not steel. Running near the limit didn’t hurt the machine; it spotlighted a feeding habit we’d ignored because its cost never showed up on yesterday’s KPI board.
Lock in the win before hunting the next percent. By switching to block-based feeding the same day we discovered the issue, we turned a quick gain into a lasting edge.
Two weeks on, the grader still hums, anomaly alerts stay quiet, and the export coordinator isn’t chasing rogue fruit. Most important, the crew now trusts the green-amber-red language on screen, because every colour comes with a concrete action.
Will we push again? Absolutely. A comfort zone only exists so you can outgrow it. Next time we step beyond, we’ll have a live anomaly-to-action tool and a team fluent in its voice.
And we’ll probably cap the shift with another dinner toast—celebrating whatever fresh two‑percent edge we coax from the same AI technology everyone else already owns.