Who this is for: Maintenance managers, reliability engineers, quality supervisors, and anyone who has watched an inspector squint at the same bolt pattern for the 400th time today and thought, "There has to be a better way." There is.
Read Time: 18-20 minutes
The Inspection You Missed
A utility company in the American Midwest runs 14,000 miles of transmission lines. Every year, they send helicopter crews to photograph every pole, every insulator, every splice.
450 hours of flight time. 2.1 million images. 12 inspectors reviewing them on monitors.
In 2023, they found 1,847 defects. Good work.
In 2024, they ran the same images through Maximo Visual Inspection. MVI found 2,412 defects. 565 that human eyes missed. Including 23 cracked insulators rated critical -- the kind that cause outages in ice storms.
"We thought our inspectors were thorough. They were. But thorough and complete are different things when you're looking at image number 175,000."
81% reduction in review time. 31% more defects caught. Zero additional flight hours.
That is not a vendor slide. That is what happens when you stop pretending human visual inspection scales.
The Manual Inspection Problem
You know this problem. You live it every quarter.
The physics of human visual inspection:
- The human eye fatigues after 20 minutes of sustained detail inspection
- Detection accuracy drops from 85% to 65% after the first hour
- Lighting variations, angle changes, and surface reflections create false confidence
- Inspector A calls it "acceptable wear." Inspector B calls it "replace immediately."
- Nobody photographs the same asset the same way twice
The economics of manual inspection:
THE INSPECTION MATH
===================
Average inspection: 3-5 hours
Average cost: $225-375 per inspection
Annual inspections (mid-size plant): 4,000
Annual cost: $1.2M
Defect miss rate: 20-30%
Cost of one missed critical defect: $50K-$500K+
You are paying $1.2M/year for a process that
misses one in four problems.And here is the part nobody talks about: the inspections you are not doing. The assets you inspect annually that should be inspected quarterly. The remote locations you skip because helicopter time costs $2,500/hour. The night-shift equipment that only gets looked at when something breaks.
Manual inspection does not scale. It never did. You just did not have an alternative.
Now you do.
What MVI Actually Does
Strip away the marketing. IBM Maximo Visual Inspection does three things:
1. Image Classification
Question it answers: "What category does this image belong to?"
Feed it an image of a transformer. MVI says: "Corroded" or "Good condition" or "Oil leak present."
Classification looks at the entire image and assigns it to one of your defined categories. Think of it as sorting photos into folders -- but at 50 images per second with 95%+ accuracy.
Best for: Pass/fail quality checks, condition grading, go/no-go inspection gates.
2. Object Detection
Question it answers: "Where in this image are the things I care about?"
Feed it a drone photo of a solar panel array. MVI draws bounding boxes around every cracked cell, every hot spot, every piece of debris. It tells you what it found and where it found it.
Best for: Finding specific defects in complex images, counting objects, locating damage on large assets.
3. Action Detection
Question it answers: "What is happening in this video?"
Feed it video of a manufacturing line. MVI identifies whether the operator is following the correct procedure, whether safety equipment is being worn, whether the assembly sequence is correct.
Best for: Process compliance, safety monitoring, quality control on production lines.
MVI CAPABILITY MATRIX
=====================
Capability Input Output Use Case
────────────────── ──────── ────────────────── ──────────────────────
Classification Image Category label Pass/fail, grading
Object Detection Image Bounding boxes Defect location, counting
Action Detection Video Activity labels Process compliance, safetyWhat MVI Is NOT
Let us be clear about the boundaries:
- MVI is not general-purpose AI. It detects what you train it to detect. Period.
- MVI is not a camera system. You bring the images. MVI analyzes them.
- MVI is not magic. A model trained on daytime photos will struggle with nighttime images.
- MVI does not replace judgment. It flags findings. Humans decide what to do about them.
Where MVI Fits in the MAS Ecosystem
MVI does not work in isolation. Its power comes from integration.
THE MAS VISUAL INSPECTION ARCHITECTURE
=======================================
IMAGE SOURCES MVI CORE MAS APPLICATIONS
───────────── ──────── ────────────────
Drones ──┐
Fixed Cameras ──┤ ┌──────────────┐
Mobile Devices ──┼──> Images ──>│ MVI Server │──> Results ──┐
Robots ──┤ │ - Classify │ │
Manual Upload ──┘ │ - Detect │ ├──> Manage (Work Orders)
│ - Train │ ├──> Health (Condition Scores)
└──────────────┘ ├──> Monitor (Alerts)
│ └──> Predict (Failure Input)
│
┌──────────────┐
│ MVI Edge │
│ (Local │
│ Inference) │
└──────────────┘
│
┌──────────────┐
│ MVI Mobile │
│ (Field │
│ Inspection) │
└──────────────┘The integration story matters more than the AI story:
- MVI + Manage: Defect detected? Auto-create a work order with the image attached, defect type classified, priority assigned. No human has to type "cracked insulator found at pole 4,731."
- MVI + Health: Visual condition scores feed into asset health calculations. A corroded heat exchanger photograph becomes a data point in the overall health score alongside vibration data and maintenance history.
- MVI + Monitor: Set up visual inspection triggers. Camera captures image every hour. MVI analyzes it. If anomaly detected, Monitor fires an alert. Same alerting pipeline as your IoT sensors.
- MVI + Predict: Historical visual inspection data enriches predictive models. "Assets that show this corrosion pattern at month 6 fail at month 14" -- Predict learns from what MVI sees.
Industry Use Cases: Where MVI Delivers Today
Infrastructure: Sund & Baelt (Denmark) -- Bridge Inspection at National Scale
The problem: Sund & Baelt operates the Great Belt Fixed Link, one of the world's longest suspension bridges. Concrete surface inspection across massive bridge infrastructure took months of manual work. Inspectors had to visually evaluate enormous surface areas for cracks, spalling, and deterioration -- a process that was slow, inconsistent, and limited by human stamina.
MVI solution: Maximo Visual Inspection deployed to analyze concrete surface imagery captured across the bridge infrastructure. AI models trained to detect and classify concrete deterioration patterns that previously required extensive manual inspection campaigns.
Result:
- Inspection time: Reduced from months to days
- Projected productivity increase: 15-25% over 5-10 years
- Incident response: >30% reduction in time from incident detection to repair
- Projected lifespan extension: 100 years (from original design life)
- CO2 impact: 750,000 tons of CO2 saved by extending bridge lifespan instead of rebuilding
- ROI timeline: Value realized in initial deployment phase
Key insight: The Sund & Baelt case proves that MVI's value is not limited to manufacturing defects. National-scale infrastructure inspection -- where the cost of failure is measured in lives and hundreds of millions -- is where AI visual inspection delivers its most profound impact.
Utilities: Melbourne Water (Australia) -- Stormwater Infrastructure
The problem: Melbourne Water manages stormwater infrastructure across a 14,000 square kilometer catchment area. Thousands of stormwater grates require regular inspection to prevent flooding and environmental contamination. Manual inspection consumed enormous staff hours, and traditional SCADA-based monitoring alternatives were prohibitively expensive to deploy at scale.
MVI solution: IoT camera solution integrated with Maximo Visual Inspection to monitor stormwater grate conditions. The AI system analyzes imagery to detect blockages, structural damage, and maintenance needs across the distributed network.
Result:
- Staff hours saved: Thousands annually (expected)
- Cost savings: Tens to hundreds of thousands of dollars per year
- Coverage: 14,000 sq km catchment area monitored systematically
- Cost advantage: IoT camera solution costs significantly less than SCADA alternatives
- Scalability: Designed for distributed, high-volume infrastructure monitoring
Manufacturing: Defect Detection on the Line
The problem: A precision parts manufacturer inspects 12,000 components per shift. Three quality inspectors. Miss rate: 3.2%. That 3.2% costs $847K/year in returns, rework, and warranty claims.
MVI solution: Camera mounted at end of production line captures every part. MVI model trained on 2,400 images (800 good, 800 surface defect, 800 dimensional defect). Classification runs in real-time using GoogLeNet for fast, accurate image classification.
Result:
- Miss rate: 3.2% to 0.4%
- Inspection speed: 1 part every 4 seconds (vs 15 seconds manual)
- Annual savings: $612K in quality costs
- ROI payback: 4 months
Utilities: Transmission Line and Pole Inspection
The problem: 14,000 miles of transmission lines. Annual helicopter inspection costs $1.8M. Review backlog means critical defects sit in queues for weeks.
MVI solution: Drone-captured imagery analyzed by MVI. Object detection models (Faster R-CNN for accuracy, YOLO v3 for speed) trained for: cracked insulators, woodpecker damage, vegetation encroachment, hardware corrosion, conductor damage. Five defect types, one model.
Result:
- Review time: 450 hours to 85 hours (-81%)
- Defects found per cycle: 1,847 to 2,412 (+31%)
- Critical defect response: Weeks to 48 hours
- Annual savings: $980K
Oil and Gas: Corrosion and Damage Assessment
The problem: Offshore platform inspection requires rope access teams. $15K per day for a 4-person team. 200 inspection days per year across the fleet. Human inconsistency means corrosion ratings vary 2 grades between inspectors.
MVI solution: Camera-equipped inspection robots capture images of pipe surfaces, welds, and structural members. MVI classifies corrosion severity (Grade 1-5) and detects specific damage types using Detectron2 for polygon-level instance segmentation: pitting, cracking, coating failure, mechanical damage.
Result:
- Inspection cost: $3M to $1.1M (-63%)
- Consistency: Grade variance eliminated (AI does not have bad days)
- Coverage: 40% more surface area inspected per cycle
- Safety: 60% reduction in rope access hours
Transportation: Rail and Wheel Inspection
The problem: Rail operator inspects 2,400 wheels per week using manual ultrasonic and visual inspection. Each wheel takes 8 minutes. Two inspectors work full-time on nothing but wheels.
MVI solution: Cameras mounted at track level capture wheel tread images as trains pass at low speed. MVI detects using YOLO v3 for real-time speed: flat spots, thermal cracks, flange wear, and tread buildup. Flagged wheels routed for detailed ultrasonic follow-up.
Result:
- First-pass screening: 8 minutes to 0.3 seconds per wheel
- Inspector redeployment: From 100% screening to 15% verification
- Detection rate: 94% (vs 87% manual first-pass)
- Missed critical defect rate: Near zero (double-checked by ultrasonic)
The Business Case: Numbers That Matter
Let us build a generic business case you can adapt.
MVI BUSINESS CASE TEMPLATE
==========================
CURRENT STATE (Manual Inspection)
─────────────────────────────────
Annual inspections: 4,000
Average time per inspection: 3.5 hours
Fully loaded inspector cost: $85/hour
Annual inspection labor: $1,190,000
Defect miss rate: 25%
Average cost per missed defect: $75,000
Estimated missed defects/year: 120
Cost of missed defects: $9,000,000
TOTAL ANNUAL COST: $10,190,000
FUTURE STATE (MVI-Augmented)
────────────────────────────
MVI handles first-pass screening: 80% of volume
Human inspectors verify flagged items: 20% of volume
Inspection time reduction: 70%
New annual inspection labor: $357,000
MVI miss rate: 4%
Missed defects/year: 19
Cost of missed defects: $1,425,000
MVI licensing + infrastructure: $250,000/year
TOTAL ANNUAL COST: $2,032,000
ANNUAL SAVINGS: $8,158,000
ROI: 3,163%
PAYBACK: < 2 monthsYour numbers will vary. The ratios will not. Every organization we have worked with sees 60-85% inspection cost reduction once the models are trained and deployed.
Key insight: The biggest ROI from MVI is not the labor savings on inspection time. It is the cost avoidance from catching defects that human inspectors miss. One prevented transformer failure pays for 3 years of MVI licensing.
The Hype vs. Reality Check
What Vendors Tell You
- "AI replaces your inspection team"
- "Train a model in minutes"
- "Works out of the box"
- "99.9% accuracy"
What Actually Happens
- AI augments your inspection team. Your best inspectors become your best model trainers. They teach the AI what they know.
- Your first model takes 2-4 weeks. Image collection, labeling, training, validation, iteration. It is not hard. It is not instant.
- You need your own training data. Transfer learning gives you a head start. Your specific assets, lighting, angles, and defect types require your own images.
- 93-97% accuracy is production reality. And 93% consistent accuracy beats 80% variable accuracy every single day.
The Honest MVI Maturity Curve
MVI MATURITY TIMELINE
=====================
Month 1-2: First model trained. "This actually works."
Month 3-4: Model deployed to one inspection point. Edge cases appear.
Month 5-6: Model refined. Second defect type added. Process integrated.
Month 7-9: Multiple models in production. Work order integration active.
Month 10-12: Mobile/edge deployment. Field inspectors using daily.
Month 12-18: Enterprise rollout. Multiple sites. Governance established.
Month 18+: Continuous improvement. Retraining pipelines. Full ROI.This is not a weekend project. But it is not a 3-year research initiative either. Real organizations are getting real value in 3-6 months.
Why Now: The Three Converging Forces
1. Hardware Got Cheap
A 4K inspection camera: $500. A commercial drone with camera: $2,000. A GPU server for training: $15K (or cloud). The hardware barrier that kept computer vision in research labs evaporated.
2. Transfer Learning Changed the Game
You do not need 1 million images to train a model. Pre-trained neural networks like GoogLeNet (MVI's default for classification) already understand edges, shapes, textures, and patterns. You just teach them what your specific defects look like. 200-500 images per defect type gets you started. 1,000+ images per type gets you to production quality.
3. No-Code Platforms Arrived
MVI lets the reliability engineer who has spent 20 years looking at corroded pipes train a computer vision model without writing a single line of code. Click, label, train, deploy. The domain expert is now the model builder.
Key Takeaways
- MVI automates visual inspection using deep learning -- It classifies images, detects objects, and identifies actions in video without requiring data science skills. The no-code interface lets domain experts build models directly.
- Human inspectors miss 20-30% of defects; MVI achieves 93-97% -- Not because humans are bad, but because the human eye fatigues, conditions vary, and subjectivity creeps in. AI provides consistent, tireless analysis across thousands of images.
- Three core capabilities serve different use cases -- Image classification (pass/fail), object detection (find and locate defects), and action detection (process compliance). Choose based on your inspection question.
- Integration with MAS is where MVI becomes operational -- Manage (work orders), Health (condition scores), Monitor (alerts), and Predict (failure enrichment). A detection without an action is wasted compute.
- ROI comes from caught defects, not just labor savings -- 60-85% inspection cost reduction is real, but the cost avoidance from finding the 25% of defects that humans miss delivers 5-10x more value.
- Production maturity takes 3-6 months for first use case -- Not overnight, not years. Image collection, labeling, training, deployment, and integration in a focused pilot produces value in one to two quarters.
- The three converging forces make now the time -- Cheap hardware, transfer learning reducing data requirements, and no-code platforms putting model building in the hands of domain experts.
What Comes Next
In Part 2, we break open the black box. How does computer vision actually work? What is a convolutional neural network doing when it "sees" a crack? And why does it matter for asset managers who will never write a line of Python?
You do not need to become a data scientist. But you need to understand enough to ask the right questions, set realistic expectations, and know when your model is telling you something real versus something random.
Previous: Series Index
Next: Part 2 - Computer Vision Fundamentals for Asset Managers
Series: MAS VISUAL INSPECTION | Part 1 of 12
TheMaximoGuys | Enterprise Maximo. No fluff. Just results.



