2 posts tagged with "saas-development"

Building a Multi-Vertical Lead Generation System with LLM Visual Analysis

· 6 min read

Encountered this architecture challenge while designing a building renovation lead identification system for a client. The same LLM visual pipeline proved reusable across multiple industries with zero additional development cost.

TL;DR​

Feed Google Street View + Satellite images to an LLM, extract structured building attributes (glass type, orientation, facade material, condition), and generate a renovation potential score. The same pipeline, without modification, can produce targeted leads for window film installers and window covering retailers. POC cost for 150 buildings: ~$125.

Why This System​

The US commercial building renovation market relies on manual canvassing to identify targets, which is inefficient and limited in coverage. The window film and solar shading industries face the same problem: no tool exists to identify high-intent buildings based on physical attributes.

Core insight: building facade images contain quantifiable commercial signals. LLM Vision can extract these signals zero-shot. Build once, reuse across industries.

Pipeline Design​

8-step pipeline. Input: address or ZIP code. Output: scored building lead CSV.

Step 1: Geocoding. The Google Geocoding API converts user input to lat/lng coordinates.

Step 2: Building Discovery. Google Places Nearby Search (radius = 500 m) with type filters: office buildings, hotels, apartments, industrial facilities.

Step 3: Deduplication & Filtering. Deduplicate by place_id and remove results missing geometry data.
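A minimal sketch of the deduplication step. It assumes each result is a dict shaped like the legacy Places Nearby Search response (`place_id` plus `geometry.location.lat/lng`); the function name `dedupe_and_filter` is illustrative, not part of the original system.

```python
from typing import Any


def dedupe_and_filter(places: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Keep the first result per place_id and drop results without coordinates."""
    seen: set[str] = set()
    kept: list[dict[str, Any]] = []
    for place in places:
        place_id = place.get("place_id")
        # Assumed response shape: {"geometry": {"location": {"lat": ..., "lng": ...}}}
        location = (place.get("geometry") or {}).get("location") or {}
        if not place_id or place_id in seen:
            continue
        if "lat" not in location or "lng" not in location:
            continue
        seen.add(place_id)
        kept.append(place)
    return kept
```

Overlapping search radii across nearby coordinates are the usual source of duplicates, so keying on `place_id` is enough here.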

Step 4: Dual Image Retrieval (Parallel)

  • Satellite: Maps Static API, zoom=18, 640×640 px
  • Street view: Street View Static API, 640×640 px, fov=90
  • Street View Metadata API (free) checks coverage before requesting images, which avoids unnecessary billing
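The three requests above can be sketched as URL builders. This is a sketch only: the helper names are hypothetical, the HTTP fetch and parallel execution are omitted, and `api_key` is a placeholder. The endpoints and the `status` field of the metadata response follow Google's public Maps Static and Street View Static API documentation.

```python
from urllib.parse import urlencode

MAPS_BASE = "https://maps.googleapis.com/maps/api"


def satellite_url(lat: float, lng: float, api_key: str) -> str:
    """Maps Static API request for a 640x640 satellite tile at zoom 18."""
    params = {
        "center": f"{lat},{lng}",
        "zoom": 18,
        "size": "640x640",
        "maptype": "satellite",
        "key": api_key,
    }
    return f"{MAPS_BASE}/staticmap?{urlencode(params)}"


def street_view_metadata_url(lat: float, lng: float, api_key: str) -> str:
    """Metadata request used to check coverage before fetching the image."""
    params = {"location": f"{lat},{lng}", "key": api_key}
    return f"{MAPS_BASE}/streetview/metadata?{urlencode(params)}"


def street_view_url(lat: float, lng: float, api_key: str) -> str:
    """Street View Static API request for a 640x640 image with fov=90."""
    params = {"location": f"{lat},{lng}", "size": "640x640", "fov": 90, "key": api_key}
    return f"{MAPS_BASE}/streetview?{urlencode(params)}"


def has_street_view(metadata: dict) -> bool:
    """The metadata response reports status "OK" when imagery exists."""
    return metadata.get("status") == "OK"
```

Calling `street_view_url` only when `has_street_view` returns true is what keeps the billed image requests limited to covered locations.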

Step 5: LLM Visual Perception. Two separate prompts (not merged, because merging reduces structured output stability):

  • Prompt 1 (satellite): floor count, building volume, window density
  • Prompt 2 (street view): facade material, glass type, condition, occlusion level

{
  "material": "glass_curtain_wall / brick / concrete / mixed",
  "glass_type": "single / double / unknown",
  "condition": "good / fair / poor",
  "orientation": "N / S / E / W / mixed",
  "estimated_age": "0-10 / 10-20 / 20+ years",
  "occlusion_level": "none / minor / major",
  "confidence": "high / medium / low"
}
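Because field omission is the failure mode the two-prompt design guards against, it may help to validate each LLM response against the schema before scoring. The sketch below is one possible check, not the original implementation. It assumes the model returns a single JSON object with one allowed value per field, and that `estimated_age` is encoded as `"0-10"`, `"10-20"`, or `"20+"`.

```python
import json

# Allowed values per field, taken from the schema above.
# The estimated_age encoding is an assumption.
ALLOWED = {
    "material": {"glass_curtain_wall", "brick", "concrete", "mixed"},
    "glass_type": {"single", "double", "unknown"},
    "condition": {"good", "fair", "poor"},
    "orientation": {"N", "S", "E", "W", "mixed"},
    "estimated_age": {"0-10", "10-20", "20+"},
    "occlusion_level": {"none", "minor", "major"},
    "confidence": {"high", "medium", "low"},
}


def parse_llm_attributes(raw: str) -> tuple[dict[str, str], list[str]]:
    """Return (valid attributes, names of missing or out-of-vocabulary fields)."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return {}, ["invalid_json"]
    if not isinstance(data, dict):
        return {}, ["not_an_object"]
    attrs: dict[str, str] = {}
    problems: list[str] = []
    for field, allowed in ALLOWED.items():
        value = data.get(field)
        if isinstance(value, str) and value in allowed:
            attrs[field] = value
        else:
            problems.append(field)
    return attrs, problems
```

The returned problem list doubles as a per-field omission log, which is the metric the prompt-splitting decision depends on.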

Step 6: Scoring Engine. Fixed-weight rule engine, 0–100 score:

| Condition | Score Added |
| --- | --- |
| glass_type = single-pane | +40 |
| material = aged (brick/concrete) | +20 |
| estimated_age > 20 years | +20 |
| condition = poor | +20 |

Confidence discount: medium × 0.8, low × 0.6
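The scoring rules can be sketched as a small function. This assumes the field encodings from the Step 5 schema (with `estimated_age` stored as `"20+"` for buildings over 20 years), that high confidence applies no discount, and that a missing confidence value is treated as low; those last two points are assumptions, not stated rules.

```python
# Discount factors: medium and low come from the rule above; high = 1.0 is assumed.
CONFIDENCE_FACTOR = {"high": 1.0, "medium": 0.8, "low": 0.6}


def renovation_score(attrs: dict[str, str]) -> float:
    """Fixed-weight score in the range 0-100, discounted by LLM confidence."""
    points = 0
    if attrs.get("glass_type") == "single":
        points += 40
    if attrs.get("material") in {"brick", "concrete"}:
        points += 20
    if attrs.get("estimated_age") == "20+":
        points += 20
    if attrs.get("condition") == "poor":
        points += 20
    # Unknown or missing confidence falls back to the low discount (assumption).
    factor = CONFIDENCE_FACTOR.get(attrs.get("confidence", "low"), 0.6)
    return round(points * factor, 1)
```

For example, a single-pane brick building over 20 years old in poor condition scores 100 at high confidence and 80 at medium confidence.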

Step 7: Lead Enrichment. The Places Details API is called only for the top 30% of scored buildings. Returns: name, address, phone, website.

Step 8: CSV Export. Full field output with an empty manual annotation column for human review.
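Steps 7 and 8 might look like the following sketch. The function names, the `score` key, and the `manual_annotation` column name are illustrative; rounding the 30% cutoff up is also an assumption.

```python
import csv
import io


def select_for_enrichment(buildings: list[dict], percent: int = 30) -> list[dict]:
    """Return the top `percent` of buildings by score (count rounded up)."""
    ranked = sorted(buildings, key=lambda b: b["score"], reverse=True)
    keep = -(-len(ranked) * percent // 100)  # integer ceiling division
    return ranked[:keep]


def export_csv(buildings: list[dict], fieldnames: list[str], out) -> None:
    """Write all buildings plus an empty column for human review."""
    writer = csv.DictWriter(
        out,
        fieldnames=[*fieldnames, "manual_annotation"],
        restval="",             # leaves manual_annotation blank
        extrasaction="ignore",  # skip keys not listed in fieldnames
    )
    writer.writeheader()
    writer.writerows(buildings)


if __name__ == "__main__":
    rows = [{"place_id": f"p{i}", "score": i * 10} for i in range(10)]
    print([r["place_id"] for r in select_for_enrichment(rows)])
    buffer = io.StringIO()
    export_csv(rows, ["place_id", "score"], buffer)
    print(buffer.getvalue())
```

Restricting the Places Details call to the selected subset is what keeps enrichment cost proportional to the lead count rather than the scan size.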

Street View Fallback Strategy​

Buildings are never discarded. Three-tier graceful degradation:

| Scenario | Handling |
| --- | --- |
| Street view available | Full dual-image analysis |
| No street view coverage | Satellite-only analysis; material/glass marked unknown; confidence auto-downgraded to medium |
| Heavy occlusion | Same as above; flagged as priority manual annotation sample |
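A sketch of the three tiers as a merge function. It assumes `street` is `None` when the metadata check reports no coverage, that heavy occlusion corresponds to `occlusion_level == "major"`, and that the satellite result carries its own `confidence` field; the `priority_manual_review` flag name is hypothetical.

```python
from typing import Optional


def merge_with_fallback(satellite: dict, street: Optional[dict]) -> dict:
    """Combine satellite and street-view attributes, degrading gracefully."""
    occluded = street is not None and street.get("occlusion_level") == "major"
    if street is not None and not occluded:
        # Tier 1: full dual-image analysis.
        return {**satellite, **street, "priority_manual_review": False}
    # Tiers 2 and 3: satellite-only; street-level fields become unknown.
    confidence = satellite.get("confidence", "medium")
    if confidence == "high":
        confidence = "medium"  # downgrade; an existing "low" is kept as-is
    return {
        **satellite,
        "material": "unknown",
        "glass_type": "unknown",
        "confidence": confidence,
        "priority_manual_review": occluded,  # tier 3 only
    }
```

Since no branch returns nothing, every discovered building still reaches the scoring step, consistent with the rule that buildings are never discarded.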

Commercial Extension​

LLM-extracted fields are generic physical attributes, not bound to the renovation use case. Same fields, different filter conditions, different buyer.

Primary Verticals​

Window Film Installers. Target signals: glass_type=single + orientation=W/S + window_density=high

Single-pane glass means poor thermal insulation. West/south-facing facades receive peak solar gain. High window density increases project scale. Together, the three signals identify buildings that are strong candidates for window film installation.

Window Covering / Blind Retailers. Target signals: orientation=W/S + window_density=high (floor-to-ceiling windows → high order value)

The same high-scoring buildings are valuable to multiple non-competing buyers. One scored building can be sold as a lead to both verticals simultaneously.

Lead Flow​

ZIP code input
↓
Building discovery + LLM analysis
↓
Top 30% scored buildings
↓
Field signals matched per vertical:
  • Window film: glass_type=single + orientation=W/S
  • Window coverings: orientation=W/S + window_density=high
↓
Sellable leads per vertical
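The per-vertical matching step in this flow can be sketched as a table of predicates over the extracted fields. The window film rule here uses the full signal set from the Primary Verticals section (including window_density=high). The vertical keys and the encoding of `window_density` as `"high"` are assumptions.

```python
from typing import Callable

SUN_FACING = {"W", "S"}

# One predicate per vertical; adding a vertical means adding one entry.
VERTICAL_RULES: dict[str, Callable[[dict], bool]] = {
    "window_film": lambda b: (
        b.get("glass_type") == "single"
        and b.get("orientation") in SUN_FACING
        and b.get("window_density") == "high"
    ),
    "window_coverings": lambda b: (
        b.get("orientation") in SUN_FACING
        and b.get("window_density") == "high"
    ),
}


def match_verticals(building: dict) -> list[str]:
    """Return the names of every vertical whose signals the building satisfies."""
    return [name for name, rule in VERTICAL_RULES.items() if rule(building)]
```

A west-facing, single-pane, high-density building matches both rules, which is the case where one scored building becomes a lead for two buyers.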

One pipeline, multiple verticals. A 150-building POC scan costing ~$125 can generate leads for 2+ industries simultaneously. Marginal cost per additional vertical approaches zero.

Key Design Decisions​

Why two separate prompts instead of one merged prompt? Merging prompts when processing multiple images increases structured JSON instability: field omission rates rise. Two independent prompts each focus on a single image type, producing more reliable output.

Why exclude LangChain? The system is a fixed linear pipeline with no dynamic decision-making. Introducing an agent framework adds debugging complexity with no real benefit.

Why not train a custom model? UCL 2024 research confirms GPT-4 Vision can extract building age from facade images zero-shot, with no pre-labeling required. The POC phase validates the zero-shot accuracy baseline first, with fine-tuning only if needed.

POC Execution Plan​

  • Scale: 1–2 US cities, 2–3 ZIP codes each, targeting 100–150 buildings
  • Validation: Manual ground-truth labeling → compare with LLM output → calculate per-field accuracy
  • Decision gate: proceed to full development only after baseline accuracy is confirmed acceptable
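The validation step amounts to comparing two aligned lists of records field by field. A minimal sketch, assuming ground truth and LLM output are lists of dicts in the same building order, and that unlabeled ground-truth fields are skipped rather than counted as errors:

```python
def per_field_accuracy(
    ground_truth: list[dict],
    predictions: list[dict],
    fields: list[str],
) -> dict[str, float]:
    """Share of labeled buildings where the LLM value equals the manual label."""
    accuracy: dict[str, float] = {}
    for field in fields:
        pairs = [
            (truth[field], pred.get(field))
            for truth, pred in zip(ground_truth, predictions)
            if truth.get(field) is not None  # skip unlabeled samples
        ]
        if pairs:
            accuracy[field] = sum(t == p for t, p in pairs) / len(pairs)
        else:
            accuracy[field] = float("nan")  # no labels available for this field
    return accuracy
```

Reporting accuracy per field rather than overall matters here because fields such as glass type have no published benchmark and may perform very differently from building age.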

Estimated API Cost: 150 Buildings

| API | Cost |
| --- | --- |
| Geocoding | ~$5 |
| Places Nearby Search | ~$10 |
| Street View Static | ~$35 |
| Maps Static (satellite) | ~$25 |
| LLM Vision | ~$50 |
| Total | ~$125 |
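As a quick arithmetic check on the estimate, the snippet below sums the line items and derives a per-building figure. The numbers are the approximate values from the table; the variable names are illustrative.

```python
# Approximate line items from the cost table, in USD.
API_COSTS_USD = {
    "Geocoding": 5,
    "Places Nearby Search": 10,
    "Street View Static": 35,
    "Maps Static (satellite)": 25,
    "LLM Vision": 50,
}
BUILDINGS = 150

total = sum(API_COSTS_USD.values())
per_building = total / BUILDINGS
shares = {api: cost / total for api, cost in API_COSTS_USD.items()}

print(f"total: ${total}, per building: ${per_building:.2f}")
for api, share in shares.items():
    print(f"{api}: {share:.0%}")
```

The line items sum to $125, or roughly $0.83 per building, with LLM Vision accounting for 40% of the total.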

Research References​

1. Housing Passport, World Bank (2019)

World Bank-supported project using street view + ML to automatically identify vulnerable buildings and generate a 'Housing Passport' record for each.

  • Validates technical feasibility of street view + ML for building material and condition identification
  • Processing speed: ~$1.50 per 300,000 images/hour after training
  • Key difference: that project used proprietary street imagery + custom-trained models; this system uses Google Street View + LLM Vision zero-shot (no pre-labeling required, but accuracy must be POC-validated)

2. UCL: Zero-Shot Building Age Classification Using GPT-4 (ISPRS 2024)

University College London research using GPT-4 Vision for zero-shot building age classification from facade images, with no labeled training data required.

  • Overall accuracy: 39.69% (coarse-grained classification)
  • Mean absolute error: 0.85 decades (~8–9 years)
  • Confirms LLM Vision can extract building age from facade images without training
  • Glass type recognition has no published benchmark, so accuracy must be measured in the POC

China Manufacturer Go-Global Strategy: Independent Site + Google Ads from Zero

· 3 min read

Delivered an overseas market entry strategy for an industrial manufacturer with an established domestic business, covering channel selection analysis and website platform recommendations for a zero-to-one international expansion.

Background​

The client is a Chinese industrial product manufacturer with a proven domestic operation. Competitors were still spending heavily to figure out overseas channels. B2B overseas customers represent significantly higher order values than domestic ones. The timing was right: the client needed a low-dependency, controllable, long-term viable overseas acquisition strategy.

Challenges​

  • Mainstream overseas channels (Alibaba International, Made-in-China) carry high annual fees, strong platform dependency, and uncontrollable bidding costs
  • No prior overseas digital marketing experience; budget constrained; required a testable approach that could be stopped if data underperformed
  • B2B inquiry quality varies widely; the client needed a channel that attracts active, high-intent buyers rather than passive browsing traffic

Strategy​

Channel Selection​

Systematic evaluation of three primary channels:

| Channel | Cost Structure | Inquiry Quality | Controllability | Risk |
| --- | --- | --- | --- | --- |
| Alibaba International | High annual fee, recurring | Low–Medium | Poor | Bidding uncontrollable, ROI depends on platform |
| Made-in-China | Medium annual fee, recurring | Medium | Good | Low ceiling, limited conversion |
| Independent Site + Google Ads | One-time build + performance-based ad spend | High (active search) | High | Can stop anytime if data underperforms |

Conclusion: Independent site + Google Ads is the only option that simultaneously achieves low platform dependency, high inquiry quality, and controllable budget.

Extended Channel Scoring​

| Channel | Score /25 | Recommendation |
| --- | --- | --- |
| Independent Site + Google Ads | 18 | Primary focus |
| Trade shows (electronics) | 17 | Selective participation |
| Made-in-China | 16 | Worth supplementing |
| Alibaba International | 14 | Low-investment maintenance |
| Amazon Business | 14 | Test for standardized products |
| YouTube | 14 | Long-term content investment |
| LinkedIn / Facebook / TikTok | ≤12 | Defer or skip |

Scoring scale 1–5; platform cost is reverse-scored (lower cost = higher score)
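The scoring rule can be illustrated with a short sketch. A 1–5 scale summing to 25 implies five criteria, but the post does not name them, so the criterion names below are hypothetical placeholders; only the reverse-scoring of platform cost comes from the text.

```python
# Hypothetical criterion names; the source only implies five criteria rated 1-5.
CRITERIA = (
    "inquiry_quality",
    "controllability",
    "scalability",
    "time_to_result",
    "platform_cost",
)
REVERSE_SCORED = {"platform_cost"}  # lower cost = higher score


def channel_score(ratings: dict[str, int]) -> int:
    """Sum five 1-5 ratings into a score out of 25, reversing platform cost."""
    total = 0
    for name in CRITERIA:
        value = ratings[name]
        if not 1 <= value <= 5:
            raise ValueError(f"{name} must be between 1 and 5, got {value}")
        total += (6 - value) if name in REVERSE_SCORED else value
    return total
```

With this rule, a channel rated 5 on every positive criterion and 1 on platform cost reaches the maximum of 25.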

Website Platform Selection​

| Criteria | Alibaba Cloud (Singapore) | Shopify Basic | WordPress + WooCommerce |
| --- | --- | --- | --- |
| Transaction fee | None | 2% (third-party payment) | None |
| Self-ownership | High | Low | High |
| B2B support | ✅ | Limited | ✅ |
| Server location | Singapore, covers SEA + Western markets | Shopify-hosted, uncontrollable | Flexible |

Recommended: Alibaba Cloud Singapore, for zero transaction fees, full ownership, B2B inquiry support, and a server location aligned with target markets.

Execution Path​

  1. Launch independent site (deployed on Alibaba Cloud Singapore)
  2. Start Google Ads with small test budget
  3. Identify high-performing regions based on inquiry data
  4. Concentrate budget on validated channels

Deliverables​

  • Three-channel comparison analysis
  • Website platform selection recommendation
  • 7-channel scoring matrix
  • Google Ads phased budget framework
