Waterfall Enrichment
Cleanlist.ai uses an advanced waterfall enrichment system to maximize data accuracy and coverage. This technology automatically tries multiple data providers in sequence until we find the information you need.
How It Works
The Waterfall Process
Request Email Enrichment
↓
Try Provider 1 → Found? → Return Email ✓
↓ Not Found
Try Provider 2 → Found? → Return Email ✓
↓ Not Found
Try Provider 3 → Found? → Return Email ✓
↓ Not Found
Mark as Not Found (No charge)
Key Benefits
- Higher Success Rates: By trying multiple providers, we achieve significantly higher match rates than single-provider solutions
- Better Data Quality: We validate results from each provider and return the most accurate information
- Cost Efficiency: You only pay once, even though we query multiple providers on your behalf
- Automatic Fallback: No manual intervention needed - the system handles everything automatically
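To illustrate why trying several providers raises the match rate, here is a back-of-the-envelope calculation. It assumes each provider succeeds independently of the others, which is an idealization: real providers overlap in coverage, so actual gains are usually smaller. The rates below are hypothetical and are not Cleanlist figures.

```python
from math import prod


def combined_match_rate(provider_rates):
    """Chance that at least one provider finds a match.

    Assumes providers succeed independently (an idealization).
    """
    return 1 - prod(1 - rate for rate in provider_rates)


# Hypothetical per-provider match rates, for illustration only.
rates = [0.50, 0.40, 0.30]
print(f"{combined_match_rate(rates):.2f}")  # prints 0.79
```

Under these assumptions, three providers that each miss most leads on their own still cover roughly four out of five leads together.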
Enrichment Types
Email Enrichment (1 credit)
When you request an email address, our waterfall system:
- Checks our cached database first (instant results)
- Tries primary email enrichment providers
- Falls back to secondary providers if needed
- Validates the email format and deliverability
- Returns the best match found
Providers used (in order):
- Cached results (if available)
- Primary enrichment providers
- Alternative data sources
- Public data validation
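The email steps above can be sketched roughly as follows. This is a minimal illustration, not Cleanlist's implementation: the function name `waterfall_enrich`, the cache key, and the provider callables are all hypothetical, and the sketch returns the first valid result rather than ranking several candidates.

```python
from typing import Callable, Optional

# Hypothetical provider interface: takes a lead, returns an email or None.
Provider = Callable[[dict], Optional[str]]


def waterfall_enrich(
    lead: dict,
    providers: list[Provider],
    is_valid: Callable[[str], bool],
    cache: dict,
) -> Optional[str]:
    """Check the cache, then try each provider in order until one validates."""
    key = (lead.get("name"), lead.get("company"))  # assumed cache key
    if key in cache:
        return cache[key]
    for provider in providers:
        try:
            result = provider(lead)
        except Exception:
            continue  # treat a provider error as "not found" and fall through
        if result and is_valid(result):
            cache[key] = result
            return result
    return None  # every provider missed: "Not Found"


# Usage with stand-in providers.
providers = [lambda lead: None, lambda lead: "jane@example.com"]
email = waterfall_enrich(
    {"name": "Jane Doe", "company": "Example"},
    providers,
    is_valid=lambda e: "@" in e,
    cache={},
)
print(email)  # prints jane@example.com
```

Later providers are never called once an earlier one returns a valid result, which is what keeps the number of provider queries per lead low.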
Phone Enrichment (10 credits)
When you request a phone number, our waterfall system:
- Checks our cached database first
- Tries primary phone enrichment providers
- Falls back to secondary providers if needed
- Validates the phone number format
- Returns the best match found with metadata (type, carrier, etc.)
Providers used (in order):
- Cached results (if available)
- Primary phone data providers
- Alternative data sources
- Public records
Data Validation
Every piece of data goes through our validation pipeline:
Email Validation
- ✅ Format verification (RFC 5322 compliance)
- ✅ Domain validation (MX record checks)
- ✅ Disposable email detection
- ✅ Role-based email detection
- ✅ Deliverability scoring
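A few of the email checks above can be approximated offline, as in the sketch below. It is illustrative only: the regular expression is a simplified pattern that covers far less than the full RFC 5322 grammar, the role and disposable lists are tiny hypothetical samples, and MX lookups and deliverability scoring are left out because they require network access.

```python
import re

# Simplified pattern; the full RFC 5322 syntax is much broader than this.
EMAIL_RE = re.compile(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(\.[A-Za-z0-9-]+)+$")

# Illustrative samples only; a real list would be far longer.
ROLE_LOCAL_PARTS = {"info", "admin", "sales", "support", "contact"}
DISPOSABLE_DOMAINS = {"mailinator.com"}


def check_email(address: str) -> dict:
    """Run format, role-based, and disposable checks on one address."""
    if not EMAIL_RE.match(address):
        return {"valid_format": False, "role_based": False, "disposable": False}
    local, domain = address.rsplit("@", 1)
    return {
        "valid_format": True,
        "role_based": local.lower() in ROLE_LOCAL_PARTS,
        "disposable": domain.lower() in DISPOSABLE_DOMAINS,
    }


print(check_email("sales@example.com"))
```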
Phone Validation
- ✅ Format standardization (E.164)
- ✅ Country code verification
- ✅ Carrier identification
- ✅ Line type detection (mobile, landline, VoIP)
- ✅ Active status verification
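As an illustration of the format standardization step, the sketch below normalizes a raw number to E.164, which is a leading `+` followed by at most 15 digits with no leading zero. The function name `to_e164` and the default country code are assumptions made for the example. It does not handle national trunk prefixes (such as a leading `0`), so real normalization needs per-country numbering rules.

```python
import re
from typing import Optional

# E.164: "+" then up to 15 digits, the first of which is not zero.
E164_RE = re.compile(r"^\+[1-9]\d{1,14}$")


def to_e164(raw: str, default_country_code: str = "1") -> Optional[str]:
    """Strip punctuation and prepend a country code when none is given."""
    raw = raw.strip()
    digits = re.sub(r"\D", "", raw)
    if raw.startswith("+"):
        candidate = "+" + digits
    else:
        candidate = "+" + default_country_code + digits
    return candidate if E164_RE.match(candidate) else None


print(to_e164("(415) 555-0132"))    # prints +14155550132
print(to_e164("+44 20 7946 0958"))  # prints +442079460958
```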
Smart Caching
Our waterfall system includes intelligent caching:
- Recent Results: Previously found data is cached for faster retrieval
- Cost Savings: Cached results don't trigger new provider queries
- Data Freshness: Cache is refreshed periodically to maintain accuracy
- No Extra Cost: Cached results are charged at the standard credit price, with no surcharge
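One common way to keep cached data fresh is to expire entries after a fixed time-to-live. The sketch below shows that idea; the class name `TTLCache` and the expiry policy are assumptions for illustration, since this page does not describe how Cleanlist's cache is actually refreshed.

```python
import time


class TTLCache:
    """In-memory cache whose entries expire after ttl_seconds."""

    def __init__(self, ttl_seconds: float, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock  # injectable so expiry can be tested without sleeping
        self._store = {}

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, stored_at = entry
        if self.clock() - stored_at > self.ttl:
            del self._store[key]  # stale: drop it so the caller re-queries
            return None
        return value

    def set(self, key, value):
        self._store[key] = (value, self.clock())


cache = TTLCache(ttl_seconds=3600)
cache.set("jane doe|example", "jane@example.com")
print(cache.get("jane doe|example"))  # prints jane@example.com
```

A cache hit skips the provider queries entirely, and a stale entry behaves like a miss so the waterfall runs again.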
Success Metrics
Our waterfall approach achieves:
- Email Match Rate: 70-85% (varies by industry)
- Phone Match Rate: 40-60% (varies by region)
- Average Response Time: 2-5 seconds per lead
- Data Accuracy: 95%+ verified accuracy
When Enrichment Fails
If our waterfall system can't find the requested information after trying all providers:
- No Charge: You are NOT charged credits for failed enrichments
- Partial Results: You may get one field (email) even if another (phone) fails
- Retry Available: You can retry later as new data becomes available
- Status Tracking: The lead list shows which enrichments failed
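The charging rules above can be expressed as a small calculation. The credit prices come from this page (1 credit per email, 10 per phone); the function name and the result format are hypothetical, and the sketch assumes that a partial result is charged only for the field that was found.

```python
# Credit prices as stated on this page.
CREDIT_COST = {"email": 1, "phone": 10}


def credits_charged(results: dict) -> int:
    """Sum credits for fields that were found; a value of None means not found."""
    return sum(
        CREDIT_COST[field]
        for field, value in results.items()
        if value is not None
    )


# Partial result: the email was found, the phone was not.
print(credits_charged({"email": "jane@example.com", "phone": None}))  # prints 1
```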
Technical Details
Provider Selection
Our system intelligently selects which providers to use based on:
- Lead geography (different providers excel in different regions)
- Company size (enterprise vs. SMB data sources)
- Industry (specialized providers for certain sectors)
- Historical success rates (we track which providers work best)
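As a rough illustration of the last factor, providers could be ordered by their historical success rate for the lead's region. The data layout and the name `order_providers` are hypothetical; this page does not describe the actual selection logic, which also weighs company size and industry.

```python
def order_providers(stats: dict, region: str) -> list:
    """Return provider names sorted by historical success rate in a region.

    stats maps provider name -> {region: (successes, attempts)}.
    Providers with no history in the region are ranked last.
    """
    def rate(name):
        successes, attempts = stats[name].get(region, (0, 0))
        return successes / attempts if attempts else 0.0

    return sorted(stats, key=rate, reverse=True)


# Hypothetical tracking data.
stats = {
    "provider_a": {"EU": (30, 100), "US": (80, 100)},
    "provider_b": {"EU": (60, 100)},
}
print(order_providers(stats, "EU"))  # prints ['provider_b', 'provider_a']
```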
Response Time Optimization
- Providers are queried with optimized timeouts
- Parallel requests where possible
- Circuit breaker pattern to skip slow providers
- Automatic retry logic with exponential backoff
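The last two techniques are standard patterns and can be sketched in a few lines. The thresholds, delays, and names below are assumptions chosen for the example, not Cleanlist's settings, and the breaker uses a simplified reset instead of a full half-open state.

```python
import time


class CircuitBreaker:
    """Skip a provider after repeated failures, then retry after a cool-down."""

    def __init__(self, failure_threshold=3, reset_after=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_after = reset_after
        self.clock = clock  # injectable for testing
        self.failures = 0
        self.opened_at = None  # None means the breaker is closed

    def allow(self) -> bool:
        """Return True if the provider may be queried right now."""
        if self.opened_at is None:
            return True
        if self.clock() - self.opened_at >= self.reset_after:
            # Cool-down elapsed: close the breaker and start counting again
            # (a simplification of the usual half-open state).
            self.opened_at = None
            self.failures = 0
            return True
        return False

    def record(self, success: bool) -> None:
        """Record the outcome of one provider call."""
        if success:
            self.failures = 0
            return
        self.failures += 1
        if self.failures >= self.failure_threshold:
            self.opened_at = self.clock()


def backoff_delays(base=0.5, factor=2.0, retries=4):
    """Delays in seconds before each retry, growing exponentially."""
    return [base * factor ** attempt for attempt in range(retries)]


print(backoff_delays())  # prints [0.5, 1.0, 2.0, 4.0]
```

In a waterfall, a breaker per provider lets the system move straight to the next provider while a slow or failing one is cooling down.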
Comparison: Single Provider vs. Waterfall
| Metric | Single Provider | Cleanlist Waterfall |
|---|---|---|
| Email Match Rate | 45-60% | 70-85% |
| Phone Match Rate | 25-35% | 40-60% |
| Data Coverage | Limited | Comprehensive |
| Cost per Success | Higher | Lower |
| Speed | Fast | Optimized |
Best Practices
To get the most from our waterfall enrichment:
- Provide Accurate Input: The more accurate your input data (name, company, location), the better results you'll get
- Use Both Email & Phone: Enriching both fields together provides more context for matching
- Bulk Enrichment: Larger batches benefit from parallel processing
- Monitor Results: Review match rates to improve input data quality