AI-Powered Matching
How True Record uses machine learning to find duplicates that rule-based systems miss.
Overview
True Record combines AI vector embeddings with traditional matching rules to find duplicates. This hybrid approach catches both exact matches and fuzzy/semantic similarities.
Understands meaning, not just text
Finds matches in milliseconds
AI + rules for best accuracy
Matching Pipeline
Every scan runs through a multi-stage pipeline to find and score potential duplicates.
Vector Embeddings
Embeddings capture the semantic meaning of records, allowing us to find duplicates even when fields are formatted differently or contain typos.
Custom Field Selection
You can configure which fields are used for embedding in the Settings tab. Choose fields that uniquely identify records for best results.
K-NN Search
K-Nearest Neighbors (K-NN) search finds records with the most similar embeddings. We use approximate nearest neighbor (ANN) search for scalability.
Hybrid Matching
AI-only matching can surface false positives. We combine K-NN results with blocking rules for precision.
K-NN (Recall)
Casts a wide net using semantic similarity. Catches typos, abbreviations, and alternate formats.
Blocking Rules (Precision)
Filters candidates using exact-match or rule-based conditions (same domain, same phone, etc.).
Confidence Scoring
Each match receives a confidence score from 0-100% based on weighted field comparisons.
Review Carefully
Likely Match
Very High Confidence
Embedding Cache
Embeddings are expensive to generate. We cache them aggressively to minimize API costs and improve scan speed.
Configuration
AI matching settings can be tuned per object type.