Smart Dedupe FAQs
What is Smart Dedupe?
#Smart Dedupe surfaces likely duplicate and near-duplicate blocks, shows side-by-side previews, and helps you decide what to keep separate, merge manually, archive, or ignore.
Does 1.00 similarity mean an exact duplicate?
#Often yes, especially when the pair was caught by hash matching, but always confirm before deleting, merging, or archiving anything.
Can I cancel a scan?
#Yes. Full-vault scans can be cancelled and still return partial results. See Smart Dedupe.
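One way to picture a scan that can be cancelled and still return partial results is a loop that checks a cancel flag between comparisons. This is a minimal sketch, not Smart Dedupe's actual code: `scan`, `score`, and the `cancel` flag are hypothetical names chosen for illustration.

```python
import threading
from itertools import combinations


def scan(items, score, threshold, cancel):
    """Compare every pair; return (matches, completed).

    If `cancel` is set mid-scan, stop and return what was found so far.
    """
    matches = []
    for a, b in combinations(items, 2):
        if cancel.is_set():
            return matches, False  # partial results, scan not completed
        s = score(a, b)
        if s >= threshold:
            matches.append((a, b, s))
    return matches, True


# Demo: simulate the user cancelling after two comparisons.
items = ["n1", "n2", "n3", "n4"]
cancel = threading.Event()
calls = []


def score_then_cancel(a, b):
    calls.append((a, b))
    if len(calls) == 2:
        cancel.set()  # stands in for a click on "Cancel"
    return 1.0


partial, completed = scan(items, score_then_cancel, 0.90, cancel)
full, full_completed = scan(items, lambda a, b: 1.0, 0.90, threading.Event())
```

In this sketch the cancelled run keeps the two matches found before the flag was set, while the uncancelled run compares all six pairs.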
What is a good default threshold?
#0.90 is a good starting point for likely duplicates. Use a higher threshold for a safer first review and a lower threshold when hunting paraphrases. See Smart Dedupe.
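To see how the threshold changes the size of the review queue, here is a small sketch over made-up scores. The pair names, the scores, and the `review_queue` helper are all hypothetical; only the 0.90 default and the idea of a max-results cap come from this FAQ.

```python
# Hypothetical (block_a, block_b, similarity) results from a scan.
scored_pairs = [
    ("meeting-notes#1", "meeting-notes#7", 1.00),
    ("ideas#3", "ideas#9", 0.96),
    ("project-a#2", "archive#4", 0.91),
    ("journal#5", "journal#8", 0.84),
    ("reading#1", "ideas#2", 0.72),
]


def review_queue(pairs, threshold=0.90, max_results=50):
    """Keep pairs at or above the threshold, highest similarity first."""
    kept = [p for p in pairs if p[2] >= threshold]
    kept.sort(key=lambda p: p[2], reverse=True)
    return kept[:max_results]


# Stricter thresholds give a shorter, safer queue; looser ones admit paraphrases.
queue_sizes = {t: len(review_queue(scored_pairs, t)) for t in (0.95, 0.90, 0.80)}
print(queue_sizes)
```

With this sample data the queue grows from two pairs at 0.95 to three at 0.90 and four at 0.80, which is why a higher threshold suits a first pass.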
Will this delete my notes?
#No. Smart Dedupe is review-first and should not be treated as automatic deletion or automatic merging.
How does it know two blocks are duplicates?
#Smart Dedupe uses a fast exact-match pass when available, then semantic similarity over block embeddings. Treat results as likely matches and confirm manually.
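The two passes described above can be sketched roughly as follows. This is an illustration under assumptions, not Smart Dedupe's implementation: the whitespace and case normalization, the SHA-256 hash, the cosine measure, the toy three-dimensional embeddings, and every function name here are hypothetical.

```python
import hashlib
import math
from itertools import combinations


def content_hash(text):
    # Assumed normalization: collapse whitespace and case before hashing.
    normalized = " ".join(text.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_candidates(blocks, threshold=0.90):
    """blocks maps a block key to (text, embedding). Returns likely matches."""
    hashes = {key: content_hash(text) for key, (text, _) in blocks.items()}
    results = []
    for k1, k2 in combinations(blocks, 2):
        if hashes[k1] == hashes[k2]:
            # Pass 1: fast exact match on normalized text.
            results.append((k1, k2, 1.0, "hash"))
            continue
        # Pass 2: semantic similarity over embeddings.
        score = cosine(blocks[k1][1], blocks[k2][1])
        if score >= threshold:
            results.append((k1, k2, round(score, 4), "embedding"))
    return sorted(results, key=lambda r: r[2], reverse=True)


# Toy data; real embeddings have far more dimensions.
blocks = {
    "a.md#intro": ("Ship the release on Friday.", [1.0, 0.0, 0.0]),
    "b.md#plan": ("ship the  release on friday.", [1.0, 0.0, 0.0]),
    "c.md#note": ("We release on Friday.", [0.9, 0.3, 0.0]),
    "d.md#misc": ("Buy oat milk.", [0.0, 0.0, 1.0]),
}
candidates = find_candidates(blocks)
```

Here the first two blocks match on the hash pass, the third is flagged against both of them by embedding similarity, and the unrelated fourth block is never flagged. Every result is still only a candidate for manual review.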
What if two similar notes are both useful?
#Keep them separate. Good cleanup preserves nuance while reducing repeated decisions. Related notes are not automatically duplicates. See Smart Connections.
Is this just Connections?
#No. Connections surfaces related notes while you work. Smart Dedupe reviews repeated content so you can make cleanup decisions. See Smart Dedupe.
Will this improve AI output?
#Cleaner source material can make context packs easier to review and reduce bloat, but it does not guarantee better answers. Rebuild the relevant Smart Context bundle after cleanup.
Will scanning a large vault be slow?
#It can be. Full-vault scans are heavier than scoped scans, so start with a bounded scope: run current-note scans first, or use stricter thresholds and smaller max results. Check Smart Environment settings.
What scope should I start with?
#Start with one current note, one project folder, or one dense topic area where repeated work already hurts.