Skip to content

Official Smart Plugins site

Smart Plugins are independent third-party plugins for Obsidian. Smart Connections is the flagship plugin.

Review-first duplicate cleanup for Obsidian

Smart Dedupe Pro for Obsidian: review duplicate and near-duplicate notes before they become rework

Repeated notes and near-duplicate blocks create hidden work: you reread the same idea, rewrite it again, and send bloated context to AI. Smart Dedupe surfaces likely exact and semantic duplicate candidates so you can compare sources and decide what stays distinct, what gets merged manually, and what should stop taxing your attention.

Problem

Repeated material creates repeated reading, repeated decisions, conflicting drafts, and noisy AI context.

Turning point

Similarity creates a question. Review turns the candidate into a cleanup decision.

Outcome

Cleaner source material makes retrieval, context packs, and review easier to trust.

Candidate review Exact and semantic duplicate candidates Human-owned cleanup Context-noise reduction

Fastest proof state

One likely duplicate group has been reviewed and turned into a decision.

The goal is not a perfect vault on day one. The goal is one bounded scan, one candidate group, and one cleanup decision you can trust.

Start with one scope, not a whole-vault cleanup project

Dedupe works best when the first pass is small enough to finish. Choose one active project, one folder, one dense topic area, or one current note where repeated material already hurts.

  1. Step 1

    Pick a bounded scope

    Start where the cost is visible: a project folder, topic cluster, draft area, or dense note.

  2. Step 2

    Run a focused review

    Let Dedupe surface likely overlap. Keep the first review narrow enough that you can finish it.

  3. Step 3

    Compare one candidate group

    Read the previews and open the source notes when needed. Similarity is a signal to inspect, not a verdict.

  4. Step 4

    Decide what happens next

    Keep separate, merge manually, archive, or ignore based on what the notes actually do in your workflow.

First-use rule

Do not begin with the belief that every similar note must disappear. Similar notes can be useful neighbors. Dedupe helps you decide whether overlap is repeated work or useful distinction.

Use Smart Dedupe when repeated material creates a decision

Dedupe is not about making every note unique. It earns its place when repeated material creates rework, conflict, or context bloat.

You rewrote the same idea

Catch the older version before a fresh paragraph becomes another conflicting source.

Project notes drifted

Review overlap across project notes when wording changes but the reasoning is the same.

Context packs feel bloated

If AI context repeats the same point in several notes, clean the source material before delegating again.

Connections keeps surfacing near twins

Related notes may be useful neighbors. Dedupe helps review whether the overlap is repeated work.

What to do with duplicate candidates

A candidate group is not a command to delete. It is a review surface for deciding what each note or block should do next.

Exact duplicates are easiest to confirm

Exact or near-identical text can often be reviewed quickly, but it still deserves verification before changing trusted notes.

Semantic matches need judgment

Paraphrases, rewrites, and repeated reasoning can overlap in meaning without being interchangeable.

Review keeps cleanup human-owned

Compare the candidate, open the sources, and decide whether overlap is useful distinction or repeated work.

Decision

Keep separate

Use when similar notes serve different jobs, audiences, projects, or stages of thinking.

Decision

Merge manually

Use when one source should absorb the useful parts of another after review.

Decision

Archive

Use when stale repeated material should leave active work without being deleted on impulse.

Decision

Ignore

Use when the candidate is not worth cleanup or the distinction is already clear.

Where Smart Dedupe fits

Current note -> Connections. Question -> Lookup. Exact phrase -> Obsidian search. Landscape -> Graph. Reusable set -> Context. Repeated work -> Dedupe.

Cleaner source material makes AI context easier to review

Dedupe does not guarantee better model answers. It helps when repeated source material makes context bloated, contradictory, or harder for you to inspect before delegation.

Before delegation

Review repeated material before sending the same idea to AI in five slightly different notes.

After discovery

If Connections or Lookup keeps surfacing near twins, use Dedupe to decide whether they are repeated work or useful neighbors.

Back to Context

Once repeated sources are reviewed, rebuild the affected pack with Smart Context so the next prompt starts cleaner.

Core retrieval vs Pro cleanup

Core Smart Connections can surface related notes without Dedupe. Smart Dedupe is advanced cleanup for users who want to reduce repeated work and context noise after retrieval starts showing overlap.

Core path

Find and reuse related notes

  • Use Connections for note-first related material.
  • Use Lookup for question-first semantic retrieval.
  • Use Context when selected notes need to become AI-ready scope.
  • Core retrieval does not require cleanup first.

Pro workflow

Review repeated work and context noise

  • Review likely exact and semantic duplicate candidates when overlap becomes costly.
  • Compare candidates before editing trusted notes.
  • Keep useful neighbors when similarity is not sameness.
  • Return to Context with cleaner source material when AI inputs were repetitive.

Review-first trust boundaries

Cleanup workflows create data-loss anxiety. Smart Dedupe should feel like candidate discovery and review, not an automatic cleanup agent.

What stays under your control

  • Candidates are presented for review, not treated as final truth.
  • You decide whether similar notes are duplicates or useful neighbors.
  • Manual merging keeps trusted-note changes deliberate.
  • Archiving is safer than deleting when the decision is uncertain.

What not to assume

  • It does not mean every duplicate has been found.
  • It does not mean every similar note should be consolidated.
  • It does not guarantee better AI answers by itself.
  • It does not replace your judgment about what belongs in the trusted note.

FAQ

Quick answers for people deciding whether Dedupe is the right cleanup surface.

What is Smart Dedupe for Obsidian?

Smart Dedupe is a review-first cleanup workflow for finding likely exact and semantic duplicate candidates in an Obsidian vault. It helps you compare overlap before deciding whether to keep notes separate, merge manually, archive, or ignore the candidate.

Will Smart Dedupe delete my notes?

Smart Dedupe should be treated as candidate discovery and review, not automatic deletion. Any change to trusted notes should remain a deliberate user action.

Does a high similarity score mean two notes are duplicates?

Not always. A high score is a strong signal that two candidates overlap, but you still decide whether they are duplicate, overlapping, or meaningfully distinct.

How is Dedupe different from Connections?

Connections surfaces related notes while you write. Dedupe reviews repeated or near-repeated material when the next job is cleanup, consolidation, or reducing context noise.

When should I use Dedupe instead of Lookup or Obsidian search?

Use Lookup when you have a question and do not remember the exact words. Use Obsidian search for exact phrases, titles, tags, operators, or regex. Use Dedupe when similar material needs a cleanup decision.

Should I scan the whole vault first?

Usually no. Start with one current note, one project, or one dense topic area. Whole-vault scans can be useful later, but first value comes faster when one candidate group becomes one decision.

Can Dedupe improve AI output?

It can make source material easier to review by reducing repeated context, but it does not guarantee better model answers. Use Dedupe when repeated notes make your context pack bloated or harder to trust.

Do I need Dedupe for Smart Connections Core?

No. Core retrieval can be useful without cleanup. Dedupe is an advanced Pro workflow for users who want to reduce repeated work and context noise as their vault grows.

Where are the current duplicate detector docs?

The current detailed walkthrough lives in the Smart Duplicate Detector guide. Use that guide for current commands, settings, screenshots, and scan behavior.

Similarity is not sameness. Use Smart Dedupe to turn repeated material into a reviewable decision before it becomes more rework.