Blog Open the app

How to Find and Merge Duplicate Companies in HubSpot (The Complete Cleanup Guide)

June 1, 2026 · 11 min read

You've been using HubSpot for two years. Your company count shows 8,500 records. And somewhere between 15% and 30% of those are duplicates.

"Acme Corp" and "Acme Corporation." "IBM" and "International Business Machines." "Johnson & Johnson" and "Johnson and Johnson Inc." Each looks fine on its own. Together, they're silently destroying your reporting, messing up your sequences, and making your sales team look unprepared when they reach out to a company that's already a customer under a different record.

HubSpot has a duplicate management tool. It helps — a little. But if you've ever used it, you've noticed the problem: it only catches exact or near-exact matches. "Acme Corp" and "Acme Corporation" don't show up. Neither do "FedEx" and "Federal Express." The fuzzy duplicates — the ones that actually cause problems — slip right through.

This guide walks through why HubSpot duplicates pile up, what HubSpot's native tools actually catch (and miss), and a faster process for finding all your duplicate companies — including the variations no CRM can detect on its own.

How Duplicate Companies Sneak Into HubSpot

Nobody creates duplicates on purpose. They appear through normal daily operations:

Over time, this accumulates. The typical HubSpot portal with 5,000+ companies has hundreds of duplicates hiding in plain sight.

Why Duplicate Companies Actually Cost You Revenue

This isn't just a data hygiene problem. Duplicate companies create real business damage:

Deals attached to the wrong record. Your biggest opportunity might be linked to a duplicate company record that nobody checks. When someone pulls the "Acme" account to prep for a renewal call, they see incomplete history — missing the $200K deal sitting under "Acme Corp."

Embarrassing outreach. Sales emails a "prospect" who's actually been a customer for two years — just under a different company record. This happens constantly with duplicates, and it destroys credibility.

Broken automation. Your workflow enrolls companies based on properties. When company data is split across duplicates, automations fire incorrectly — or don't fire at all.

Useless reporting. Your "top accounts by deal value" report is wrong if those accounts are fragmented across multiple records. You can't see true customer lifetime value when the data is scattered.

Wasted effort. Multiple reps work the same account without knowing it. Marketing nurtures contacts who should be in customer success sequences. The left hand doesn't know what the right hand is doing.

The math: If 20% of your company records are duplicates and your team spends just 2 minutes per day dealing with the confusion (looking up the wrong record, finding missing context, explaining to a customer why they got a sales email), that's over 8 hours per rep per year. For a 10-person team, that's 80+ hours — two full work weeks — lost to duplicate company chaos.

What HubSpot's Duplicate Management Tool Actually Does

HubSpot offers a built-in tool to find duplicates. Here's what it catches — and what it misses.

Where to find it: Go to CRM → Companies, click Actions, then Manage duplicates. HubSpot shows pairs of company records it suspects are duplicates.

What it catches:

What it misses:

This is the same limitation that makes VLOOKUP fail at matching company names — exact-match logic can't handle the way company names actually vary in the real world.

The result: HubSpot's tool catches maybe 20-30% of your actual duplicates. The rest — the fuzzy variations that humans recognize instantly but computers miss — stay hidden.

The Complete Cleanup Process

Here's how to find every duplicate company in HubSpot, not just the ones the native tool catches. The approach: export your company data, find duplicates with fuzzy matching, then merge them back inside HubSpot.

Step 1: Export your company list

Go to CRM → Companies. Click Export (top right). Select the properties you need:

Export as CSV. This gives you a single file with every company in your portal.

Step 2: Find duplicates with fuzzy matching

This is the step HubSpot can't do well. Upload your CSV to DedupFuzzy and select the company name column. It compares every name against every other name using algorithms that catch what exact matching misses:

In a few minutes, you get a list of potential duplicate pairs with confidence scores. High scores (90%+) are almost always real duplicates. Lower scores need a quick manual review. Export this list — it's your merge plan.

This same process works when you need to deduplicate a contact list before importing — catch the duplicates before they enter HubSpot instead of cleaning up afterward.

Step 3: Review and verify matches

Don't merge blindly. Sort your duplicate list by confidence score and work from the top down.

High-confidence matches (90%+): Spot-check a few. If they share the same domain, same address, or same contacts, they're definitely duplicates. These are usually safe to merge.

Medium-confidence matches (70-90%): Review each one. Some are real duplicates ("Salesforce" and "Salesforce.com Inc"). Some are different companies with similar names ("ABC Plumbing LLC" and "ABC Electric LLC"). A 30-second look at each pair separates them.

Lower-confidence matches (below 70%): Skim these quickly. Most will be false positives, but occasionally you'll find a real duplicate the algorithm almost missed.

Mark each pair: merge, not a duplicate, or needs investigation. For the "needs investigation" pile, open both records in HubSpot and compare contacts, deals, and activity history.

Step 4: Decide which record to keep

For each confirmed duplicate pair, pick a "winner" — the record that survives the merge. The other record's data will roll into it.

Good rules for choosing:

  1. More complete data wins. Keep the record with filled-in properties — address, industry, phone number, company domain.
  2. More associations win. The record with more contacts, deals, or activities usually has richer history worth preserving.
  3. Correct naming wins. If one record has the proper legal name and the other has a typo or abbreviation, keep the correct one.
  4. Older records often win. They've accumulated more history. But not always — sometimes the newer record is better maintained.

HubSpot merges by combining both records' associations and keeping the winner's property values (unless the winner's property is empty, in which case it pulls from the loser). Knowing this helps you pick wisely.

Step 5: Merge inside HubSpot

Now do the actual merges. HubSpot offers two ways:

One at a time (small cleanup): Open the company record you want to retire. Click Actions → Merge. Search for the record you want to keep. Select it, review the merge preview, and confirm. All contacts, deals, tickets, and activities move to the surviving record.

Bulk merge (Professional/Enterprise tiers): If you have HubSpot Professional or Enterprise, you can use the duplicate management tool to merge multiple pairs in one session. Go to Actions → Manage duplicates, review the pairs, and merge each one. This is faster than opening individual records, though you still confirm each merge.

Important: Merging is permanent. There's no undo button. Before starting a large cleanup, export your full company list as a backup. If something goes wrong, you'll at least have a record of the before-state.

Step 6: Clean up associated contacts

Once company duplicates are merged, check for contact duplicates too. The same people often exist under multiple records, especially if they were originally associated with duplicate companies.

Run the same export-and-fuzzy-match process on your contact list, using name and email as matching fields. HubSpot's contact duplicate tool is slightly better than the company version (emails help), but it still misses name variations like "Mike Smith" vs "Michael Smith" vs "M. Smith."

Preventing Future Duplicates

Cleanup is only worthwhile if duplicates don't immediately reappear. These practices keep your company data clean going forward:

Set up duplicate-catching workflows. Create a workflow that triggers when a new company is created. Use it to alert an admin if the company name is similar to an existing record. This requires some setup with custom properties and Operations Hub, but it catches duplicates at creation time.

Standardize data entry. Document naming conventions: Do you include "Inc" and "LLC"? How do you handle "The" at the start of names? Train your team and add these standards to your HubSpot onboarding.

Clean before importing. Every time you import data — tradeshow lists, purchased leads, migrated records — run it through fuzzy matching first. This is the advice in the CRM data cleaning guide: catch duplicates before they enter your system, not after.

Audit quarterly. Export your company list every quarter and run a quick duplicate check. Finding 20 new duplicates each quarter is a 15-minute task. Finding 500 duplicates once every two years is a week-long project.

Use company domains as identifiers. When possible, associate companies by domain rather than name. Two records with the same domain are obvious duplicates. Encourage your team to always fill in the Company Domain field.

Special Case: After a Data Migration

If you recently migrated to HubSpot from another CRM — Salesforce, Pipedrive, Zoho, or something else — your duplicate rate is probably worse than average. Migrations layer new data on top of whatever was already in HubSpot, and name formats often differ between systems.

For post-migration cleanup:

  1. Export companies and run duplicate detection immediately, before your team starts working in the polluted data.
  2. Pay special attention to records from the old system vs records that were already in HubSpot — this is where most duplicates hide.
  3. Check that contacts are associated with the correct (surviving) company records after merging.

The Salesforce duplicate cleanup guide has more detail on CRM-specific migration challenges.

What About Third-Party Deduplication Tools?

Several HubSpot integrations claim to handle deduplication. Most work the same way as HubSpot's native tool — they catch exact and near-exact matches but miss the fuzzy variations that cause the real problems.

The integration tools are useful for ongoing maintenance (they can auto-merge obvious duplicates as they're created), but they won't find the backlog of fuzzy duplicates already in your system. For that initial cleanup, you need actual fuzzy matching on your exported data.

After the cleanup, an integration tool can help prevent new duplicates from accumulating — but it's not a substitute for the initial deep clean.

The Bottom Line

Duplicate companies in HubSpot are nearly inevitable. Every form submission, import, and manual entry is a chance for a new variation of an existing company to slip in. HubSpot's duplicate management tool helps, but it only catches the obvious cases — the exact matches that anyone could find by sorting alphabetically.

The fuzzy duplicates — "IBM" vs "International Business Machines," "FedEx" vs "Federal Express" — are what actually cause problems. They hide in plain sight, fragmenting your customer data and making your CRM less useful every day.

Export your company list, run it through fuzzy matching to find every likely duplicate, review the results, then merge them back in HubSpot. A few hours of focused cleanup gives you accurate reporting, confident outreach, and a CRM your team can actually trust.

Think your HubSpot company list has duplicates hiding in it? Export your companies to CSV and run a quick check. DedupFuzzy finds duplicate company names — including abbreviations, legal suffix variations, and typos — in about 60 seconds. Free for 500 rows, no signup needed.

Try DedupFuzzy Free