Accounts Merge - Solution & Explanation

MediumArray Hash Table String Depth-First Search15 min readAsked at: Amazon, Microsoft, Apple +17

Problem Statement

Given a list of accounts where each element accounts[i] is a list of strings, where the first element accounts[i][0] is a name, and the rest of the elements are emails representing emails of the account.

Now, we would like to merge these accounts. Two accounts definitely belong to the same person if there is some common email to both accounts. Note that even if two accounts have the same name, they may belong to different people as people could have the same name. A person can have any number of accounts initially, but all of their accounts definitely have the same name.

After merging the accounts, return the accounts in the following format: the first element of each account is the name, and the rest of the elements are emails in sorted order. The accounts themselves can be returned in any order.

Example 1:

Input: accounts = [["John","johnsmith@mail.com","john_newyork@mail.com"],["John","johnsmith@mail.com","john00@mail.com"],["Mary","mary@mail.com"],["John","johnnybravo@mail.com"]]
Output: [["John","john00@mail.com","john_newyork@mail.com","johnsmith@mail.com"],["Mary","mary@mail.com"],["John","johnnybravo@mail.com"]]
Explanation:
The first and second John's are the same person as they have the common email "johnsmith@mail.com".
The third John and Mary are different people as none of their email addresses are used by other accounts.
We could return these lists in any order, for example the answer [['Mary', 'mary@mail.com'], ['John', 'johnnybravo@mail.com'], 
['John', 'john00@mail.com', 'john_newyork@mail.com', 'johnsmith@mail.com']] would still be accepted.

Example 2:

Input: accounts = [["Gabe","Gabe0@m.co","Gabe3@m.co","Gabe1@m.co"],["Kevin","Kevin3@m.co","Kevin5@m.co","Kevin0@m.co"],["Ethan","Ethan5@m.co","Ethan4@m.co","Ethan0@m.co"],["Hanzo","Hanzo3@m.co","Hanzo1@m.co","Hanzo0@m.co"],["Fern","Fern5@m.co","Fern1@m.co","Fern0@m.co"]]
Output: [["Ethan","Ethan0@m.co","Ethan4@m.co","Ethan5@m.co"],["Gabe","Gabe0@m.co","Gabe1@m.co","Gabe3@m.co"],["Hanzo","Hanzo0@m.co","Hanzo1@m.co","Hanzo3@m.co"],["Kevin","Kevin0@m.co","Kevin3@m.co","Kevin5@m.co"],["Fern","Fern0@m.co","Fern1@m.co","Fern5@m.co"]]

Constraints:

1 <= accounts.length <= 1000
2 <= accounts[i].length <= 10
1 <= accounts[i][j].length <= 30
accounts[i][0] consists of English letters.
accounts[i][j] (for j > 0) is a valid email.

Approach Overview

Problem Overview: You receive a list of user accounts where each entry contains a name followed by multiple email addresses. Two accounts belong to the same person if they share at least one common email. The task is to merge such accounts and return the consolidated list with unique emails sorted lexicographically.

Approach 1: Graph-Based Approach (DFS/BFS) (Time: O(E log E), Space: O(E))

Treat each email as a node in a graph. If two emails appear in the same account, connect them with an edge. This creates connected components where all emails in the same component belong to the same user. Build an adjacency list using a hash table, then run DFS or BFS from each unvisited email to collect all reachable emails in that component. Track the account name associated with each email during construction. After collecting a component, sort the emails before adding them to the result. The key insight is that shared emails create graph connectivity, so the merge problem becomes a connected-components problem.

Approach 2: Union-Find Approach (Disjoint Set) (Time: O(E log E), Space: O(E))

Use a Union-Find data structure to group emails belonging to the same person. Iterate through each account and union the first email with every other email in that account. Maintain a parent mapping for each email and apply path compression to keep operations nearly constant time. After processing all unions, group emails by their root parent using a hash map. Each group represents a merged account. Sort the emails in each group and prepend the corresponding user name. This approach avoids explicit graph traversal and is often cleaner for connectivity problems.

Recommended for interviews: The Union-Find approach is typically preferred because it models the merging operation directly and scales well when many accounts share emails. The graph approach demonstrates strong understanding of connected components using DFS or BFS, which interviewers also value. Showing the graph interpretation first and then optimizing with Union-Find demonstrates solid problem-solving depth.

Approach 1: Graph-Based Approach

In this approach, we treat each email as a node in a graph and create edges between nodes that appear in the same account. By doing so, connected components in the graph will represent emails belonging to the same person.

Once the graph is constructed, we perform a Depth-First Search (DFS) to enumerate all emails in each connected component, sort the collected emails, and then append the owner’s name to produce the final result.

The Python solution uses a dictionary to map emails to a graph structure. Each email is treated as a node, and accounts describe connections. We apply a depth-first search to explore connected components and collect emails of each component. Each collection is sorted and paired with the account owner’s name before appending to the result list.

Code

Python C++

Python

C++

Complexity

Time complexity is O(NK log NK), where N is the number of accounts, and K is the maximum number of emails in an account, due to sorting each component. Space complexity is O(NK) for storing the graph and visited nodes.

Try this approach in the editor →

Approach 2: Union-Find Approach

This approach uses a Union-Find data structure for efficient merging of accounts. Each email is identified with a parent, initially itself, and accounts are unified if they share an email. We then deduce separate email sets from parent-child relationships.

In the Java solution, Union-Find is used to connect and find associated emails. We first store each email with its own parent, then use the find operation to determine shared components. Finally, a mapping between connected components and their parent node helps in gathering and sorting emails accordingly.

Code

Java JavaScript

Java

JavaScript

Complexity

Time complexity is O(NK log NK) due to sorting each email list. Space complexity is O(NK) owing to email mappings and Union-Find arrangements.

Try this approach in the editor →

Approach 3: Union-Find + Hash Table

Based on the problem description, we can use a union-find data structure to merge accounts with the same email address. The specific steps are as follows:

First, we iterate through all the accounts. For the ith account, we iterate through all its email addresses. If an email address appears in the hash table d, we use the union-find to merge the account's index i with the previously appeared account's index; otherwise, we map this email address to the account's index i.

Next, we iterate through all the accounts again. For the ith account, we use the union-find to find its root node, and then add all the email addresses of that account to the hash table g, where the key is the root node, and the value is the account's email addresses.

The time complexity is O(n times log n), and the space complexity is O(n). Here, n is the number of accounts.

Code

Python Java C++Go TypeScript

Python

Java

C++

TypeScript

Try this approach in the editor →

Complexity Comparison

Approach	Complexity
Graph-Based Approach	Time complexity is O(NK log NK), where N is the number of accounts, and K is the maximum number of emails in an account, due to sorting each component. Space complexity is O(NK) for storing the graph and visited nodes.
Union-Find Approach	Time complexity is O(NK log NK) due to sorting each email list. Space complexity is O(NK) owing to email mappings and Union-Find arrangements.
Union-Find + Hash Table	—

Detailed Complexity Analysis

Approach	Time	Space	When to Use
Graph Traversal (DFS/BFS)	O(E log E)	O(E)	When modeling the problem as connected components in a graph and using DFS/BFS is more intuitive
Union-Find (Disjoint Set)	O(E log E)	O(E)	Best general solution for merging sets with shared identifiers like emails

Video Solution

G-50. Accounts Merge - DSU • take U forward • 191,102 views views

Watch 9 more video solutions →

Frequently Asked Questions

Is Accounts Merge easy or hard?

Accounts Merge is considered a Medium difficulty problem. The challenge comes from recognizing that shared emails create connectivity between accounts and choosing the right technique such as DFS graph traversal or the Union-Find data structure.

Accounts Merge Python/Java solution

Python solutions usually build an adjacency list with dictionaries and run DFS to collect connected emails, or implement Union-Find using dictionaries. Java solutions commonly use HashMap with a Disjoint Set class to union emails and group them by root parent before sorting.

How to solve Accounts Merge in O(n)?

Pure O(n) is difficult because the output requires emails to be sorted. The merging step using Union-Find or DFS can run in near O(E) time, but sorting the emails in each merged account results in an overall complexity of O(E log E).

What is the best approach for Accounts Merge?

The Union-Find (Disjoint Set) approach is generally the best solution. It efficiently groups emails that belong to the same person by unioning emails within the same account and later collecting them by root parent. With path compression, union and find operations are nearly constant time, and the overall complexity is dominated by sorting emails, giving O(E log E).

Is Accounts Merge asked at Google/Amazon/Meta?

Accounts Merge is a common interview-style problem involving graph connectivity and Union-Find. Variants of this problem have appeared in interviews at companies like Google, Amazon, and Meta because it tests understanding of disjoint sets, hashing, and graph traversal.

What data structure is used in Accounts Merge?

The most important data structures are hash maps and Union-Find (Disjoint Set). Hash maps map emails to owners or adjacency lists, while Union-Find efficiently groups connected emails. Graph representations with DFS or BFS are also widely used.

What is the time complexity of Accounts Merge?

The overall time complexity is O(E log E), where E is the total number of unique emails. Building connections using Union-Find or a graph traversal takes near linear time, while sorting the emails in each merged account contributes the log factor.

Ready to solve this problem?

Practice Accounts Merge with our built-in code editor and test cases.

Practice on FleetCode

Redundant Connection

Sentence Similarity

Problem Info

DifficultyMedium

Acceptance61.0%

Approaches3

Reading time15 min

Asked at

Amazon Microsoft Apple Meta Uber

Practice this problem

Open in Editor

Accounts Merge - Solution & Explanation

Problem Statement

Approach Overview

Approach 1: Graph-Based Approach

Code

Complexity

Approach 2: Union-Find Approach

Code

Complexity

Approach 3: Union-Find + Hash Table

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Ready to solve this problem?

Problem Info

Table of Contents

Accounts Merge - Solution & Explanation

Problem Statement

Approach Overview

Approach 1: Graph-Based Approach

Code

Complexity

Approach 2: Union-Find Approach

Code

Complexity

Approach 3: Union-Find + Hash Table

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Ready to solve this problem?

Problem Info

Table of Contents

Problem Statement

Approach Overview

Approach 1: Graph-Based Approach

Code

Complexity

Approach 2: Union-Find Approach

Code

Complexity

Approach 3: Union-Find + Hash Table

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Related Problems

Ready to solve this problem?

Problem Info

Table of Contents

Problem Statement

Approach Overview

Approach 1: Graph-Based Approach

Code

Complexity

Approach 2: Union-Find Approach

Code

Complexity

Approach 3: Union-Find + Hash Table

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Related Problems

Ready to solve this problem?

Problem Info

Table of Contents