Given an array of distinct strings words, return the minimal possible abbreviations for every word.
The following are the rules for a string abbreviation:
1.
["abcdef","abndef"] both initially abbreviated as "a4f". Then, a sequence of operations would be ["a4f","a4f"] -> ["ab3f","ab3f"] -> ["abc2f","abn2f"].
Example 1:
Input: words = ["like","god","internal","me","internet","interval","intension","face","intrusion"] Output: ["l2e","god","internal","me","i6t","interval","inte4n","f2e","intr4n"]
Example 2:
Input: words = ["aa","aaa"] Output: ["aa","aaa"]
Constraints:
1 <= words.length <= 4002 <= words[i].length <= 400words[i] consists of lowercase English letters.words are unique.Problem Overview: Given a list of distinct words, generate the shortest possible abbreviation for each word such that no two abbreviations collide. An abbreviation keeps the first letter, last letter, and replaces the middle characters with their count. If multiple words produce the same abbreviation, you must increase the prefix length until every abbreviation becomes unique.
Approach 1: Pairwise Prefix Expansion (Brute Force) (Time: O(n2 * L), Space: O(n))
Start by generating the default abbreviation for every word using prefix length 1. Then compare every pair of words. When two abbreviations collide, increase the prefix length for both words and recompute their abbreviations. This process continues until all conflicts disappear. The key operation is repeatedly checking pairs and rebuilding abbreviations, which becomes expensive for large inputs. This approach is straightforward and demonstrates the rule behind abbreviation conflicts but scales poorly because every conflict may trigger multiple recomputations.
Approach 2: Sorting + Longest Common Prefix (Time: O(n log n * L), Space: O(n))
Sort words by their potential abbreviation group: same first letter, last letter, and length. Within each group, only nearby words in sorted order can collide. Compute the longest common prefix between adjacent words to determine the minimum prefix required to differentiate them. This reduces unnecessary comparisons across unrelated words. Sorting clusters similar words together, and prefix comparison determines how many characters must remain uncompressed. This approach improves performance significantly and uses standard sorting and string operations.
Approach 3: Grouped Trie (Optimal) (Time: O(n * L), Space: O(n * L))
Group words by (length, first character, last character) because only these can produce identical abbreviations. For each group, build a Trie where each node tracks how many words pass through it. While inserting characters, the first node with count 1 identifies the unique prefix length for that word. Once the unique prefix is known, build the abbreviation using that prefix plus the compressed middle count. The Trie efficiently detects the earliest divergence between words, eliminating repeated comparisons. Each character is processed once during insertion and lookup, producing near-linear complexity relative to total characters.
Recommended for interviews: The grouped Trie approach is the expected solution. It demonstrates control over array grouping, Trie design, and greedy prefix selection. Mentioning the brute-force idea shows you understand the conflict rule, but implementing the Trie-based grouping proves you can scale the solution efficiently.
We notice that if two words have the same abbreviation, their first and last letters must be the same, and their lengths must be the same. Therefore, we can group all words by length and last letter, and use a trie to store the information of each group of words.
The structure of each node in the trie is as follows:
children: An array of length 26, representing all child nodes of this node.cnt: The number of words passing through this node.For each word, we insert it into the trie and record the cnt value of each node.
When querying, we start from the root node. For the current letter, if the cnt value of its corresponding child node is 1, then we have found the unique abbreviation, and we return the length of the current prefix. Otherwise, we continue to traverse downwards. After the traversal, if we have not found a unique abbreviation, then we return the length of the original word. After getting the prefix lengths of all words, we check whether the abbreviation of the word is shorter than the original word. If it is, then we add it to the answer, otherwise we add the original word to the answer.
The time complexity is O(L), and the space complexity is O(L). Here, L is the sum of the lengths of all words.
Python
Java
C++
Go
TypeScript
| Approach | Time | Space | When to Use |
|---|---|---|---|
| Pairwise Prefix Expansion (Brute Force) | O(n^2 * L) | O(n) | Small input sizes or when demonstrating the basic conflict resolution logic |
| Sorting + Longest Common Prefix | O(n log n * L) | O(n) | When using standard string operations without building additional data structures |
| Grouped Trie (Optimal) | O(n * L) | O(n * L) | Best for large datasets and interview settings where efficient prefix detection is required |
LeetCode 527. Word Abbreviation • Happy Coding • 3,455 views views
Watch 5 more video solutions →Practice Word Abbreviation with our built-in code editor and test cases.
Practice on FleetCode