Extra Characters in a String - Solution & Explanation

Q: Is Extra Characters in a String easy or hard?

Extra Characters in a String is classified as a Medium problem. The challenge comes from recognizing the dynamic programming structure and efficiently checking dictionary matches within the string.

Q: Extra Characters in a String Python/Java solution

Both Python and Java implementations usually follow the same DP structure: store dictionary words in a set, iterate backward through the string, and compute the minimum extra characters for each position. The logic remains identical across Python, Java, C++, and JavaScript.

Q: How to solve Extra Characters in a String in O(n)?

A strict O(n) solution is generally not achievable because substring checks depend on possible word boundaries. The closest improvement uses a Trie combined with memoization to reduce redundant substring scanning, bringing the complexity closer to O(n * L).

Q: What is the best approach for Extra Characters in a String?

The most practical solution uses dynamic programming with a hash set for dictionary lookups. For each index in the string, compute the minimum extra characters for the remaining suffix. This approach runs in O(n²) time and O(n) space and is commonly expected in coding interviews.

Q: Is Extra Characters in a String asked at Google/Amazon/Meta?

Problems involving dictionary segmentation and dynamic programming on strings are common in interviews at companies like Amazon, Google, and Meta. Variants of this problem appear as word break or string segmentation questions that test DP and string processing skills.

Q: What data structure is used in Extra Characters in a String?

The solution typically uses a hash set for fast dictionary lookups and a dynamic programming array to store optimal results for each index. An alternative solution uses a Trie to efficiently match dictionary prefixes while traversing the string.

Q: What is the time complexity of Extra Characters in a String?

The standard dynamic programming solution runs in O(n²) time because every index may check multiple substring endings. Space complexity is O(n) for the DP array. A Trie-based optimization reduces substring checks and runs roughly in O(n * L), where L is the maximum word length in the dictionary.

MediumArray Hash Table String Dynamic Programming34 min readAsked at: Amazon, Meta, Google

Practice this problem

Problem Statement

You are given a 0-indexed string s and a dictionary of words dictionary. You have to break s into one or more non-overlapping substrings such that each substring is present in dictionary. There may be some extra characters in s which are not present in any of the substrings.

Return the minimum number of extra characters left over if you break up s optimally.

Example 1:

Input: s = "leetscode", dictionary = ["leet","code","leetcode"]
Output: 1
Explanation: We can break s in two substrings: "leet" from index 0 to 3 and "code" from index 5 to 8. There is only 1 unused character (at index 4), so we return 1.

Example 2:

Input: s = "sayhelloworld", dictionary = ["hello","world"]
Output: 3
Explanation: We can break s in two substrings: "hello" from index 3 to 7 and "world" from index 8 to 12. The characters at indices 0, 1, 2 are not used in any substring and thus are considered as extra characters. Hence, we return 3.

Constraints:

1 <= s.length <= 50
1 <= dictionary.length <= 50
1 <= dictionary[i].length <= 50
dictionary[i] and s consists of only lowercase English letters
dictionary contains distinct words

Approach Overview

Problem Overview: You are given a string s and a dictionary of valid words. The goal is to split the string into dictionary words so that the number of leftover characters (characters not part of any word) is minimized.

Approach 1: Dynamic Programming (O(n²) time, O(n) space)

This approach treats the problem as a prefix optimization task using dynamic programming. Create a DP array where dp[i] represents the minimum number of extra characters in the substring starting at index i. Iterate from the end of the string toward the beginning. At each position, assume the current character is extra (1 + dp[i+1]), then check every substring s[i:j]. If the substring exists in the dictionary (stored in a hash table for O(1) lookups), update dp[i] with dp[j]. This effectively tries all valid word breaks while minimizing leftover characters. Time complexity is O(n²) due to checking substrings, and space complexity is O(n) for the DP array.

Approach 2: Trie and Memoization (O(n * L) average time, O(n + totalDictChars) space)

This version improves substring checking using a Trie. Insert all dictionary words into the Trie, then traverse the string while simultaneously walking the Trie. From index i, explore characters forward as long as they match a Trie path. Every time you hit a word-ending node, recursively compute the result for the remaining suffix. Memoization caches results for each index so the same suffix is never recomputed. The key advantage is that substring checks become incremental character traversals instead of repeated string slicing. The time complexity is roughly O(n * L), where L is the maximum dictionary word length, while space complexity includes the memo array plus Trie storage.

Recommended for interviews: The dynamic programming solution is typically expected because it clearly demonstrates optimal substructure and efficient state transitions. It is straightforward to implement and easy to reason about during an interview. The Trie + memoization approach shows deeper knowledge of string optimization and can reduce redundant substring checks when the dictionary contains many overlapping prefixes.

Approach 1: Dynamic Programming Approach

In this approach, we use dynamic programming to solve the problem. We define a DP array where dp[i] represents the minimum number of extra characters from index 0 to i of the string s. Initially, we set dp[0] to 0 because there are no extra characters in an empty prefix. For each index i while iterating through the string s, we check every j from 0 to i-1 and see if substring s[j:i] exists in the dictionary. If yes, we update dp[i] to min(dp[i], dp[j]), otherwise, we set dp[i] to dp[i-1] + 1.

This C solution uses a dynamic array to store the minimum number of extra characters needed for each prefix of the string. For each end index i, we check all starting indices to see if a substring is in the dictionary and update the DP accordingly, minimizing extra characters added.

Code

C C++Java Python C#JavaScript

C++

Java

Python

JavaScript

Complexity

Time Complexity: O(N^2*M), where N is the length of the string s and M is the average length of words in the dictionary.

Space Complexity: O(N) because of the DP array.

Try this approach in the editor →

Approach 2: Trie and Memoization Approach

To solve the problem using a Trie and memoization, we first build a Trie from the dictionary. We then use a recursive function with memoization to attempt to decompose the string s into valid segments. For each position in the string, we check possible substrings against the Trie, saving calculated results to avoid redundant computations.

This C solution utilizes a Trie to store dictionary words for fast matching. It also uses memoization to recursively find the minimum extra characters. Each call attempts to split the string from the current index, using the Trie to validate substrings.

Code

C C++Java Python C#JavaScript

C++

Java

Python

JavaScript

Complexity

Time Complexity: O(N*M), with N being the string length and M the average dictionary word length due to Trie traversal.

Space Complexity: O(N + T), N is for the memo array and T is for Trie storage.

Try this approach in the editor →

Approach 3: Hash Table + Dynamic Programming

We can use a hash table ss to record all words in the dictionary, which allows us to quickly determine whether a string is in the dictionary.

Next, we define f[i] to represent the minimum number of extra characters in the first i characters of string s, initially f[0] = 0.

When i \ge 1, the ith character s[i - 1] can be an extra character, in which case f[i] = f[i - 1] + 1. If there exists an index j \in [0, i - 1] such that s[j..i) is in the hash table ss, then we can take s[j..i) as a word, in which case f[i] = f[j].

In summary, we can get the state transition equation:

$f[i] = min { f[i - 1] + 1, min_{j \in [0, i - 1]} f[j] }

where i \ge 1, and j \in [0, i - 1] and s[j..i) is in the hash table ss.

The final answer is f[n].

The time complexity is O(n^3 + L), and the space complexity is O(n + L). Here, n is the length of string s, and L$ is the sum of the lengths of all words in the dictionary.

Code

Python Java C++Go TypeScript Rust JavaScript

Python

Java

C++

TypeScript

Rust

JavaScript

Try this approach in the editor →

Approach 4: Trie + Dynamic Programming

We can use a trie to optimize the time complexity of Solution 1.

Specifically, we first insert each word in the dictionary into the trie root in reverse order, then we define f[i] to represent the minimum number of extra characters in the first i characters of string s, initially f[0] = 0.

When i \ge 1, the ith character s[i - 1] can be an extra character, in which case f[i] = f[i - 1] + 1. We can also enumerate the index j in reverse order in the range [0..i-1], and determine whether s[j..i) is in the trie root. If it exists, then we can take s[j..i) as a word, in which case f[i] = f[j].

The time complexity is O(n^2 + L), and the space complexity is O(n + L times |\Sigma|). Here, n is the length of string s, and L is the sum of the lengths of all words in the dictionary. Additionally, |\Sigma| is the size of the character set. In this problem, the character set is lowercase English letters, so |\Sigma| = 26.

Code

Python Java C++Go TypeScript

Python

Java

C++

TypeScript

Try this approach in the editor →

Complexity Comparison

Approach	Complexity
Dynamic Programming Approach	Time Complexity: O(N^2*M), where N is the length of the string s and M is the average length of words in the dictionary. Space Complexity: O(N) because of the DP array.
Trie and Memoization Approach	Time Complexity: O(N*M), with N being the string length and M the average dictionary word length due to Trie traversal. Space Complexity: O(N + T), N is for the memo array and T is for Trie storage.
Hash Table + Dynamic Programming	—
Trie + Dynamic Programming	—

Detailed Complexity Analysis

Approach	Time	Space	When to Use
Dynamic Programming with Hash Set	O(n²)	O(n)	General case. Simple implementation and common interview expectation.
Trie with Memoization	O(n * L)	O(n + dictionary size)	When dictionary has many overlapping prefixes and repeated substring checks become expensive.

Video Solution

Extra Characters in a String - Leetcode 2707 - Python • NeetCodeIO • 23,906 views views

Watch 9 more video solutions →

Frequently Asked Questions

Is Extra Characters in a String easy or hard?

Extra Characters in a String is classified as a Medium problem. The challenge comes from recognizing the dynamic programming structure and efficiently checking dictionary matches within the string.

Extra Characters in a String Python/Java solution

Both Python and Java implementations usually follow the same DP structure: store dictionary words in a set, iterate backward through the string, and compute the minimum extra characters for each position. The logic remains identical across Python, Java, C++, and JavaScript.

How to solve Extra Characters in a String in O(n)?

A strict O(n) solution is generally not achievable because substring checks depend on possible word boundaries. The closest improvement uses a Trie combined with memoization to reduce redundant substring scanning, bringing the complexity closer to O(n * L).

What is the best approach for Extra Characters in a String?

The most practical solution uses dynamic programming with a hash set for dictionary lookups. For each index in the string, compute the minimum extra characters for the remaining suffix. This approach runs in O(n²) time and O(n) space and is commonly expected in coding interviews.

Is Extra Characters in a String asked at Google/Amazon/Meta?

Problems involving dictionary segmentation and dynamic programming on strings are common in interviews at companies like Amazon, Google, and Meta. Variants of this problem appear as word break or string segmentation questions that test DP and string processing skills.

What data structure is used in Extra Characters in a String?

The solution typically uses a hash set for fast dictionary lookups and a dynamic programming array to store optimal results for each index. An alternative solution uses a Trie to efficiently match dictionary prefixes while traversing the string.

What is the time complexity of Extra Characters in a String?

The standard dynamic programming solution runs in O(n²) time because every index may check multiple substring endings. Space complexity is O(n) for the DP array. A Trie-based optimization reduces substring checks and runs roughly in O(n * L), where L is the maximum word length in the dictionary.

Ready to solve this problem?

Practice Extra Characters in a String with our built-in code editor and test cases.

Practice on FleetCode

Word Break

Problem Info

DifficultyMedium

Acceptance57.4%

Approaches4

Reading time34 min

Asked at

Amazon Meta Google

Practice this problem

Open in Editor

Extra Characters in a String - Solution & Explanation

Problem Statement

Approach Overview

Approach 1: Dynamic Programming Approach

Code

Complexity

Approach 2: Trie and Memoization Approach

Code

Complexity

Approach 3: Hash Table + Dynamic Programming

Code

Approach 4: Trie + Dynamic Programming

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Ready to solve this problem?

Problem Info

Table of Contents

Extra Characters in a String - Solution & Explanation

Problem Statement

Approach Overview

Approach 1: Dynamic Programming Approach

Code

Complexity

Approach 2: Trie and Memoization Approach

Code

Complexity

Approach 3: Hash Table + Dynamic Programming

Code

Approach 4: Trie + Dynamic Programming

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Ready to solve this problem?

Problem Info

Table of Contents

Problem Statement

Approach Overview

Approach 1: Dynamic Programming Approach

Code

Complexity

Approach 2: Trie and Memoization Approach

Code

Complexity

Approach 3: Hash Table + Dynamic Programming

Code

Approach 4: Trie + Dynamic Programming

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Related Problems

Ready to solve this problem?

Problem Info

Table of Contents

Problem Statement

Approach Overview

Approach 1: Dynamic Programming Approach

Code

Complexity

Approach 2: Trie and Memoization Approach

Code

Complexity

Approach 3: Hash Table + Dynamic Programming

Code

Approach 4: Trie + Dynamic Programming

Code

Complexity Comparison

Detailed Complexity Analysis

Video Solution

Frequently Asked Questions

Related Problems

Ready to solve this problem?

Problem Info

Table of Contents