Given a string paragraph and a string array banned, return the most frequent word that is not banned. It is guaranteed there is at least one word that is not banned, and that the answer is unique.
The words in paragraph are case-insensitive and the answer should be returned in lowercase.
Example 1:
Input: paragraph = "Bob hit a ball, the hit BALL flew far after it was hit.", banned = ["hit"]
Output: "ball"
Explanation: "hit" occurs 3 times, but it is a banned word. "ball" occurs twice (and no other word does), so it is the most frequent non-banned word in the paragraph. Note that words in the paragraph are not case sensitive, that punctuation is ignored (even if adjacent to words, such as "ball,"), and that "hit" isn't the answer even though it occurs more because it is banned.
Example 2:
Input: paragraph = "a.", banned = []
Output: "a"
Constraints:
1 <= paragraph.length <= 1000
paragraph consists of English letters, space ' ', or one of the symbols: "!?',;.".
0 <= banned.length <= 100
1 <= banned[i].length <= 10
banned[i] consists of only lowercase English letters.

Problem Overview: You receive a paragraph string and a list of banned words. The task is to return the most frequent word that is not banned. Words are case-insensitive and punctuation should be ignored, so parsing and normalization are key parts of the solution.
Approach 1: Frequency Count with HashMap (O(n) time, O(n) space)
This approach treats the paragraph as a stream of characters and builds words while scanning. Convert characters to lowercase and ignore punctuation. Each completed word is checked against a banned set using constant-time lookup. If the word is not banned, update its frequency in a HashMap (or dictionary). While counting, track the word with the highest frequency so you avoid a second pass over the map.
The key insight is that counting frequencies during a single pass over the text avoids repeated scanning. A hash-based structure gives O(1) average lookup for both banned checks and frequency updates. Total complexity is O(n) time where n is the paragraph length, and O(n) space for storing unique words. This approach relies heavily on hash table lookups and simple string processing.
Approach 2: Advanced String Manipulation and Collection (O(n) time, O(n) space)
This variation focuses on preprocessing the paragraph using string utilities. Replace punctuation characters with spaces, convert the text to lowercase, and split the paragraph into tokens. After tokenization, iterate through the resulting list of words and maintain counts in a map while skipping banned entries stored in a set.
The advantage of this approach is cleaner implementation in languages that support powerful string operations and collections. Splitting the text converts the problem into a straightforward array traversal with frequency counting. Each word update and banned lookup still runs in O(1) average time using hash structures. The overall complexity remains O(n) time and O(n) space because every character and token is processed once.
Recommended for interviews: The HashMap frequency counting approach is what most interviewers expect. It demonstrates that you can normalize input, use a banned set for fast filtering, and maintain counts efficiently with a map. Starting with the straightforward counting idea shows problem understanding, while implementing it in a single pass with proper string handling shows practical engineering skill.
This approach involves using a hash map to count the frequency of each word in the paragraph after converting it to lowercase and removing punctuation. Then, the word with the highest count that is not in the banned list is selected as the result.
This C solution uses an array of structures to store word frequencies, since C lacks the built-in map types found in higher-level languages. It tokenizes the paragraph on spaces and punctuation, converts each token to lowercase, filters out banned words, and counts frequencies. The most frequent non-banned word is returned.
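The description above can be sketched as follows. This is a minimal illustration, not the page's original listing: the caps `MAX_WORDS` and `MAX_LEN` are assumed bounds chosen from the problem constraints, and lookups in the entry array are linear scans rather than hashed, a common simplification in C.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

#define MAX_WORDS 1000  /* assumed cap on distinct words */
#define MAX_LEN 32      /* assumed cap on word length */

typedef struct {
    char word[MAX_LEN];
    int count;
} Entry;

/* Returns 1 if `word` appears in the banned list (linear scan). */
static int isBanned(const char *word, char **banned, int bannedSize) {
    for (int i = 0; i < bannedSize; i++)
        if (strcmp(word, banned[i]) == 0)
            return 1;
    return 0;
}

char *mostCommonWord(const char *paragraph, char **banned, int bannedSize) {
    static Entry table[MAX_WORDS];
    int entries = 0, best = -1, bestCount = 0;
    char word[MAX_LEN];
    int len = 0;

    for (const char *p = paragraph; ; p++) {
        if (*p && isalpha((unsigned char)*p)) {
            /* Build the current word in lowercase. */
            if (len < MAX_LEN - 1)
                word[len++] = (char)tolower((unsigned char)*p);
        } else if (len > 0) {
            /* A non-letter (or end of string) terminates the word. */
            word[len] = '\0';
            len = 0;
            if (!isBanned(word, banned, bannedSize)) {
                int i;
                for (i = 0; i < entries; i++)
                    if (strcmp(table[i].word, word) == 0)
                        break;
                if (i == entries) {
                    strcpy(table[entries].word, word);
                    table[entries].count = 0;
                    entries++;
                }
                /* Track the running maximum to avoid a second pass. */
                if (++table[i].count > bestCount) {
                    bestCount = table[i].count;
                    best = i;
                }
            }
        }
        if (*p == '\0') break;
    }
    return table[best].word;
}
```

Because the table is scanned linearly, the worst case here is O(N * U) for U unique words; replacing the scan with a hash table recovers the O(N + M) bound stated below.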
Time Complexity: O(N + M), where N is the length of the paragraph and M is the number of banned words. Space Complexity: O(N) for storing word frequencies.
This approach leverages the string manipulation functions available in each language for efficient parsing and counting. The words are extracted, normalized, and counted using language-specific methods and libraries, yielding cleaner code.
In this C variation, we use qsort to order word entries by frequency after processing the paragraph. We tokenize the paragraph, convert tokens to lowercase, skip words found in the banned list, and store counts in a custom structure array. Sorting then places the most frequent non-banned word first.
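A sketch of that qsort-based variation is below. It is an assumed reconstruction rather than the page's original code: it normalizes the buffer in place (so the caller must pass a writable string), tokenizes with strtok, and the `Entry` type and size caps are illustrative choices.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

#define MAX_WORDS 1000  /* assumed cap on distinct words */
#define MAX_LEN 32      /* assumed cap on word length */

typedef struct {
    char word[MAX_LEN];
    int count;
} Entry;

/* Comparator for qsort: higher counts sort first. */
static int byCountDesc(const void *a, const void *b) {
    return ((const Entry *)b)->count - ((const Entry *)a)->count;
}

/* Note: modifies `paragraph` in place, so pass a writable buffer. */
char *mostCommonWordSorted(char *paragraph, char **banned, int bannedSize) {
    static Entry table[MAX_WORDS];
    int entries = 0;

    /* Lowercase letters; replace punctuation and spaces with spaces. */
    for (char *p = paragraph; *p; p++)
        *p = isalpha((unsigned char)*p) ? (char)tolower((unsigned char)*p) : ' ';

    /* Tokenize on spaces; skip banned words; count the rest. */
    for (char *tok = strtok(paragraph, " "); tok; tok = strtok(NULL, " ")) {
        int skip = 0;
        for (int i = 0; i < bannedSize && !skip; i++)
            if (strcmp(tok, banned[i]) == 0)
                skip = 1;
        if (skip)
            continue;
        int i;
        for (i = 0; i < entries; i++)
            if (strcmp(table[i].word, tok) == 0)
                break;
        if (i == entries) {
            strncpy(table[entries].word, tok, MAX_LEN - 1);
            table[entries].word[MAX_LEN - 1] = '\0';
            table[entries].count = 0;
            entries++;
        }
        table[i].count++;
    }

    /* Sort by frequency; the first entry is the answer. */
    qsort(table, (size_t)entries, sizeof(Entry), byCountDesc);
    return table[0].word;
}
```

The sort is what pushes this variant to O(N log N); it is convenient when you also want the full frequency ranking, not just the top word.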
Time Complexity: O(N log N) due to sorting, where N is total words extracted. Space Complexity: O(N).
| Approach | Complexity |
|---|---|
| Approach 1: Frequency Count with HashMap | Time Complexity: O(N + M), where N is the length of the paragraph and M is the number of banned words. Space Complexity: O(N) for storing word frequencies. |
| Approach 2: Advanced String Manipulation and Collection | Time Complexity: O(N log N) due to sorting, where N is total words extracted. Space Complexity: O(N). |
| Approach | Time | Space | When to Use |
|---|---|---|---|
| Frequency Count with HashMap | O(n) | O(n) | General case. Efficient single-pass solution commonly expected in coding interviews. |
| Advanced String Manipulation and Collection | O(n) | O(n) | Useful when the language provides convenient string replace and split utilities. |