Given a string paragraph and a string array of the banned words banned, return the most frequent word that is not banned. It is guaranteed there is at least one word that is not banned, and that the answer is unique.
The words in paragraph are case-insensitive and the answer should be returned in lowercase.
Example 1:
Input: paragraph = "Bob hit a ball, the hit BALL flew far after it was hit.", banned = ["hit"]
Output: "ball"
Explanation: "hit" occurs 3 times, but it is a banned word. "ball" occurs twice (and no other word does), so it is the most frequent non-banned word in the paragraph. Note that words in the paragraph are not case sensitive, that punctuation is ignored (even if adjacent to words, such as "ball,"), and that "hit" isn't the answer even though it occurs more because it is banned.
Example 2:
Input: paragraph = "a.", banned = []
Output: "a"
Constraints:
1 <= paragraph.length <= 1000
paragraph consists of English letters, space ' ', or one of the symbols: "!?',;.".
0 <= banned.length <= 100
1 <= banned[i].length <= 10
banned[i] consists of only lowercase English letters.

For #819 Most Common Word, the goal is to identify the most frequent word in a paragraph that is not included in a banned list. The key idea is to normalize and count words efficiently.
Start by converting the paragraph to lowercase and removing punctuation so that words like "Ball," and "ball" are treated the same. Split the paragraph into tokens (words). Use a hash set to store banned words for constant-time lookup, and a hash map (dictionary) to count the frequency of each valid word.
While iterating through the words, skip any word present in the banned set. For the remaining words, increment their frequency in the hash map and track the word with the maximum count. This approach leverages efficient lookups and counting using hash-based data structures.
The algorithm processes each character or word once, making it highly efficient for large paragraphs.
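The steps above can be sketched in Python as follows (a minimal illustration; the function name `most_common_word` is ours, not from the original solution):

```python
import re
from collections import defaultdict

def most_common_word(paragraph, banned):
    banned_set = set(banned)          # hash set for O(1) banned lookups
    counts = defaultdict(int)         # hash map of word -> frequency
    best = ""
    # Normalize: lowercase everything, then treat each run of letters
    # as one word, which discards punctuation automatically.
    for word in re.findall(r"[a-z]+", paragraph.lower()):
        if word in banned_set:
            continue                  # skip banned words entirely
        counts[word] += 1
        if counts[word] > counts[best]:
            best = word               # track the running maximum
    return best

print(most_common_word(
    "Bob hit a ball, the hit BALL flew far after it was hit.", ["hit"]))  # ball
```

Because the maximum is tracked during the same pass that counts the words, no second scan or sort over the frequency map is needed.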
| Approach | Time Complexity | Space Complexity |
|---|---|---|
| Hash Map Counting with Banned Set | O(n) | O(k) |
Ashish Pratap Singh
This approach involves using a hash map to count the frequency of each word in the paragraph after converting it to lowercase and removing punctuation. Then, the word with the highest count that is not in the banned list is selected as the result.
Time Complexity: O(N + M), where N is the length of the paragraph and M is the number of banned words. Space Complexity: O(N) for storing word frequencies.
function mostCommonWord(paragraph, banned) {
  const bannedSet = new Set(banned);
  const words = paragraph.toLowerCase().match(/[a-z]+/g) || [];
  const counts = {};
  for (const w of words) if (!bannedSet.has(w)) counts[w] = (counts[w] || 0) + 1;
  return Object.keys(counts).reduce((a, b) => (counts[b] > counts[a] ? b : a));
}
This JavaScript solution uses regular expressions to find words in a paragraph and converts all words to lowercase. Using a Set for banned words, it counts the frequency of non-banned words with a plain object. The word with the maximum frequency is chosen as the result.
This approach leverages the string-manipulation and collection utilities built into the language for efficient parsing and counting: the words are extracted, normalized, and counted using language-specific library methods, which makes for cleaner code.
Time Complexity: O(N log N) due to sorting, where N is the total number of words extracted. Space Complexity: O(N).
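One way to realize this sorting-based variant in Python (an illustrative sketch; the function name `most_common_word_sorted` is ours):

```python
import re
from collections import Counter

def most_common_word_sorted(paragraph, banned):
    banned_set = set(banned)
    words = re.findall(r"[a-z]+", paragraph.lower())
    counts = Counter(w for w in words if w not in banned_set)
    # Sorting the (word, count) pairs by count is the O(N log N) step;
    # the first entry after a descending sort is the answer.
    return sorted(counts.items(), key=lambda kv: kv[1], reverse=True)[0][0]
```

The sort is unnecessary when only the single maximum is needed, which is why the hash-map approach with a running maximum is preferred asymptotically.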
Yes, variations of word-frequency and string parsing problems like Most Common Word are common in technical interviews. They test understanding of hash tables, string processing, and efficient counting techniques.
The optimal approach uses a hash map to count the frequency of each word and a hash set to store banned words for quick lookup. By normalizing the paragraph and iterating through each word once, we can track the most frequent valid word efficiently.
A combination of a hash map and a hash set works best. The hash set allows constant-time checks for banned words, while the hash map keeps track of word frequencies during traversal.
Punctuation should be removed or ignored during preprocessing. Typically, the paragraph is converted to lowercase and non-letter characters are replaced with spaces so that words can be split and counted correctly.
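As a concrete illustration of this preprocessing step (one common way to do it, not the only one):

```python
import re

paragraph = "Bob hit a ball, the hit BALL flew far after it was hit."
# Lowercase first, then replace every non-letter with a space so that
# punctuation adjacent to words (like "ball,") does not corrupt tokens.
normalized = re.sub(r"[^a-z]", " ", paragraph.lower())
words = normalized.split()  # split() collapses runs of spaces
print(words)
# ['bob', 'hit', 'a', 'ball', 'the', 'hit', 'ball', 'flew', 'far', 'after', 'it', 'was', 'hit']
```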
This Python approach leverages Counter and re.findall to parse and count the words succinctly, then selects the most frequent non-banned word using the max function with a key parameter.
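A sketch of that Counter-based version (the function name `most_common_word_counter` is ours):

```python
import re
from collections import Counter

def most_common_word_counter(paragraph, banned):
    banned_set = set(banned)
    # Count every word in one pass; banned words are filtered at selection time.
    counts = Counter(re.findall(r"[a-z]+", paragraph.lower()))
    # max with key=counts.get picks the highest-count non-banned word.
    return max((w for w in counts if w not in banned_set), key=counts.get)
```

The problem guarantees at least one non-banned word and a unique answer, so the generator passed to max is never empty and ties need not be broken.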