#1178 Number of Valid Words for Each Puzzle - Solution

With respect to a given puzzle string, a word is valid if both the following conditions are satisfied:

word contains the first letter of puzzle.
For each letter in word, that letter is in puzzle.
- For example, if the puzzle is "abcdefg", then valid words are "faced", "cabbage", and "baggage", while
- invalid words are "beefed" (does not include 'a') and "based" (includes 's' which is not in the puzzle).

Return an array answer, where answer[i] is the number of words in the given word list words that is valid with respect to the puzzle puzzles[i].

Example 1:

Input: words = ["aaaa","asas","able","ability","actt","actor","access"], puzzles = ["aboveyz","abrodyz","abslute","absoryz","actresz","gaswxyz"]
Output: [1,1,3,2,4,0]
Explanation: 
1 valid word for "aboveyz" : "aaaa" 
1 valid word for "abrodyz" : "aaaa"
3 valid words for "abslute" : "aaaa", "asas", "able"
2 valid words for "absoryz" : "aaaa", "asas"
4 valid words for "actresz" : "aaaa", "asas", "actt", "access"
There are no valid words for "gaswxyz" cause none of the words in the list contains letter 'g'.

Example 2:

Input: words = ["apple","pleas","please"], puzzles = ["aelwxyz","aelpxyz","aelpsxy","saelpxy","xaelpsy"]
Output: [0,1,3,2,0]

Constraints:

1 <= words.length <= 10⁵
4 <= words[i].length <= 50
1 <= puzzles.length <= 10⁴
puzzles[i].length == 7
words[i] and puzzles[i] consist of lowercase English letters.
Each puzzles[i] does not contain repeated characters.

The key challenge in #1178 Number of Valid Words for Each Puzzle is efficiently checking which words satisfy each puzzle's constraints. A valid word must contain the puzzle’s first letter and use only characters present in the puzzle. A brute-force comparison of every word with every puzzle is too slow, so we need a smarter representation.

A common optimization is to encode each word as a bitmask using 26 bits (one for each letter). Store the frequency of these masks in a hash table. For each puzzle, also build a bitmask and enumerate all possible submasks of its letters that include the first character. Each valid submask corresponds to a set of letters that a word could use. Summing the stored frequencies for these submasks gives the count of valid words.

This approach significantly reduces comparisons because each puzzle has at most 2^6 relevant subsets after fixing the first letter. Some solutions also explore a Trie-based approach, but the bitmask + subset enumeration method is typically simpler and faster.

Approach	Time Complexity	Space Complexity
Bitmask + Subset Enumeration with Hash Map	O(W + P * 2^6)	O(W)
Trie-based Filtering	O(W * L + P * 2^6)	O(W * L)

Solutions (5)

Bitmask Approach

Utilize bitmasks to represent words and puzzles. This allows for quick inclusion checks (using bitwise AND) and processing of the large dataset efficiently.

We construct a bitmask for each word and store it in a dictionary with counts. For each puzzle, generate all possible submasks (the example uses a different method) and check if they are present in the preprocessed dictionary.

Time Complexity: O(W + P * 2⁷), where W is the number of words and P is the number of puzzles.
Space Complexity: O(W), due to storing the bitmask of each word.

1def find_num_of_valid_words(words, puzzles):
2    from collections import defaultdict
3    word_count = defaultdict(int)
4    def bitmask(word):
5        bm = 0
6        for char in set(word):
7            bm |= 1 << (ord(char) - ord('a'))
8        return bm
9
10    for word in words:
11        word_count[bitmask(word)] += 1
12
13    results = []
14
15    for puzzle in puzzles:
16        first = 1 << (ord(puzzle[0]) - ord('a'))
17        puzzle_mask = bitmask(puzzle)
18        count = 0
19        submask = puzzle_mask
20
21        while submask:
22            if submask & first:
23                count += word_count[submask]
24            submask = (submask - 1) & puzzle_mask
25
26        results.append(count)
27
28    return results
29
30# Example usage
31words = ["aaaa","asas","able","ability","actt","actor","access"]
32puzzles = ["aboveyz","abrodyz","abslute","absoryz","actresz","gaswxyz"]
33print(find_num_of_valid_words(words, puzzles))

Explanation

The solution involves the following steps:

Create a bitmask for each word that indicates which characters are included.
Store these bitmasks in a dictionary (or hashmap) with their occurrence counts.
For each puzzle, generate all possible bitmasks that include the first character and check against the stored word bitmasks.

Brute Force Approach

Loop through each puzzle and for each puzzle, check every word in the word list to see if it is valid. This approach is straightforward but inefficient for larger datasets.

Time Complexity: O(W * P * L), where W is the number of words, P is the number of puzzles, and L is the average length of the words.
Space Complexity: O(1), as no additional data structures are used that scale with input size.

1def find_num_of_valid_words(words, puzzles):
2    results = []
3

Bitmasking with Precomputation

Using a bitmask, we can represent the presence of each letter in a word or puzzle as a single integer. Each bit in this integer corresponds to a letter from 'a' to 'z'.

For each word, compute its bitmask and store it in a hash map with its frequency. When processing puzzles, derive all potential subsets of the puzzle's bitmask that contain the first letter of the puzzle. Use the precomputed hash map to count how many words have matching character sets.

Time Complexity: O(N + P * 2^L), where N is the number of words, P is the number of puzzles, and L is the max number of letters in the puzzle (L = 7), resulting in 2^L subsets.

Space Complexity: O(N) for storing bitmasks and their frequencies.

Python JavaScript

1def findNumOfValidWords(words, puzzles):

Direct Check with Early Stopping

This approach uses direct validation of each word against each puzzle by checking the conditions separately using set operations. Early stopping is applied if any character not in the puzzle is found or if the word doesn't contain the first puzzle character.

Time Complexity: O(P * N * L), where P is the number of puzzles, N is the number of words, and L is the average length of each word.

Space Complexity: O(1) additional space besides input storage.

1import java.util.*;
2
3public class Solution

1178. Number of Valid Words for Each Puzzle

Problem Statement

Approach

Complexity

Video Solution Available

Problem Hints

Solutions (5)

Bitmask Approach

Explanation

Brute Force Approach

Bitmasking with Precomputation

Direct Check with Early Stopping

Video Solutions

Valid Sudoku - Amazon Interview Question - Leetcode 36 - Python

Word Search - Backtracking - Leetcode 79 - Python

Letter Combinations of a Phone Number - Backtracking - Leetcode 17

Number of Valid Words for Each Puzzle | Bit Manipulation | Leetcode Hard Solutions

Maximum Score Words Formed By Letters - Leetcode 1255 - Python

花花酱 LeetCode 1178. Number of Valid Words for Each Puzzle - 刷题找工作 EP267

Number Of Valid Words For Each Puzzle | Leetcode 1178 | Live coding session 🔥🔥🔥

2047. Number of Valid Words in a Sentence | LEETCODE WEEKLY CONTEST 264 | CODE EXPLAINER

Number of Valid Words for Each Puzzle: Leetcode 1178

Dropbox Coding Interview Question | Leetcode 1178 | Number of Valid Words for Each Puzzle

Asked By Companies

Prepare for Interviews

Notes

Personal Notes

Similar Problems

Related Topics

Problem Stats

Practice on LeetCode

Frequently Asked Questions

Is Number of Valid Words for Each Puzzle asked in FAANG interviews?

What data structure is best for Number of Valid Words for Each Puzzle?

What is the optimal approach for Number of Valid Words for Each Puzzle?

Why is bit manipulation useful in Number of Valid Words for Each Puzzle?

Explanation

Explanation

Explanation