Given an array of strings words and an integer k, return the k most frequent strings.
Return the answer sorted by the frequency from highest to lowest. Sort the words with the same frequency by their lexicographical order.
Example 1:
Input: words = ["i","love","leetcode","i","love","coding"], k = 2
Output: ["i","love"]
Explanation: "i" and "love" are the two most frequent words. Note that "i" comes before "love" due to a lower alphabetical order.
Example 2:
Input: words = ["the","day","is","sunny","the","the","the","sunny","is","is"], k = 4
Output: ["the","is","sunny","day"]
Explanation: "the", "is", "sunny" and "day" are the four most frequent words, with occurrence counts of 4, 3, 2 and 1 respectively.
Constraints:
- 1 <= words.length <= 500
- 1 <= words[i].length <= 10
- words[i] consists of lowercase English letters.
- k is in the range [1, the number of unique words[i]].

Follow-up: Could you solve it in O(n log(k)) time and O(n) extra space?
To solve #692 Top K Frequent Words, the key idea is to first count how often each word appears and then efficiently retrieve the k most frequent ones. A common strategy is to use a hash table to store word frequencies, followed by a priority queue (heap) that orders words by frequency and lexicographical order when frequencies tie.
A min-heap of size k is often preferred because it keeps only the top candidates while iterating through the frequency map. The heap comparator typically prioritizes lower frequency first, and when frequencies match, the word with higher lexicographical order is removed earlier. After processing all words, the heap contains the desired results.
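The size-k heap idea can be sketched compactly in Python with `heapq.nsmallest`, which maintains an internal heap of at most k entries. The function name here is illustrative; the `(-count[w], w)` key makes "smallest" mean "most frequent, then alphabetically first":

```python
import heapq
from collections import Counter

def top_k_frequent_heap(words, k):
    count = Counter(words)
    # nsmallest keeps a heap of size k internally, matching the
    # O(n log k) bound: higher counts sort first via the negated
    # frequency, and ties fall back to alphabetical order.
    return heapq.nsmallest(k, count, key=lambda w: (-count[w], w))
```

For example, `top_k_frequent_heap(["i","love","leetcode","i","love","coding"], 2)` returns `["i", "love"]`.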
An alternative is bucket sort, where words are grouped by frequency and then sorted lexicographically within each bucket. This avoids per-element heap operations but requires sorting inside each bucket. The heap-based method runs in O(n log k) time with O(n) space.
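The bucket-sort variant can be sketched as follows (the function name is illustrative): since no word can appear more than `len(words)` times, buckets indexed by frequency can be walked from highest to lowest.

```python
from collections import Counter, defaultdict

def top_k_frequent_bucket(words, k):
    # Count frequencies, then group words by their count.
    count = Counter(words)
    buckets = defaultdict(list)
    for word, freq in count.items():
        buckets[freq].append(word)
    # Walk frequencies from highest to lowest; sort ties alphabetically.
    result = []
    for freq in range(len(words), 0, -1):
        for word in sorted(buckets.get(freq, [])):
            result.append(word)
            if len(result) == k:
                return result
    return result
```

The per-bucket `sorted` call is what makes the worst case O(n log n) rather than linear, as noted in the table below.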
| Approach | Time Complexity | Space Complexity |
|---|---|---|
| Hash Map + Min Heap | O(n log k) | O(n) |
| Bucket Sort + Lexicographical Sort | O(n log n) | O(n) |
Explanation:
To solve this problem, we can use a hash map (or dictionary) to count the frequency of each word in the given array. Once we have the frequencies, we can then create a list of words and sort them based on their frequency in descending order. An additional condition for sorting is that if two words have the same frequency, we sort them alphabetically.
This approach simplifies the solution since sorting (bound by the number of unique words) helps easily extract the top k frequent words.
Time Complexity: O(n log n), where n is the number of unique words. This arises from the sorting operation.
Space Complexity: O(n), used for storing unique words.
```python
from collections import Counter

def topKFrequent(words, k):
    count = Counter(words)
    candidates = list(count.keys())
    # Sort by descending frequency; ties break by ascending alphabetical order.
    candidates.sort(key=lambda w: (-count[w], w))
    return candidates[:k]
```

Here, Counter from the collections module is utilized to simplify frequency counting. We sort the words using a lambda function that prioritizes frequency and then lexicographical order. Finally, the top k words are selected as the result.
Explanation:
This method takes advantage of a min-heap to efficiently determine the top k frequent words. Instead of sorting the entire set of words, which incurs an O(n log n) cost, we can maintain a min-heap of size k. As we iterate through the frequency map, we push elements into the heap. If heap size exceeds k, we remove the smallest element (i.e., least frequent word or alphabetically greater if frequencies are tied).
This is guided by the Follow-up question, showcasing an alternative solution bounded by O(n log k) time complexity.
Time Complexity: O(n log k), optimizing by constraining the heap's growth beyond k elements.
Space Complexity: O(n + k), combining map and heap usage.
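The eviction scheme described above can also be sketched in Python before the C++ version below. Since `heapq` has no comparator parameter, a small wrapper class (names here are illustrative) inverts the word comparison so that, among equal frequencies, the lexicographically larger word sits at the top of the min-heap and is evicted first:

```python
import heapq
from collections import Counter

class Entry:
    """Heap entry ordered so that low-frequency words, and among ties the
    lexicographically larger word, are popped first."""
    def __init__(self, freq, word):
        self.freq, self.word = freq, word

    def __lt__(self, other):
        if self.freq == other.freq:
            return self.word > other.word  # larger word counts as "smaller"
        return self.freq < other.freq

def top_k_with_eviction(words, k):
    count = Counter(words)
    heap = []
    for word, freq in count.items():
        heapq.heappush(heap, Entry(freq, word))
        if len(heap) > k:
            heapq.heappop(heap)  # evict the weakest remaining candidate
    # Entries come off the heap weakest-first, so reverse at the end.
    result = []
    while heap:
        result.append(heapq.heappop(heap).word)
    return result[::-1]
```

Each of the n unique words triggers at most one push and one pop on a heap of size at most k + 1, giving the O(n log k) bound.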
```cpp
#include <vector>
#include <string>
#include <unordered_map>
#include <queue>
#include <algorithm>  // for std::reverse

class Solution {
public:
    // Min-heap ordering: lowest frequency on top; among equal frequencies,
    // the lexicographically larger word is on top so it is evicted first.
    struct Comp {
        bool operator()(const std::pair<int, std::string>& a, const std::pair<int, std::string>& b) const {
            if (a.first == b.first)
                return a.second < b.second;
            return a.first > b.first;
        }
    };

    std::vector<std::string> topKFrequent(std::vector<std::string>& words, int k) {
        std::unordered_map<std::string, int> count;
        for (const std::string& word : words) {
            count[word]++;
        }
        std::priority_queue<std::pair<int, std::string>, std::vector<std::pair<int, std::string>>, Comp> minHeap;
        for (const auto& [word, freq] : count) {
            minHeap.emplace(freq, word);
            if (minHeap.size() > static_cast<size_t>(k)) {
                minHeap.pop();
            }
        }
        // Words come off the heap least-frequent first; reverse for the final order.
        std::vector<std::string> result;
        while (!minHeap.empty()) {
            result.push_back(minHeap.top().second);
            minHeap.pop();
        }
        std::reverse(result.begin(), result.end());
        return result;
    }
};
```
Yes, variations of this problem are commonly asked in FAANG and other top tech interviews. It tests knowledge of hash maps, heaps, sorting logic, and handling custom comparators—skills frequently required in coding interviews.
The optimal approach typically uses a hash map to count word frequencies and a min-heap of size k to keep track of the top k frequent words. The heap maintains ordering by frequency and lexicographical order for ties. This results in an efficient O(n log k) time complexity.
A combination of a hash table and a priority queue (min-heap) works best. The hash table stores frequency counts, while the heap efficiently maintains the top k elements based on frequency and alphabetical order.
When two words have the same frequency, the problem requires returning the word with the smaller lexicographical order first. This means the comparison logic in sorting or heap operations must consider both frequency and alphabetical order.
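In Python this dual criterion falls out of tuple comparison: tuples compare element by element, so negating the count makes higher frequencies sort first while ties fall through to alphabetical order. A minimal illustration with made-up counts:

```python
# (-count, word) pairs: higher frequency sorts first, ties alphabetically.
pairs = [(-2, "love"), (-2, "i"), (-1, "coding")]
print(sorted(pairs))  # [(-2, 'i'), (-2, 'love'), (-1, 'coding')]
```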
In this C++ example, we define a custom comparator so the priority queue behaves as a min-heap ordered by frequency and, for ties, by lexicographical order. By limiting the heap to size k, we keep only the strongest candidates, producing the final list once weaker words have been evicted from the heap.